BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781145|ref|YP_003065558.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62] (350 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781145|ref|YP_003065558.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62] gi|254040822|gb|ACT57618.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62] Length = 350 Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust. Identities = 350/350 (100%), Positives = 350/350 (100%) Query: 1 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD Sbjct: 1 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60 Query: 61 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM Sbjct: 61 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120 Query: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE Sbjct: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180 Query: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM Sbjct: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240 Query: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS Sbjct: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY Sbjct: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 >gi|315122535|ref|YP_004063024.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495937|gb|ADR52536.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 637 Score = 612 bits (1579), Expect = e-173, Method: Compositional matrix adjust. Identities = 294/343 (85%), Positives = 317/343 (92%), Gaps = 1/343 (0%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +LI D +VEVLEH+ R+D E +HD+RIRRKYSQGKVCVDAV PDEFLIHPD+ DIEKSP Sbjct: 156 LLISDPEVEVLEHTQRKDREEIIHDIRIRRKYSQGKVCVDAVPPDEFLIHPDATDIEKSP 215 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYV 128 IVGRKLYLTRSDLISMGYDR+ IN L + SSQ EN+W+ K +SD ALEMIEYYELYV Sbjct: 216 IVGRKLYLTRSDLISMGYDRKYINQLQVASSQGNENSWQLSKYHHSDTALEMIEYYELYV 275 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIE 188 T+DYD DGIAELRRV+M GGTGKDNIL NEEW+ELPFTCLRA+RAPHCF+GESLA+SIIE Sbjct: 276 TLDYDNDGIAELRRVVMVGGTGKDNILVNEEWDELPFTCLRAIRAPHCFVGESLASSIIE 335 Query: 189 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGI 248 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI+DPESVLNPQFGKPIRV +GMDIRSVLGI Sbjct: 336 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIVDPESVLNPQFGKPIRVVSGMDIRSVLGI 395 Query: 249 HSVPMI-EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVEL 307 HSVPMI +KSFSMLHYLDQELVDRTGISDISSG SPEILQNMTATATSLIEQSGVGQVEL Sbjct: 396 HSVPMIADKSFSMLHYLDQELVDRTGISDISSGLSPEILQNMTATATSLIEQSGVGQVEL 455 Query: 308 IVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 IVRTLAQGLE LFRGLLRLIIQHQDKVRMVRLRDQW+SFDPR+ Sbjct: 456 IVRTLAQGLERLFRGLLRLIIQHQDKVRMVRLRDQWISFDPRH 498 >gi|150397041|ref|YP_001327508.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] gi|150028556|gb|ABR60673.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] Length = 683 Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust. Identities = 189/356 (53%), Positives = 251/356 (70%), Gaps = 15/356 (4%) Query: 10 LIKDSDVEVLEHSH---REDGGEKV-----HDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 L+ D +VEVLE S R + + V + ++IRR+ +G + AV +EFLIHPD+ Sbjct: 156 LVGDDEVEVLEQSQTVERMETPQGVVEQPSYSVKIRRRAERGTPRLAAVPLEEFLIHPDA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK------FPKNQYSD 115 + I SPI G + + RSDL++MG+DR+ I+ LP + ++ F Sbjct: 216 ISIADSPITGFAMRMRRSDLVAMGHDRDLIDGLPAAEAGGRDDEASTRRRDAFETKDAVP 275 Query: 116 KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPH 175 KALE ++YYELYV +D D DGIAELRR++ AGGT ++N+L NEEW+E+PF L R PH Sbjct: 276 KALEEVDYYELYVKVDADDDGIAELRRLVFAGGTSEENLLSNEEWDEVPFADLTVERRPH 335 Query: 176 CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIR 235 G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PE+VLNP+FG+PIR Sbjct: 336 QREGGSVTGDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPEAVLNPKFGQPIR 395 Query: 236 VAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT 294 V+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG +P+ LQNMTA AT Sbjct: 396 VSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGMAPDALQNMTARAT 455 Query: 295 SLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 +L+EQ+G+GQ EL+VRT AQGL +FRGLLRL+++HQD+ R VRLR QWV+FDPR+ Sbjct: 456 ALVEQAGIGQTELMVRTFAQGLRRVFRGLLRLVVKHQDRPRAVRLRGQWVTFDPRH 511 >gi|227822448|ref|YP_002826420.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] gi|227341449|gb|ACP25667.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] Length = 684 Score = 375 bits (962), Expect = e-102, Method: Compositional matrix adjust. Identities = 188/357 (52%), Positives = 249/357 (69%), Gaps = 16/357 (4%) Query: 10 LIKDSDVEVLEHSHREDGGEK--------VHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 LI D +VEV+E S + E + ++IRR+ +G + AV +EFLIHP++ Sbjct: 156 LIGDDEVEVVEQSRTTEKIETPQGMVEQPSYSVKIRRRLERGTPRLAAVPLEEFLIHPEA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQ-------YS 114 + I SPI G + RSDLI+ GYDR+ I LP + + + +F + + Sbjct: 216 ISIADSPIAGIATRMRRSDLIATGYDRDLIEGLPASTGDSGRDDEEFTRRRGVFEAKDAV 275 Query: 115 DKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 KALE ++YYELYV +D D DGIAELRR+++AGGTG++++L NEEW+E+PF L R P Sbjct: 276 PKALEEVDYYELYVKVDADDDGIAELRRLVLAGGTGEEHLLSNEEWDEVPFADLIIERRP 335 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 H G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PESVLNP+F +PI Sbjct: 336 HQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPESVLNPKFAQPI 395 Query: 235 RVAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATA 293 RV+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG +P+ L NMTA A Sbjct: 396 RVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGLAPDALTNMTARA 455 Query: 294 TSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 T+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR QWV+FDPR+ Sbjct: 456 TALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRGQWVTFDPRH 512 >gi|227822445|ref|YP_002826417.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] gi|227341446|gb|ACP25664.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] Length = 361 Score = 251 bits (642), Expect = 9e-65, Method: Compositional matrix adjust. Identities = 119/189 (62%), Positives = 147/189 (77%), Gaps = 1/189 (0%) Query: 163 LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDP 222 +PF L R PH G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +P Sbjct: 1 MPFADLIIERRPHQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANP 60 Query: 223 ESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGF 281 ESVLNP+FG+PIRV+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG Sbjct: 61 ESVLNPKFGQPIRVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGL 120 Query: 282 SPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 +P+ L NMTA AT+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR Sbjct: 121 APDALTNMTARATALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRG 180 Query: 342 QWVSFDPRY 350 QWV+FDPR+ Sbjct: 181 QWVTFDPRH 189 >gi|291334599|gb|ADD94249.1| hypothetical protein Daci_1943 [uncultured phage MedDCM-OCT-S04-C136] Length = 741 Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 101/327 (30%), Positives = 187/327 (57%), Gaps = 12/327 (3%) Query: 31 VHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRES 90 +++ +I+R G+V ++++ P+EFLI + IE + V K+ +TRSDL++MGY ++ Sbjct: 206 LYNCKIKRIKKTGRVKIESIPPEEFLIDRSAKTIEDADFVSHKVLMTRSDLVAMGYPQDE 265 Query: 91 INNLPI--ISSQNIENTWKFPK------NQYSDKALEMIEYYELYVTIDYDGDGIAELRR 142 ++ LP + N E T + + +D + E + YE YV DYD DGIAELR+ Sbjct: 266 VDELPKSDLDIYNDEETVRLADVDDYRISSSTDTSTEKVLVYESYVKYDYDEDGIAELRK 325 Query: 143 VIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLD 202 ++ AG G +IL N + +PF + + PH F G S++ + ++Q +K+ ++RQ LD Sbjct: 326 IVSAGADGH-HILSNMPCDSVPFVTITPIPMPHRFYGRSISELVEDVQLMKSTVMRQLLD 384 Query: 203 NLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLH 262 N+Y N + V +G +++ + +L + G +R + + + + + P+ +++F +L Sbjct: 385 NMYLTNNNRVAVMDG-MVNMDDLLTTRPGGIVRTKQPPN-QVMQPLQAQPISQQAFPLLS 442 Query: 263 YLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFR 321 YLD RTG+S + G SP+ L TAT + + Q + ELI R A+ G++ LF+ Sbjct: 443 YLDSVREGRTGVSKEAQGLSPDTLNAKTATGVNALMQQTQMRSELIARVFAETGVKDLFK 502 Query: 322 GLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + L++++QDK +++ + +Q++ P Sbjct: 503 KIFELMVKYQDKEKIIMMSNQYIPVRP 529 >gi|160897386|ref|YP_001562968.1| hypothetical protein Daci_1943 [Delftia acidovorans SPH-1] gi|160362970|gb|ABX34583.1| conserved hypothetical protein [Delftia acidovorans SPH-1] Length = 763 Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 110/330 (33%), Positives = 165/330 (50%), Gaps = 22/330 (6%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR-ESI 91 D+ +R G+V V+ V P+EFLI + IE + VG ++ T S+L SMGY + I Sbjct: 233 DVVCKRVKKGGRVRVENVPPEEFLISRKAKSIEDASFVGHRVARTISELKSMGYKNVDDI 292 Query: 92 NNLPIISSQNIEN----TW-------KFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 + +S N+E +W + Q D + I E Y+ DYDGDGIAEL Sbjct: 293 TSDDQAASLNMERIERLSWDDEMAYLQMDNVQSMDTSQRQIWVTECYLRCDYDGDGIAEL 352 Query: 141 RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 R+V+ AG + IL NE + PF + + PH F G S+A +E Q+I T+LLR Sbjct: 353 RKVVRAG----NQILENEVCDVAPFVSITPVPMPHKFFGLSVADLALEGQRINTILLRNQ 408 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV-AAGMDIRSVLGIHSVPMIEKSFS 259 LDN + + EG + + + +L + G +R+ +AGM R G + + + Sbjct: 409 LDNNNLEVNGRYFAVEGQV-NLDDLLTSRPGGVVRMKSAGMAGRLDQGAGNSGL---NLQ 464 Query: 260 MLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEIL 319 M+ Y+ D TG + + G + L N TAT + I +++LI R A G L Sbjct: 465 MMEYMKGFQEDSTGWTRYNQGSDGDSL-NQTATGVNQIVNRADMRLDLIARNYADGFREL 523 Query: 320 FRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 FR +L+L Q+Q MV+LR +WV PR Sbjct: 524 FRLMLKLCSQYQQTEDMVKLRGKWVPVSPR 553 >gi|291334641|gb|ADD94289.1| portal protein [uncultured phage MedDCM-OCT-S04-C64] Length = 755 Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 96/321 (29%), Positives = 167/321 (52%), Gaps = 16/321 (4%) Query: 43 GKVCVDAVSPDEFLI--HPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQ 100 G + ++ V P+EF I + S +E + + + S+L++MGYD E I +LP S Sbjct: 190 GSIRIEPVPPEEFGIARNARSPYVEDTNFCYHRTLKSFSELVAMGYDVELIRSLPFDESA 249 Query: 101 NIENTW--------KFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGG---T 149 E + P + S++++ E Y+ ID DGD IAEL RV +AGG + Sbjct: 250 MTEEELARRNKTDEEEPFDYVSEESMRNYFITECYIKIDRDGDDIAELLRVTLAGGNYTS 309 Query: 150 GKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 G +L EE + +PF + PH F G S+A +++Q+IK+VL RQ LDN Y N Sbjct: 310 GSSRLLGIEEVDHMPFATCSPILMPHKFYGLSIADITMDLQRIKSVLTRQMLDNTYLANN 369 Query: 210 PQTIVQEGSIIDPESVLNPQFGKPIRVAA-GMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 +T V + S ++ + +L + G +R G + + I P+ ++++M+ YLD Sbjct: 370 SRTAVND-SHVNLDDLLTSRPGGVVRYKGEGSASQYITPIPHNPLPNEAYTMMGYLDDVR 428 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLI 327 RTG+ D ++G L N+ +L + ++ELI R L + G + +FR + +L+ Sbjct: 429 RQRTGVGDETAGLGENSLSNVNTGVAALAFDAKRMKIELIARILGEVGFKDVFRLIHKLL 488 Query: 328 IQHQDKVRMVRLRDQWVSFDP 348 ++HQD+ ++ + + + +P Sbjct: 489 MKHQDRKMLLNVAGNFQAINP 509 >gi|167600438|ref|YP_001671938.1| portal protein [Pseudomonas phage LUZ24] gi|161168301|emb|CAP45466.1| portal protein [Pseudomonas phage LUZ24] Length = 706 Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 102/354 (28%), Positives = 183/354 (51%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D E+L S EDG + ++IR+ + ++ V + P+ FL+ + I+ + Sbjct: 163 ILADPDTEILAQSVDEDG---TYSIKIRKDKKKREIKVTCIKPENFLVDRLATCIDDARF 219 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPI------------ISSQNIENTWKFPKNQYSD-K 116 + + T SDL +G + ++ LP + N + T + N D + Sbjct: 220 LCHREKYTVSDLRLLGVPEDVLDELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 279 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G D I+ NE W+ PF L A R H Sbjct: 280 ANREVWASECYTLLDVDGDGISELRRILYVG----DYIISNEPWDSRPFADLNAYRIAHK 335 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G ++ E +L + +RV Sbjct: 336 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ-VNLEDLLTNEAAGIVRV 394 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ-NMTATATS 295 A + S++ + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 395 KA---MNSIMPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 451 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 452 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAINP 505 >gi|291334834|gb|ADD94474.1| hypothetical protein CLIBASIA_05245 [uncultured phage MedDCM-OCT-S06-C1041] Length = 265 Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 85/263 (32%), Positives = 138/263 (52%), Gaps = 14/263 (5%) Query: 84 MGYDRESINNLPIISSQNIENTW--------KFPKNQYSDKALEMIEYYELYVTIDYDGD 135 MGYD E I +LP S E + P + S++++ E Y+ ID DGD Sbjct: 1 MGYDVELIRSLPFDESAMTEEELARRNKTDEEEPFDYVSEESMRNYFITECYIKIDRDGD 60 Query: 136 GIAELRRVIMAGG---TGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKI 192 IAEL RV +AGG +G +L EE + +PF + PH F G S+A +++Q+I Sbjct: 61 DIAELLRVTLAGGNYTSGSSRLLGIEEVDHMPFATCSPILMPHKFYGLSIADITMDLQRI 120 Query: 193 KTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA-GMDIRSVLGIHSV 251 K+VL RQ LDN Y N +T V + S ++ + +L + G +R G + + I Sbjct: 121 KSVLTRQMLDNTYLANNSRTAVND-SHVNLDDLLTSRPGGVVRYKGEGSASQYITPIPHN 179 Query: 252 PMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT 311 P+ ++++M+ YLD RTG+ D ++G L N+ +L + ++ELI R Sbjct: 180 PLPNEAYTMMGYLDDVRRQRTGVGDETAGLGENSLSNVNTGVAALAFDAKRMKIELIARI 239 Query: 312 LAQ-GLEILFRGLLRLIIQHQDK 333 L + G + +FR + +L+++HQD+ Sbjct: 240 LGEVGFKDVFRLIHKLLMKHQDR 262 >gi|221199509|ref|ZP_03572553.1| putative portal protein [Burkholderia multivorans CGD2M] gi|221205589|ref|ZP_03578604.1| putative portal protein [Burkholderia multivorans CGD2] gi|221174427|gb|EEE06859.1| putative portal protein [Burkholderia multivorans CGD2] gi|221180794|gb|EEE13197.1| putative portal protein [Burkholderia multivorans CGD2M] Length = 807 Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 103/339 (30%), Positives = 166/339 (48%), Gaps = 32/339 (9%) Query: 30 KVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR- 88 ++H++ + R G V ++AV P++FL+ S I + ++ T SDL + GY+ Sbjct: 272 RLHNVVLTRSKKAGHVAIEAVMPEDFLVSARSRRI-RDGFCAHRVRKTLSDLKAEGYENV 330 Query: 89 ESINNLP----------IISSQNIEN--TWKFPKNQYSDKALEMIEYYELYVTIDYDGDG 136 E I++ P ++ QN +N + + D++ +E YE Y+ ID DGDG Sbjct: 331 ELIDSEPNAVAADLSELALARQNEQNRVVTNALDDGFGDESQREVELYECYLPIDVDGDG 390 Query: 137 IAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 I+E R++ AG IL NE + PF + + P IG S+A + IQ+IKT Sbjct: 391 ISEWRKITKAGNA----ILDNEVVDGPPFALVSPISIPGLLIGRSIADLAMPIQRIKTKF 446 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGI----HSVP 252 LR DN+ Q + + +G + + + N R G+ I+S I +P Sbjct: 447 LRGLDDNMQIQINGRVGLVDGKVNVNDWMDN-------RPGGGVRIKSADAIVPIKQGLP 499 Query: 253 MIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTL 312 I + +L Y+D +RTGI+ S G + L N TA I +V++I R Sbjct: 500 DIAGAMQLLQYVDAMSQERTGITKYSQGLDADTL-NHTADGIKRITARADLRVKMIARKF 558 Query: 313 AQ-GLEILFRGLLRLIIQHQDKVRMVRL-RDQWVSFDPR 349 A+ G+ LFR + +L++QHQDK + L + +WV DPR Sbjct: 559 AETGVTDLFRLIQKLLMQHQDKPMSIALSKGKWVDIDPR 597 >gi|27476052|ref|NP_775254.1| putative portal protein [Pseudomonas phage PaP3] gi|27414482|gb|AAL85568.1| ORF.04 [Pseudomonas phage PaP3] Length = 705 Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 101/354 (28%), Positives = 180/354 (50%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D +L S +DG + ++IR+ + ++ V V P+ FL+ + I+ + Sbjct: 162 ILSDPDTSILAQSVDDDG---TYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARF 218 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPI------------ISSQNIENTWKFPKNQYSD-K 116 + + T SDL +G + I LP + N + T + N D + Sbjct: 219 LCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 278 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G D I+ NE W+ PF L A R H Sbjct: 279 ANREVWASECYTLLDVDGDGISELRRILYVG----DYIISNEPWDCRPFADLNAYRIAHK 334 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G ++ E +L + +RV Sbjct: 335 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ-VNLEDLLTNEAAGIVRV 393 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ-NMTATATS 295 + + S+ + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 394 KS---MNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 450 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 451 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNP 504 >gi|148257059|ref|YP_001241644.1| hypothetical protein BBta_5791 [Bradyrhizobium sp. BTAi1] gi|146409232|gb|ABQ37738.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 557 Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 98/340 (28%), Positives = 163/340 (47%), Gaps = 30/340 (8%) Query: 31 VHDLRI--RRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYL-TRSDLISMGYD 87 HD+ I RK++Q +V V P+EF I + I ++ T + LI+ G+D Sbjct: 19 THDVTIVTTRKFAQARVM--GVPPEEFGIERGARSIRDCNYCFHEIVTKTEAQLIAEGFD 76 Query: 88 RESINNLP-IISSQNIENTWKFPKNQ-------YSDKALEMIEYYELYVTIDYDGDGIAE 139 I +L + +E + ++ ++ ++ E YV +DY+G+G Sbjct: 77 AAQIRSLGDYAGTTRVETLARDTVDEQSRASASAANSGTRLVRITEHYVRMDYEGEGRPC 136 Query: 140 LRRVIMAGGTG----KDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTV 195 L ++I G G KD C ++ +PF + H F G S+A ++ +Q+ KT Sbjct: 137 LYQIITGGDQGEILRKDGQDCITPFDAIPFAATTPVPMTHRFFGRSIADLVMPLQREKTA 196 Query: 196 LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP-----IRVAAGMDIRSVLGIHS 250 L R LDNLY N P+ V E + P ++ + +P + A G++ + V Sbjct: 197 LKRGALDNLYLHNNPRVEVAEANA-GPNTLDDLLVSRPGGVVRTKTAGGLNWQVV----- 250 Query: 251 VPMIEKS-FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIV 309 P I S + ML Y+D EL R+G+S + G LQN +ATA + + + +++LI Sbjct: 251 -PDITSSIYPMLQYIDAELESRSGLSKQAQGIDANALQNQSATAVAQVFSASQMRIKLIA 309 Query: 310 RTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 R +A+G+ +F L I +H + + VRLR+ WV DPR Sbjct: 310 RIMAEGVRDMFGLLHATIRKHGQQRQTVRLRNAWVQVDPR 349 >gi|294083946|ref|YP_003550703.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663518|gb|ADE38619.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 697 Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 94/359 (26%), Positives = 175/359 (48%), Gaps = 29/359 (8%) Query: 7 IHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEK 66 + ML+ DV++++ S +D G + + R SQ V ++ + P+E ++ +E+ Sbjct: 180 LDMLLAQDDVDLIDSS-TDDVGMVSGTIGVTRDTSQ--VVIETIPPEELIVEAQCKSLEE 236 Query: 67 SPIVGRKLYLTRSDLISMGYDRESINNL-----------PIISSQN--IENTWKFPKNQY 113 S + T S+L M D + ++++ P I +++ + F + Y Sbjct: 237 STFSAHRTRKTLSELREMYPDSDKLDDIGDHEDVEMETDPEILARHDGVSENRGFSSHGY 296 Query: 114 SDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCN-EEWNELPFTCLRAMR 172 D+ ++ YE Y+ +D +G GIA+L +V AG N+L + EE PF + Sbjct: 297 QDQVRHIL-CYEAYIMLDVEGSGIAKLHKVTKAG-----NVLLDIEEVKRRPFVTFCPLP 350 Query: 173 APHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 PH F G + A + Q +TVL R LD+ N P+ +V +G + +P +++ + G Sbjct: 351 IPHAFYGSNFAEKLCATQNARTVLTRSILDHAMITNNPRYMVVKGGLSNPRELIDNRVGG 410 Query: 233 PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEIL--QNMT 290 + V+ ++ + P+ F L LDQ+L D TG+S +S G + + + QN Sbjct: 411 LVNVSRP---DAISAMPQAPLNPFVFQTLQQLDQDLEDNTGVSRLSQGLNKDAISKQNSA 467 Query: 291 ATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 A L S Q +++ R AQ ++ LF + RL+++++D+ ++V + +V DPR Sbjct: 468 AMVEQLATMSQQRQ-KILARHFAQFVKSLFHEIYRLVVENEDQQKIVEISGAYVEVDPR 525 >gi|241760934|ref|ZP_04759023.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374553|gb|EER64014.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 729 Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 96/288 (33%), Positives = 146/288 (50%), Gaps = 28/288 (9%) Query: 75 YLTRSDLISMGYDRESINNLPIISSQNIENT-------WKFPK--NQYSDKALEMIEYYE 125 Y T SDLISMG+DR+ + +LP S ++ N+ W+ + SD+A + YE Sbjct: 237 YKTLSDLISMGFDRDIVESLP--SDKSFPNSDGRSDARWRDESFLSGSSDQANREVLLYE 294 Query: 126 LYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAAS 185 YV ID DGDGIAEL ++ KD +L EE +E PF H IG SLA Sbjct: 295 EYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEK 350 Query: 186 IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSV 245 +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G +R Sbjct: 351 VMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAIVRYR 404 Query: 246 LGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L++ Sbjct: 405 GGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQALLQAQ 463 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 G E + R AQ L LF+ L L+I D + +++ + + DP Sbjct: 464 GQQMEEYVARNFAQSLGRLFQKKLWLMIASGDPM-AIKVEGLYKTVDP 510 >gi|260753098|ref|YP_003225991.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552461|gb|ACV75407.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 729 Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 96/288 (33%), Positives = 146/288 (50%), Gaps = 28/288 (9%) Query: 75 YLTRSDLISMGYDRESINNLPIISSQNIENT-------WKFPK--NQYSDKALEMIEYYE 125 Y T SDLISMG+DR+ + +LP S ++ N+ W+ + SD+A + YE Sbjct: 237 YKTLSDLISMGFDRDIVESLP--SDKSFPNSDGRSDARWRDESFLSGSSDQANREVLLYE 294 Query: 126 LYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAAS 185 YV ID DGDGIAEL ++ KD +L EE +E PF H IG SLA Sbjct: 295 EYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEK 350 Query: 186 IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSV 245 +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G +R Sbjct: 351 VMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAIVRYR 404 Query: 246 LGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L++ Sbjct: 405 GGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQALLQAQ 463 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 G E + R AQ L LF+ L L+I D + +++ + + DP Sbjct: 464 GQQMEEYVARNFAQSLGRLFQKKLWLMIASGDPM-AIKVEGLYKTVDP 510 >gi|56551276|ref|YP_162115.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542850|gb|AAV89004.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] Length = 729 Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 96/288 (33%), Positives = 146/288 (50%), Gaps = 28/288 (9%) Query: 75 YLTRSDLISMGYDRESINNLPIISSQNIENT-------WKFPK--NQYSDKALEMIEYYE 125 Y T SDLISMG+DR+ + +LP S ++ N+ W+ + SD+A + YE Sbjct: 237 YKTLSDLISMGFDRDIVESLP--SDKSFPNSDGRSDARWRDESFLSGSSDQANREVLLYE 294 Query: 126 LYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAAS 185 YV ID DGDGIAEL ++ KD +L EE +E PF H IG SLA Sbjct: 295 EYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEK 350 Query: 186 IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSV 245 +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G +R Sbjct: 351 VMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAIVRYR 404 Query: 246 LGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L++ Sbjct: 405 GGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQALLQAQ 463 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 G E + R AQ L LF+ L L+I D + +++ + + DP Sbjct: 464 GQQMEEYVARNFAQSLGRLFQKKLWLMIASGDPM-AIKVEGLYKTVDP 510 >gi|227821703|ref|YP_002825673.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] gi|227340702|gb|ACP24920.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] Length = 348 Score = 115 bits (289), Expect = 7e-24, Method: Compositional matrix adjust. Identities = 77/165 (46%), Positives = 104/165 (63%), Gaps = 15/165 (9%) Query: 10 LIKDSDVEVLEH-SHRED-----GGEKV--HDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 L+ D DVEVLE S++E G + V +++RIRR G + AV +EFLIHPD+ Sbjct: 156 LVADDDVEVLEQESYQEQIDTPQGPQSVTLYNVRIRRTKEYGCTKLAAVPLEEFLIHPDA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKF-----PKNQYSD- 115 + I+ SPI G K L RSDL++MGYDRE ++ SS N E T +F P ++ + Sbjct: 216 MSIDDSPITGIKTRLRRSDLVAMGYDREKVDKFATASSSNEEETEEFARRREPFDEKDEI 275 Query: 116 -KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 KAL+ ++YYELYV ID D DGIAELRR+ AGG + N+L +EE Sbjct: 276 IKALQEVDYYELYVKIDVDDDGIAELRRMCFAGGLAEVNLLDDEE 320 >gi|288817860|ref|YP_003432207.1| putative portal protein [Hydrogenobacter thermophilus TK-6] gi|288787259|dbj|BAI69006.1| putative portal protein [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 97/345 (28%), Positives = 177/345 (51%), Gaps = 31/345 (8%) Query: 25 EDGGEKVHDLRIR-RKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVG-RKL----YLTR 78 +D G ++ + ++ + S+ + C++ V EF+ HP ++ ++ SP V RK+ YL R Sbjct: 172 QDLGNGIYRVALKISRLSKNQPCLENVPATEFIFHPSTLSVKDSPFVAHRKVVTVDYLKR 231 Query: 79 SDL--ISMGYDR--ESINNLPIISSQNIENTWKFPKNQYS------DKALEMIEYYELYV 128 + I D+ ES ++ + +Q + K P +Y+ D A + YE Y Sbjct: 232 KEKEGIYKNVDKVIESASSDDLRYTQMADYYLK-PYKKYAVSESDQDLARRKVLLYECYT 290 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEE--WNELPFTCLRAMRAPHCFIGESLAASI 186 D + DG+ L VI+ G + IL +E + PF L + P+ G+S A + Sbjct: 291 KYDINNDGL--LEDVIIT--VGNNTILRIQENIYGRPPFFVLAPILEPYQLWGKSFADVL 346 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP-IRVAAGMDIR-S 244 +IQ +KT L+ Q + N+ N + + + ++++ + ++N KP IR+ AG DIR + Sbjct: 347 KDIQDLKTALVNQIIVNVGMNNDYKIAIND-TLVNVQDIVN---DKPVIRMKAGADIRQA 402 Query: 245 VLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQ 304 ++ + + P+ SF+ L Y++ +RTGI+ + G L N TA+ S+I Q+ + Sbjct: 403 IMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDGRSL-NKTASGISMIMQAANQR 461 Query: 305 VELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +ELI R A+ G++ LF L+ L Q D+ ++RL ++ + P Sbjct: 462 LELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNKSLPIAP 506 >gi|308751459|gb|ADO44942.1| hypothetical protein Hydth_0542 [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 97/345 (28%), Positives = 177/345 (51%), Gaps = 31/345 (8%) Query: 25 EDGGEKVHDLRIR-RKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVG-RKL----YLTR 78 +D G ++ + ++ + S+ + C++ V EF+ HP ++ ++ SP V RK+ YL R Sbjct: 172 QDLGNGIYRVALKISRLSKNQPCLENVPATEFIFHPSTLSVKDSPFVAHRKVVTVDYLKR 231 Query: 79 SDL--ISMGYDR--ESINNLPIISSQNIENTWKFPKNQYS------DKALEMIEYYELYV 128 + I D+ ES ++ + +Q + K P +Y+ D A + YE Y Sbjct: 232 KEKEGIYKNVDKVIESASSDDLRYTQMADYYLK-PYKKYAVSESDQDLARRKVLLYECYT 290 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEE--WNELPFTCLRAMRAPHCFIGESLAASI 186 D + DG+ L VI+ G + IL +E + PF L + P+ G+S A + Sbjct: 291 KYDINNDGL--LEDVIIT--VGNNTILRIQENIYGRPPFFVLAPILEPYQLWGKSFADVL 346 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP-IRVAAGMDIR-S 244 +IQ +KT L+ Q + N+ N + + + ++++ + ++N KP IR+ AG DIR + Sbjct: 347 KDIQDLKTALVNQIIVNVGMNNDYKIAIND-TLVNVQDIVN---DKPVIRMKAGADIRQA 402 Query: 245 VLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQ 304 ++ + + P+ SF+ L Y++ +RTGI+ + G L N TA+ S+I Q+ + Sbjct: 403 IMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDGRSL-NKTASGISMIMQAANQR 461 Query: 305 VELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +ELI R A+ G++ LF L+ L Q D+ ++RL ++ + P Sbjct: 462 LELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNKSLPIAP 506 >gi|316934283|ref|YP_004109265.1| putative portal protein [Rhodopseudomonas palustris DX-1] gi|315601997|gb|ADU44532.1| putative portal protein [Rhodopseudomonas palustris DX-1] Length = 673 Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 80/314 (25%), Positives = 143/314 (45%), Gaps = 21/314 (6%) Query: 47 VDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTW 106 V+ V P+EF S + GRK TR++LIS GY R+ ++ + + S ++ Sbjct: 197 VEVVPPEEFYSDA-SKKRRQDGTRGRKTLKTRAELISEGYPRDKVSKVRVSSEIEFDSER 255 Query: 107 KFPKNQYSD--------KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + + +D L+ I +E ++ + GDG A L R++ A G ++ Sbjct: 256 QERDRETNDGIGSDAPQSELDQILVHETFIQLSLKGDGKASLYRIVHADG----HLFEMG 311 Query: 159 EWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGS 218 E + F +R PH G + + I+ Q +TV+ R LD+ N P+ V S Sbjct: 312 EVADDNFLDFVPLRRPHSQFGNNFSKRIVPTQNARTVITRSILDHAATVNNPRWTVLNNS 371 Query: 219 IIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS-FSMLHYLDQELVDRTGISDI 277 + +P+ +L+ + +R + R +GI P + + F +L L + TGIS + Sbjct: 372 LSNPKELLDAR----LRGVVNVKNRDAIGILPYPQLNNAVFPLLEMLKTNKEETTGISSL 427 Query: 278 SSGFSPEIL--QNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVR 335 S G + + + QN LI S Q ++I R A L LF +++I++Q + + Sbjct: 428 SQGLNKDAISSQNSQGMVNDLITVSQTRQ-KIIARNFAMFLHDLFLAARKVVIENQTRKK 486 Query: 336 MVRLRDQWVSFDPR 349 + + + + DPR Sbjct: 487 VWEFDNNFQNIDPR 500 >gi|167583563|ref|YP_001671753.1| portal protein [Enterobacteria phage phiEco32] gi|164375401|gb|ABY52809.1| portal protein [Enterobacteria phage phiEco32] Length = 747 Score = 65.9 bits (159), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 69/312 (22%), Positives = 145/312 (46%), Gaps = 26/312 (8%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESIN 92 D+++ + + +V V+ V ++ + + + ++ ++ DL++MG+ ++ I Sbjct: 186 DVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIE 245 Query: 93 --NLPIISSQNIENTWKFPKNQY-----------SDKALEMIEYYELYV-TIDYDGDGIA 138 N + ++T + + + ++ M+ YE Y+ T D + + Sbjct: 246 AFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRTGVLDKNKES 305 Query: 139 ELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLR 198 +L +VI AG ++IL EE +PF P F G+S+ +IQ ++T L+R Sbjct: 306 KLYQVIQAG----EHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVR 361 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSF 258 +DN+ N + G+ D S+L+ + G + + D + H++P + Sbjct: 362 GYIDNVNNANYGRYKALVGA-YDRRSLLDNRPGGVVEMER-QDAIDLFPYHNLP---QGI 416 Query: 259 SMLHYLDQELVD-RTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQ-G 315 L + +EL + RTG++ + G +P++ +N A AT L+ + ++ ++ R +A G Sbjct: 417 DGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMVCRNIAHNG 476 Query: 316 LEILFRGLLRLI 327 + L RG+ LI Sbjct: 477 MVELMRGIYSLI 488 >gi|307308935|ref|ZP_07588618.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] gi|306900569|gb|EFN31182.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] Length = 677 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 76/344 (22%), Positives = 151/344 (43%), Gaps = 40/344 (11%) Query: 27 GGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS-VDIE----KSPIVGRKLYLTRSDL 81 GG +V D++IR + + V V P++ ++ D+ D E ++ + G + ++RS L Sbjct: 184 GGVQVRDVKIRTVTRRSCINVFPVDPEDAVLSTDAQFDPETGGIRAKLQGHRKIMSRSVL 243 Query: 82 ISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKAL--EMIEYYELYVTIDYDGDGIAE 139 I +G+D+ +++ +P ++ + + K+ ++A +M+E Y +Y + D + Sbjct: 244 IDLGFDKATVDRIPGVNEKTDGIALERLKDVSGERAFDKDMVEVYTVYTRLKL--DTTSR 301 Query: 140 LRRVIMAGGTGKDNILCNEEWNEL-PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLR 198 R+ G + +L EE P+ G+ +A I E + + + R Sbjct: 302 HYRITFGGDSANPILLDYEETTRFYPYAAFVPYPLAGTLFGQGIADRIGEDHEKISKMER 361 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA---GMDI--RSVLGIHSVPM 253 D+L P T+V + + + N GK IR ++ G++ G + + Sbjct: 362 AVQDSLNMSVFPITVVDDDVSSI-DDLTNLHPGKVIRSSSPNGGINFVQHPFTGAQATGI 420 Query: 254 IEKSFSMLHYLDQELVDRTGISDISSGFSPEI-------LQNMTATATSLIEQSGVGQVE 306 IE+ L+Q+L D S+G P++ LQ TATA + +E Sbjct: 421 IER-------LEQKL-------DFSTGVGPQMMTLDASDLQRTTATAINQRSNQQQTLIE 466 Query: 307 LIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMV--RLRDQWVSFD 347 + R A+ G L + ++ L++Q D+ + + RL ++ D Sbjct: 467 TVSRFFAETGYRYLTKVIVDLLVQKPDESQELIGRLTGNFIPVD 510 >gi|282598927|ref|YP_003358477.1| N4 gp59-like protein [Pseudomonas phage LIT1] gi|259048687|emb|CAZ66336.1| N4 gp59-like protein [Pseudomonas phage LIT1] Length = 726 Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust. Identities = 55/234 (23%), Positives = 102/234 (43%), Gaps = 9/234 (3%) Query: 113 YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW---NELPFTCLR 169 + DK+ + + +E + D GDG+ I+A G I E +P+ + Sbjct: 314 FQDKSRKRLVVHEYWGYYDIHGDGVL---HPIVATWVGAVMIRMEENPFPDKRIPYVVVN 370 Query: 170 AMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQ 229 + GES A +I+ Q+I + R +D + Q V +G+ +D + Sbjct: 371 YIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGA-LDVTNRRRFD 429 Query: 230 FGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ-ELVDRTGISDISSGFSPEILQN 288 G+ G D R+ + +H+ P I +S + L Q E TG+ ++G S L + Sbjct: 430 RGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGD 489 Query: 289 MTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 TATA + + I+R L+ G+ + R ++ + + D V +VR+ ++ Sbjct: 490 -TATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNE 542 >gi|221271428|dbj|BAH15181.1| portal protein [Serratia phage KSP100] Length = 374 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 21/76 (27%), Positives = 44/76 (57%), Gaps = 2/76 (2%) Query: 259 SMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQG-L 316 S+L ++Q RTG++ I G SPE+ +N + AT ++ + ++ ++ R +AQ + Sbjct: 59 SILEQIEQAKERRTGVTRIGMGLSPEVFKNDNSFATVDMMMSAAQNRMRMVARNVAQNFM 118 Query: 317 EILFRGLLRLIIQHQD 332 LF + RL+ ++++ Sbjct: 119 TQLFLAIYRLLKENEN 134 >gi|319956914|ref|YP_004168177.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] gi|319419318|gb|ADV46428.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] Length = 561 Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust. Identities = 36/153 (23%), Positives = 73/153 (47%), Gaps = 7/153 (4%) Query: 179 GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA 238 G S +I +Q+ TV Q +D + + + + S ++ + +L+ + I V++ Sbjct: 269 GGSFIEPMIPLQEEYTVTRNQQIDAIAESLSKRFLATKTSGLNEKDLLSNR--TKISVSS 326 Query: 239 GMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIE 298 +++ + P I+ S + LD E+ + +GI+ + G + N TAT S++ Sbjct: 327 LNEVKEL----QAPRIDPSIFGIDRLDSEMQEVSGITKYNQGLNDPHNLNQTATGVSILT 382 Query: 299 QSGVGQVELIVRTLAQG-LEILFRGLLRLIIQH 330 + G + IVR L + E R ++RLI ++ Sbjct: 383 EEGNAVIADIVRALNESFFEPAIRRMVRLIYKY 415 >gi|285018477|ref|YP_003376188.1| 3-phosphoshikimate 1-carboxyvinyltransferase (5-enolpyruvylshikimate-3-phosphate synthase) (epsp synthase) (epsps) protein [Xanthomonas albilineans GPE PC73] gi|283473695|emb|CBA16198.1| probable 3-phosphoshikimate 1-carboxyvinyltransferase (5-enolpyruvylshikimate-3-phosphate synthase) (epsp synthase) (epsps) protein [Xanthomonas albilineans] Length = 437 Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust. Identities = 16/38 (42%), Positives = 23/38 (60%) Query: 16 VEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPD 53 +++EH+H E GGE V DLR+R +G + V PD Sbjct: 280 ADIVEHNHSEHGGEPVADLRVRYALLRGAKIPETVVPD 317 >gi|84995979|ref|XP_952711.1| hypothetical protein [Theileria annulata strain Ankara] gi|65303708|emb|CAI76085.1| hypothetical telomeric protein, conserved [Theileria annulata] Length = 605 Score = 37.0 bits (84), Expect = 4.6, Method: Compositional matrix adjust. Identities = 27/124 (21%), Positives = 57/124 (45%), Gaps = 9/124 (7%) Query: 92 NNLPIISSQNIENTWKFPKNQYSDKALEMI--EYYELYVTIDYDGDGIAELRRVIMAGGT 149 +N P+ S Q+ ++ K+PK + + +MI ++ + Y+ YDG G+ + + G Sbjct: 390 DNRPVWSFQSNDSENKYPKKVFFNSFTKMIIVDFSKFYMIYQYDGSGLYLISTDTLYGSL 449 Query: 150 GKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 D E +EL T + P +G + A +++ ++ +T+ W++ Sbjct: 450 ISDFKFIGENNSELAKTDYETVNYP---LGYAFACRFNKVKCVEFSFKNKTV----WKHN 502 Query: 210 PQTI 213 PQ + Sbjct: 503 PQKM 506 >gi|226526985|ref|YP_002791004.1| putative nucleotidyltransferase/DNA polymerase [Lactobacillus brevis] gi|226442577|dbj|BAH56447.1| putative nucleotidyltransferase/DNA polymerase [Lactobacillus brevis] Length = 433 Score = 37.0 bits (84), Expect = 5.0, Method: Compositional matrix adjust. Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 5/53 (9%) Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMI 121 ++G +LY ++ G DR + N PI+ SQ+I N+ PK+ S A+E + Sbjct: 235 VIGLQLYA-----VAWGVDRSQLKNKPIVKSQSIGNSQVLPKDYRSQAAIETV 282 Searching..................................................done Results from round 2 >gi|254781145|ref|YP_003065558.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62] gi|254040822|gb|ACT57618.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62] Length = 350 Score = 488 bits (1256), Expect = e-136, Method: Composition-based stats. Identities = 350/350 (100%), Positives = 350/350 (100%) Query: 1 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD Sbjct: 1 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60 Query: 61 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM Sbjct: 61 SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120 Query: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE Sbjct: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180 Query: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM Sbjct: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240 Query: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS Sbjct: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY Sbjct: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 >gi|315122535|ref|YP_004063024.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495937|gb|ADR52536.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 637 Score = 457 bits (1176), Expect = e-126, Method: Composition-based stats. Identities = 294/343 (85%), Positives = 316/343 (92%), Gaps = 1/343 (0%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +LI D +VEVLEH+ R+D E +HD+RIRRKYSQGKVCVDAV PDEFLIHPD+ DIEKSP Sbjct: 156 LLISDPEVEVLEHTQRKDREEIIHDIRIRRKYSQGKVCVDAVPPDEFLIHPDATDIEKSP 215 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYV 128 IVGRKLYLTRSDLISMGYDR+ IN L + SSQ EN+W+ K +SD ALEMIEYYELYV Sbjct: 216 IVGRKLYLTRSDLISMGYDRKYINQLQVASSQGNENSWQLSKYHHSDTALEMIEYYELYV 275 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIE 188 T+DYD DGIAELRRV+M GGTGKDNIL NEEW+ELPFTCLRA+RAPHCF+GESLA+SIIE Sbjct: 276 TLDYDNDGIAELRRVVMVGGTGKDNILVNEEWDELPFTCLRAIRAPHCFVGESLASSIIE 335 Query: 189 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGI 248 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI+DPESVLNPQFGKPIRV +GMDIRSVLGI Sbjct: 336 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIVDPESVLNPQFGKPIRVVSGMDIRSVLGI 395 Query: 249 HSVPMIE-KSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVEL 307 HSVPMI KSFSMLHYLDQELVDRTGISDISSG SPEILQNMTATATSLIEQSGVGQVEL Sbjct: 396 HSVPMIADKSFSMLHYLDQELVDRTGISDISSGLSPEILQNMTATATSLIEQSGVGQVEL 455 Query: 308 IVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 IVRTLAQGLE LFRGLLRLIIQHQDKVRMVRLRDQW+SFDPR+ Sbjct: 456 IVRTLAQGLERLFRGLLRLIIQHQDKVRMVRLRDQWISFDPRH 498 >gi|227822448|ref|YP_002826420.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] gi|227341449|gb|ACP25667.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] Length = 684 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 188/357 (52%), Positives = 248/357 (69%), Gaps = 16/357 (4%) Query: 10 LIKDSDVEVLEHSHREDGGEK--------VHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 LI D +VEV+E S + E + ++IRR+ +G + AV +EFLIHP++ Sbjct: 156 LIGDDEVEVVEQSRTTEKIETPQGMVEQPSYSVKIRRRLERGTPRLAAVPLEEFLIHPEA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKN-------QYS 114 + I SPI G + RSDLI+ GYDR+ I LP + + + +F + Sbjct: 216 ISIADSPIAGIATRMRRSDLIATGYDRDLIEGLPASTGDSGRDDEEFTRRRGVFEAKDAV 275 Query: 115 DKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 KALE ++YYELYV +D D DGIAELRR+++AGGTG++++L NEEW+E+PF L R P Sbjct: 276 PKALEEVDYYELYVKVDADDDGIAELRRLVLAGGTGEEHLLSNEEWDEVPFADLIIERRP 335 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 H G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PESVLNP+F +PI Sbjct: 336 HQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPESVLNPKFAQPI 395 Query: 235 RVAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATA 293 RV+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG +P+ L NMTA A Sbjct: 396 RVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGLAPDALTNMTARA 455 Query: 294 TSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 T+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR QWV+FDPR+ Sbjct: 456 TALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRGQWVTFDPRH 512 >gi|150397041|ref|YP_001327508.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] gi|150028556|gb|ABR60673.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] Length = 683 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 189/356 (53%), Positives = 251/356 (70%), Gaps = 15/356 (4%) Query: 10 LIKDSDVEVLEHSH---REDGGEKV-----HDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 L+ D +VEVLE S R + + V + ++IRR+ +G + AV +EFLIHPD+ Sbjct: 156 LVGDDEVEVLEQSQTVERMETPQGVVEQPSYSVKIRRRAERGTPRLAAVPLEEFLIHPDA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK------FPKNQYSD 115 + I SPI G + + RSDL++MG+DR+ I+ LP + ++ F Sbjct: 216 ISIADSPITGFAMRMRRSDLVAMGHDRDLIDGLPAAEAGGRDDEASTRRRDAFETKDAVP 275 Query: 116 KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPH 175 KALE ++YYELYV +D D DGIAELRR++ AGGT ++N+L NEEW+E+PF L R PH Sbjct: 276 KALEEVDYYELYVKVDADDDGIAELRRLVFAGGTSEENLLSNEEWDEVPFADLTVERRPH 335 Query: 176 CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIR 235 G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PE+VLNP+FG+PIR Sbjct: 336 QREGGSVTGDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPEAVLNPKFGQPIR 395 Query: 236 VAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT 294 V+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG +P+ LQNMTA AT Sbjct: 396 VSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGMAPDALQNMTARAT 455 Query: 295 SLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 +L+EQ+G+GQ EL+VRT AQGL +FRGLLRL+++HQD+ R VRLR QWV+FDPR+ Sbjct: 456 ALVEQAGIGQTELMVRTFAQGLRRVFRGLLRLVVKHQDRPRAVRLRGQWVTFDPRH 511 >gi|291334599|gb|ADD94249.1| hypothetical protein Daci_1943 [uncultured phage MedDCM-OCT-S04-C136] Length = 741 Score = 377 bits (969), Expect = e-102, Method: Composition-based stats. Identities = 103/348 (29%), Positives = 196/348 (56%), Gaps = 12/348 (3%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++K +++ ++ S + +++ +I+R G+V ++++ P+EFLI + IE + Sbjct: 185 VLKQYEMQGVDISQVQVPNFNLYNCKIKRIKKTGRVKIESIPPEEFLIDRSAKTIEDADF 244 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIISSQ--NIENTWKFPK------NQYSDKALEMI 121 V K+ +TRSDL++MGY ++ ++ LP N E T + + +D + E + Sbjct: 245 VSHKVLMTRSDLVAMGYPQDEVDELPKSDLDIYNDEETVRLADVDDYRISSSTDTSTEKV 304 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 YE YV DYD DGIAELR+++ AG G +IL N + +PF + + PH F G S Sbjct: 305 LVYESYVKYDYDEDGIAELRKIVSAGADGH-HILSNMPCDSVPFVTITPIPMPHRFYGRS 363 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD 241 ++ + ++Q +K+ ++RQ LDN+Y N + V +G +++ + +L + G +R + Sbjct: 364 ISELVEDVQLMKSTVMRQLLDNMYLTNNNRVAVMDG-MVNMDDLLTTRPGGIVRTKQPPN 422 Query: 242 IRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSG 301 + + + + P+ +++F +L YLD RTG+S + G SP+ L TAT + + Q Sbjct: 423 -QVMQPLQAQPISQQAFPLLSYLDSVREGRTGVSKEAQGLSPDTLNAKTATGVNALMQQT 481 Query: 302 VGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + ELI R A+ G++ LF+ + L++++QDK +++ + +Q++ P Sbjct: 482 QMRSELIARVFAETGVKDLFKKIFELMVKYQDKEKIIMMSNQYIPVRP 529 >gi|291334641|gb|ADD94289.1| portal protein [uncultured phage MedDCM-OCT-S04-C64] Length = 755 Score = 361 bits (927), Expect = 7e-98, Method: Composition-based stats. Identities = 102/356 (28%), Positives = 182/356 (51%), Gaps = 20/356 (5%) Query: 10 LIKDSDV--EVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD--IE 65 L+ D +V E++E S + E ++ + G + ++ V P+EF I ++ +E Sbjct: 157 LLSDPNVQRELIEDSIEQT--EFGLNVEFKVIEKMGSIRIEPVPPEEFGIARNARSPYVE 214 Query: 66 KSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTW--------KFPKNQYSDKA 117 + + + S+L++MGYD E I +LP S E + P + S+++ Sbjct: 215 DTNFCYHRTLKSFSELVAMGYDVELIRSLPFDESAMTEEELARRNKTDEEEPFDYVSEES 274 Query: 118 LEMIEYYELYVTIDYDGDGIAELRRVIMAGG---TGKDNILCNEEWNELPFTCLRAMRAP 174 + E Y+ ID DGD IAEL RV +AGG +G +L EE + +PF + P Sbjct: 275 MRNYFITECYIKIDRDGDDIAELLRVTLAGGNYTSGSSRLLGIEEVDHMPFATCSPILMP 334 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 H F G S+A +++Q+IK+VL RQ LDN Y N +T V + S ++ + +L + G + Sbjct: 335 HKFYGLSIADITMDLQRIKSVLTRQMLDNTYLANNSRTAVND-SHVNLDDLLTSRPGGVV 393 Query: 235 RVAA-GMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATA 293 R G + + I P+ ++++M+ YLD RTG+ D ++G L N+ Sbjct: 394 RYKGEGSASQYITPIPHNPLPNEAYTMMGYLDDVRRQRTGVGDETAGLGENSLSNVNTGV 453 Query: 294 TSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +L + ++ELI R L + G + +FR + +L+++HQD+ ++ + + + +P Sbjct: 454 AALAFDAKRMKIELIARILGEVGFKDVFRLIHKLLMKHQDRKMLLNVAGNFQAINP 509 >gi|160897386|ref|YP_001562968.1| hypothetical protein Daci_1943 [Delftia acidovorans SPH-1] gi|160362970|gb|ABX34583.1| conserved hypothetical protein [Delftia acidovorans SPH-1] Length = 763 Score = 351 bits (901), Expect = 8e-95, Method: Composition-based stats. Identities = 109/335 (32%), Positives = 164/335 (48%), Gaps = 22/335 (6%) Query: 28 GEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYD 87 + + D+ +R G+V V+ V P+EFLI + IE + VG ++ T S+L SMGY Sbjct: 228 PKMLWDVVCKRVKKGGRVRVENVPPEEFLISRKAKSIEDASFVGHRVARTISELKSMGYK 287 Query: 88 R-ESINNLPIISSQNIEN-----------TWKFPKNQYSDKALEMIEYYELYVTIDYDGD 135 + I + +S N+E + Q D + I E Y+ DYDGD Sbjct: 288 NVDDITSDDQAASLNMERIERLSWDDEMAYLQMDNVQSMDTSQRQIWVTECYLRCDYDGD 347 Query: 136 GIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTV 195 GIAELR+V+ AG + IL NE + PF + + PH F G S+A +E Q+I T+ Sbjct: 348 GIAELRKVVRAG----NQILENEVCDVAPFVSITPVPMPHKFFGLSVADLALEGQRINTI 403 Query: 196 LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVA-AGMDIRSVLGIHSVPMI 254 LLR LDN + + EG ++ + +L + G +R+ AGM R G + + Sbjct: 404 LLRNQLDNNNLEVNGRYFAVEGQ-VNLDDLLTSRPGGVVRMKSAGMAGRLDQGAGNSGL- 461 Query: 255 EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ 314 + M+ Y+ D TG + + G + L N TAT + I +++LI R A Sbjct: 462 --NLQMMEYMKGFQEDSTGWTRYNQGSDGDSL-NQTATGVNQIVNRADMRLDLIARNYAD 518 Query: 315 GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 G LFR +L+L Q+Q MV+LR +WV PR Sbjct: 519 GFRELFRLMLKLCSQYQQTEDMVKLRGKWVPVSPR 553 >gi|294083946|ref|YP_003550703.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663518|gb|ADE38619.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 697 Score = 342 bits (876), Expect = 6e-92, Method: Composition-based stats. Identities = 92/358 (25%), Positives = 169/358 (47%), Gaps = 27/358 (7%) Query: 7 IHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEK 66 + ML+ DV++++ S +D G + + R SQ V ++ + P+E ++ +E+ Sbjct: 180 LDMLLAQDDVDLIDSS-TDDVGMVSGTIGVTRDTSQ--VVIETIPPEELIVEAQCKSLEE 236 Query: 67 SPIVGRKLYLTRSDLISMGYDRESINNLP-------------IISSQNIENTWKFPKNQY 113 S + T S+L M D + ++++ + + F + Y Sbjct: 237 STFSAHRTRKTLSELREMYPDSDKLDDIGDHEDVEMETDPEILARHDGVSENRGFSSHGY 296 Query: 114 SDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRA 173 D+ I YE Y+ +D +G GIA+L +V AG + +L EE PF + Sbjct: 297 QDQVRH-ILCYEAYIMLDVEGSGIAKLHKVTKAG----NVLLDIEEVKRRPFVTFCPLPI 351 Query: 174 PHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP 233 PH F G + A + Q +TVL R LD+ N P+ +V +G + +P +++ + G Sbjct: 352 PHAFYGSNFAEKLCATQNARTVLTRSILDHAMITNNPRYMVVKGGLSNPRELIDNRVGGL 411 Query: 234 IRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEIL--QNMTA 291 + V+ ++ + P+ F L LDQ+L D TG+S +S G + + + QN A Sbjct: 412 VNVSRP---DAISAMPQAPLNPFVFQTLQQLDQDLEDNTGVSRLSQGLNKDAISKQNSAA 468 Query: 292 TATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 L S Q +++ R AQ ++ LF + RL+++++D+ ++V + +V DPR Sbjct: 469 MVEQLATMSQQRQ-KILARHFAQFVKSLFHEIYRLVVENEDQQKIVEISGAYVEVDPR 525 >gi|167600438|ref|YP_001671938.1| portal protein [Pseudomonas phage LUZ24] gi|161168301|emb|CAP45466.1| portal protein [Pseudomonas phage LUZ24] Length = 706 Score = 333 bits (855), Expect = 2e-89, Method: Composition-based stats. Identities = 102/354 (28%), Positives = 182/354 (51%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D E+L S EDG + ++IR+ + ++ V + P+ FL+ + I+ + Sbjct: 163 ILADPDTEILAQSVDEDG---TYSIKIRKDKKKREIKVTCIKPENFLVDRLATCIDDARF 219 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIIS------------SQNIENTWKFPKNQYSD-K 116 + + T SDL +G + ++ LP N + T + N D + Sbjct: 220 LCHREKYTVSDLRLLGVPEDVLDELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 279 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G D I+ NE W+ PF L A R H Sbjct: 280 ANREVWASECYTLLDVDGDGISELRRILYVG----DYIISNEPWDSRPFADLNAYRIAHK 335 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G ++ E +L + +RV Sbjct: 336 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ-VNLEDLLTNEAAGIVRV 394 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ-NMTATATS 295 A + S++ + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 395 KA---MNSIMPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 451 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 452 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAINP 505 >gi|148257059|ref|YP_001241644.1| hypothetical protein BBta_5791 [Bradyrhizobium sp. BTAi1] gi|146409232|gb|ABQ37738.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 557 Score = 331 bits (848), Expect = 1e-88, Method: Composition-based stats. Identities = 95/338 (28%), Positives = 157/338 (46%), Gaps = 24/338 (7%) Query: 30 KVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLY-LTRSDLISMGYDR 88 HD+ I + V V P+EF I + I ++ T + LI+ G+D Sbjct: 18 TTHDVTIVTTRKFAQARVMGVPPEEFGIERGARSIRDCNYCFHEIVTKTEAQLIAEGFDA 77 Query: 89 ESINNL-PIISSQNIENTWKFPKNQ-------YSDKALEMIEYYELYVTIDYDGDGIAEL 140 I +L + +E + ++ ++ ++ E YV +DY+G+G L Sbjct: 78 AQIRSLGDYAGTTRVETLARDTVDEQSRASASAANSGTRLVRITEHYVRMDYEGEGRPCL 137 Query: 141 RRVIMAGGTG----KDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 ++I G G KD C ++ +PF + H F G S+A ++ +Q+ KT L Sbjct: 138 YQIITGGDQGEILRKDGQDCITPFDAIPFAATTPVPMTHRFFGRSIADLVMPLQREKTAL 197 Query: 197 LRQTLDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGKPIRVA--AGMDIRSVLGIHSVP 252 R LDNLY N P+ V E + + +L + G +R G++ + V P Sbjct: 198 KRGALDNLYLHNNPRVEVAEANAGPNTLDDLLVSRPGGVVRTKTAGGLNWQVV------P 251 Query: 253 MIEKS-FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT 311 I S + ML Y+D EL R+G+S + G LQN +ATA + + + +++LI R Sbjct: 252 DITSSIYPMLQYIDAELESRSGLSKQAQGIDANALQNQSATAVAQVFSASQMRIKLIARI 311 Query: 312 LAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 +A+G+ +F L I +H + + VRLR+ WV DPR Sbjct: 312 MAEGVRDMFGLLHATIRKHGQQRQTVRLRNAWVQVDPR 349 >gi|27476052|ref|NP_775254.1| putative portal protein [Pseudomonas phage PaP3] gi|27414482|gb|AAL85568.1| ORF.04 [Pseudomonas phage PaP3] Length = 705 Score = 329 bits (843), Expect = 4e-88, Method: Composition-based stats. Identities = 101/354 (28%), Positives = 179/354 (50%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D +L S +DG + ++IR+ + ++ V V P+ FL+ + I+ + Sbjct: 162 ILSDPDTSILAQSVDDDG---TYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARF 218 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIIS------------SQNIENTWKFPKNQYSD-K 116 + + T SDL +G + I LP N + T + N D + Sbjct: 219 LCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 278 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G D I+ NE W+ PF L A R H Sbjct: 279 ANREVWASECYTLLDVDGDGISELRRILYVG----DYIISNEPWDCRPFADLNAYRIAHK 334 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G ++ E +L + +RV Sbjct: 335 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ-VNLEDLLTNEAAGIVRV 393 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ-NMTATATS 295 + + S+ + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 394 KS---MNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 450 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 451 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNP 504 >gi|221199509|ref|ZP_03572553.1| putative portal protein [Burkholderia multivorans CGD2M] gi|221205589|ref|ZP_03578604.1| putative portal protein [Burkholderia multivorans CGD2] gi|221174427|gb|EEE06859.1| putative portal protein [Burkholderia multivorans CGD2] gi|221180794|gb|EEE13197.1| putative portal protein [Burkholderia multivorans CGD2M] Length = 807 Score = 320 bits (821), Expect = 2e-85, Method: Composition-based stats. Identities = 100/335 (29%), Positives = 166/335 (49%), Gaps = 24/335 (7%) Query: 30 KVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR- 88 ++H++ + R G V ++AV P++FL+ S I ++ T SDL + GY+ Sbjct: 272 RLHNVVLTRSKKAGHVAIEAVMPEDFLVSARSRRIRD-GFCAHRVRKTLSDLKAEGYENV 330 Query: 89 ESINNLP----------IISSQNIEN--TWKFPKNQYSDKALEMIEYYELYVTIDYDGDG 136 E I++ P ++ QN +N + + D++ +E YE Y+ ID DGDG Sbjct: 331 ELIDSEPNAVAADLSELALARQNEQNRVVTNALDDGFGDESQREVELYECYLPIDVDGDG 390 Query: 137 IAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 I+E R++ AG IL NE + PF + + P IG S+A + IQ+IKT Sbjct: 391 ISEWRKITKAGNA----ILDNEVVDGPPFALVSPISIPGLLIGRSIADLAMPIQRIKTKF 446 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK 256 LR DN+ Q + + +G ++ ++ + G +R+ + I + +P I Sbjct: 447 LRGLDDNMQIQINGRVGLVDGK-VNVNDWMDNRPGGGVRIKSADAIVPIK--QGLPDIAG 503 Query: 257 SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-G 315 + +L Y+D +RTGI+ S G + L N TA I +V++I R A+ G Sbjct: 504 AMQLLQYVDAMSQERTGITKYSQGLDADTL-NHTADGIKRITARADLRVKMIARKFAETG 562 Query: 316 LEILFRGLLRLIIQHQDKVRMVRL-RDQWVSFDPR 349 + LFR + +L++QHQDK + L + +WV DPR Sbjct: 563 VTDLFRLIQKLLMQHQDKPMSIALSKGKWVDIDPR 597 >gi|288817860|ref|YP_003432207.1| putative portal protein [Hydrogenobacter thermophilus TK-6] gi|288787259|dbj|BAI69006.1| putative portal protein [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 298 bits (763), Expect = 9e-79, Method: Composition-based stats. Identities = 93/366 (25%), Positives = 180/366 (49%), Gaps = 33/366 (9%) Query: 8 HMLIKDSDVEVLEHSHR------EDGGEKVHDLRIR-RKYSQGKVCVDAVSPDEFLIHPD 60 +++ ++++ +H +D G ++ + ++ + S+ + C++ V EF+ HP Sbjct: 149 EIVLGWDELQLAQHDPTAVVESAQDLGNGIYRVALKISRLSKNQPCLENVPATEFIFHPS 208 Query: 61 SVDIEKSPIVGRKL-----YLTRSDL--ISMGYDR--ESINNLPIISSQNIENTWKFPKN 111 ++ ++ SP V + YL R + I D+ ES ++ + +Q + K K Sbjct: 209 TLSVKDSPFVAHRKVVTVDYLKRKEKEGIYKNVDKVIESASSDDLRYTQMADYYLKPYKK 268 Query: 112 QY-----SDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE--WNELP 164 D A + YE Y D + DG+ L VI+ G + IL +E + P Sbjct: 269 YAVSESDQDLARRKVLLYECYTKYDINNDGL--LEDVIIT--VGNNTILRIQENIYGRPP 324 Query: 165 FTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPES 224 F L + P+ G+S A + +IQ +KT L+ Q + N+ N + + + ++++ + Sbjct: 325 FFVLAPILEPYQLWGKSFADVLKDIQDLKTALVNQIIVNVGMNNDYKIAIND-TLVNVQD 383 Query: 225 VLNPQFGKPIRVAAGMDIR-SVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSP 283 ++N + IR+ AG DIR +++ + + P+ SF+ L Y++ +RTGI+ + G Sbjct: 384 IVNDKP--VIRMKAGADIRQAIMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDG 441 Query: 284 EILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 L N TA+ S+I Q+ ++ELI R A+ G++ LF L+ L Q D+ ++RL ++ Sbjct: 442 RSL-NKTASGISMIMQAANQRLELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNK 500 Query: 343 WVSFDP 348 + P Sbjct: 501 SLPIAP 506 >gi|308751459|gb|ADO44942.1| hypothetical protein Hydth_0542 [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 298 bits (763), Expect = 9e-79, Method: Composition-based stats. Identities = 93/366 (25%), Positives = 180/366 (49%), Gaps = 33/366 (9%) Query: 8 HMLIKDSDVEVLEHSHR------EDGGEKVHDLRIR-RKYSQGKVCVDAVSPDEFLIHPD 60 +++ ++++ +H +D G ++ + ++ + S+ + C++ V EF+ HP Sbjct: 149 EIVLGWDELQLAQHDPTAVVESAQDLGNGIYRVALKISRLSKNQPCLENVPATEFIFHPS 208 Query: 61 SVDIEKSPIVGRKL-----YLTRSDL--ISMGYDR--ESINNLPIISSQNIENTWKFPKN 111 ++ ++ SP V + YL R + I D+ ES ++ + +Q + K K Sbjct: 209 TLSVKDSPFVAHRKVVTVDYLKRKEKEGIYKNVDKVIESASSDDLRYTQMADYYLKPYKK 268 Query: 112 QY-----SDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE--WNELP 164 D A + YE Y D + DG+ L VI+ G + IL +E + P Sbjct: 269 YAVSESDQDLARRKVLLYECYTKYDINNDGL--LEDVIIT--VGNNTILRIQENIYGRPP 324 Query: 165 FTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPES 224 F L + P+ G+S A + +IQ +KT L+ Q + N+ N + + + ++++ + Sbjct: 325 FFVLAPILEPYQLWGKSFADVLKDIQDLKTALVNQIIVNVGMNNDYKIAIND-TLVNVQD 383 Query: 225 VLNPQFGKPIRVAAGMDIR-SVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSP 283 ++N + IR+ AG DIR +++ + + P+ SF+ L Y++ +RTGI+ + G Sbjct: 384 IVNDKP--VIRMKAGADIRQAIMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDG 441 Query: 284 EILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 L N TA+ S+I Q+ ++ELI R A+ G++ LF L+ L Q D+ ++RL ++ Sbjct: 442 RSL-NKTASGISMIMQAANQRLELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNK 500 Query: 343 WVSFDP 348 + P Sbjct: 501 SLPIAP 506 >gi|291334834|gb|ADD94474.1| hypothetical protein CLIBASIA_05245 [uncultured phage MedDCM-OCT-S06-C1041] Length = 265 Score = 296 bits (757), Expect = 4e-78, Method: Composition-based stats. Identities = 85/264 (32%), Positives = 138/264 (52%), Gaps = 14/264 (5%) Query: 84 MGYDRESINNLPIISSQNIENTW--------KFPKNQYSDKALEMIEYYELYVTIDYDGD 135 MGYD E I +LP S E + P + S++++ E Y+ ID DGD Sbjct: 1 MGYDVELIRSLPFDESAMTEEELARRNKTDEEEPFDYVSEESMRNYFITECYIKIDRDGD 60 Query: 136 GIAELRRVIMAGG---TGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKI 192 IAEL RV +AGG +G +L EE + +PF + PH F G S+A +++Q+I Sbjct: 61 DIAELLRVTLAGGNYTSGSSRLLGIEEVDHMPFATCSPILMPHKFYGLSIADITMDLQRI 120 Query: 193 KTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA-GMDIRSVLGIHSV 251 K+VL RQ LDN Y N +T V + S ++ + +L + G +R G + + I Sbjct: 121 KSVLTRQMLDNTYLANNSRTAVND-SHVNLDDLLTSRPGGVVRYKGEGSASQYITPIPHN 179 Query: 252 PMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT 311 P+ ++++M+ YLD RTG+ D ++G L N+ +L + ++ELI R Sbjct: 180 PLPNEAYTMMGYLDDVRRQRTGVGDETAGLGENSLSNVNTGVAALAFDAKRMKIELIARI 239 Query: 312 LAQ-GLEILFRGLLRLIIQHQDKV 334 L + G + +FR + +L+++HQD+ Sbjct: 240 LGEVGFKDVFRLIHKLLMKHQDRK 263 >gi|316934283|ref|YP_004109265.1| putative portal protein [Rhodopseudomonas palustris DX-1] gi|315601997|gb|ADU44532.1| putative portal protein [Rhodopseudomonas palustris DX-1] Length = 673 Score = 293 bits (750), Expect = 3e-77, Method: Composition-based stats. Identities = 86/351 (24%), Positives = 151/351 (43%), Gaps = 24/351 (6%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 + DVEV D + R + V+ V P+EF S + Sbjct: 163 ITSQEDVEV---DLELDEATGTYSGSWTRVTDTSGLRVEVVPPEEFYSDA-SKKRRQDGT 218 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSD--------KALEMI 121 GRK TR++LIS GY R+ ++ + + S ++ + + +D L+ I Sbjct: 219 RGRKTLKTRAELISEGYPRDKVSKVRVSSEIEFDSERQERDRETNDGIGSDAPQSELDQI 278 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 +E ++ + GDG A L R++ A G ++ E + F +R PH G + Sbjct: 279 LVHETFIQLSLKGDGKASLYRIVHADG----HLFEMGEVADDNFLDFVPLRRPHSQFGNN 334 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD 241 + I+ Q +TV+ R LD+ N P+ V S+ +P+ +L+ + + V Sbjct: 335 FSKRIVPTQNARTVITRSILDHAATVNNPRWTVLNNSLSNPKELLDARLRGVVNVK---- 390 Query: 242 IRSVLGIHSVPMIEKS-FSMLHYLDQELVDRTGISDISSGFSPEIL--QNMTATATSLIE 298 R +GI P + + F +L L + TGIS +S G + + + QN LI Sbjct: 391 NRDAIGILPYPQLNNAVFPLLEMLKTNKEETTGISSLSQGLNKDAISSQNSQGMVNDLIT 450 Query: 299 QSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 S Q ++I R A L LF +++I++Q + ++ + + + DPR Sbjct: 451 VSQTRQ-KIIARNFAMFLHDLFLAARKVVIENQTRKKVWEFDNNFQNIDPR 500 >gi|241760934|ref|ZP_04759023.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374553|gb|EER64014.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 729 Score = 290 bits (743), Expect = 2e-76, Method: Composition-based stats. Identities = 102/352 (28%), Positives = 167/352 (47%), Gaps = 24/352 (6%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +L++ D ++ + D G +++ + R Q + + +E+ + + + + Sbjct: 171 LLMEAEDNPDIQITLNSDNGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSARTRHEDDAD 230 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYSDKALEMI 121 Y T SDLISMG+DR+ + +LP S + W+ + SD+A + Sbjct: 231 YQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSSDQANREV 290 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 YE YV ID DGDGIAEL ++ KD +L EE +E PF H IG S Sbjct: 291 LLYEEYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNS 346 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD 241 LA +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G Sbjct: 347 LAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAI 400 Query: 242 IRSVLGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSL 296 +R GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L Sbjct: 401 VRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQAL 459 Query: 297 IEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 ++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 460 LQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|260753098|ref|YP_003225991.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552461|gb|ACV75407.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 729 Score = 290 bits (743), Expect = 2e-76, Method: Composition-based stats. Identities = 102/352 (28%), Positives = 167/352 (47%), Gaps = 24/352 (6%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +L++ D ++ + D G +++ + R Q + + +E+ + + + + Sbjct: 171 LLMEAEDNPDIQITLNNDDGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSARTRHEDDAD 230 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYSDKALEMI 121 Y T SDLISMG+DR+ + +LP S + W+ + SD+A + Sbjct: 231 YQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSSDQANREV 290 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 YE YV ID DGDGIAEL ++ KD +L EE +E PF H IG S Sbjct: 291 LLYEEYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNS 346 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD 241 LA +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G Sbjct: 347 LAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAI 400 Query: 242 IRSVLGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSL 296 +R GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L Sbjct: 401 VRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQAL 459 Query: 297 IEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 ++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 460 LQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|56551276|ref|YP_162115.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542850|gb|AAV89004.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] Length = 729 Score = 290 bits (742), Expect = 2e-76, Method: Composition-based stats. Identities = 102/352 (28%), Positives = 167/352 (47%), Gaps = 24/352 (6%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +L++ D ++ + D G +++ + R Q + + +E+ + + + + Sbjct: 171 LLMEAEDNPDIQITLNNDDGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSARTRHEDDAD 230 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYSDKALEMI 121 Y T SDLISMG+DR+ + +LP S + W+ + SD+A + Sbjct: 231 YQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSSDQANREV 290 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 YE YV ID DGDGIAEL ++ KD +L EE +E PF H IG S Sbjct: 291 LLYEEYVRIDRDGDGIAELLQIFRV----KDVLLSIEEVDEAPFVVWTPFPRAHRMIGNS 346 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD 241 LA +++IQ++K+VL+RQ LD +Y N P+ V + + + F + + G Sbjct: 347 LAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAV------NVDGLTEDTFDDLLTIRPGAI 400 Query: 242 IRSVLGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSL 296 +R GI P+ I+KS M+ Y+ RTGI+ ++ G + L N TAT +L Sbjct: 401 VRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTATGQAL 459 Query: 297 IEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 ++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 460 LQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|167583563|ref|YP_001671753.1| portal protein [Enterobacteria phage phiEco32] gi|164375401|gb|ABY52809.1| portal protein [Enterobacteria phage phiEco32] Length = 747 Score = 285 bits (730), Expect = 6e-75, Method: Composition-based stats. Identities = 70/343 (20%), Positives = 154/343 (44%), Gaps = 24/343 (6%) Query: 24 REDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLIS 83 E+ + D+++ + + +V V+ V ++ + + + ++ ++ DL++ Sbjct: 177 TEENEDGTVDVKVTYEQTVKRVKVEYVPSEQIFVDEHATSFADAQYFCHRVRRSKEDLVA 236 Query: 84 MGYDRESIN--NLPIISSQNIENTWKFPKNQY-----------SDKALEMIEYYELYVT- 129 MG+ ++ I N + ++T + + + ++ M+ YE Y+ Sbjct: 237 MGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDIDADIGTDTEDIASMVWVYEHYIRT 296 Query: 130 IDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEI 189 D + ++L +VI AG ++IL EE +PF P F G+S+ +I Sbjct: 297 GVLDKNKESKLYQVIQAG----EHILHTEEVTHIPFVTFCPYPIPGSFYGQSVYDITKDI 352 Query: 190 QKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIH 249 Q ++T L+R +DN+ N + G+ D S+L+ + G + + D + H Sbjct: 353 QDLRTALVRGYIDNVNNANYGRYKALVGA-YDRRSLLDNRPGGVVEMER-QDAIDLFPYH 410 Query: 250 SVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELI 308 ++P + +L ++ RTG++ + G +P++ +N A AT L+ + ++ ++ Sbjct: 411 NLP--QGIDGLLGMSEELKETRTGVTKLGMGINPDVFKNDNAYATVGLMMNAAQNRLRMV 468 Query: 309 VRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 R +A G+ L RG+ LI ++ + V+ V +P+ Sbjct: 469 CRNIAHNGMVELMRGIYSLIRENGEVPIEVQTPRGMVQVNPKQ 511 >gi|307308935|ref|ZP_07588618.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] gi|306900569|gb|EFN31182.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] Length = 677 Score = 253 bits (647), Expect = 2e-65, Method: Composition-based stats. Identities = 72/366 (19%), Positives = 156/366 (42%), Gaps = 32/366 (8%) Query: 9 MLIKDSDVEVLEHSHRED----------------GGEKVHDLRIRRKYSQGKVCVDAVSP 52 ++ DV++++ S +++ GG +V D++IR + + V V P Sbjct: 150 LIKGIPDVQLVQFSEQQEAGQIIIEESGEPYTIPGGVQVRDVKIRTVTRRSCINVFPVDP 209 Query: 53 DEFLIHPDS-VDIE----KSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 ++ ++ D+ D E ++ + G + ++RS LI +G+D+ +++ +P ++ + + Sbjct: 210 EDAVLSTDAQFDPETGGIRAKLQGHRKIMSRSVLIDLGFDKATVDRIPGVNEKTDGIALE 269 Query: 108 FPKNQYSDKALE--MIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNEL-P 164 K+ ++A + M+E Y +Y + D + R+ G + +L EE P Sbjct: 270 RLKDVSGERAFDKDMVEVYTVYTRLKL--DTTSRHYRITFGGDSANPILLDYEETTRFYP 327 Query: 165 FTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPES 224 + G+ +A I E + + + R D+L P T+ + + + Sbjct: 328 YAAFVPYPLAGTLFGQGIADRIGEDHEKISKMERAVQDSLNMSVFPITV-VDDDVSSIDD 386 Query: 225 VLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPE 284 + N GK IR ++ + + ++ ++ L+Q+L TG+ Sbjct: 387 LTNLHPGKVIRSSSPNGGINFVQHPFTG--AQATGIIERLEQKLDFSTGVGPQMMTLDAS 444 Query: 285 ILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMV--RLRD 341 LQ TATA + +E + R A+ G L + ++ L++Q D+ + + RL Sbjct: 445 DLQRTTATAINQRSNQQQTLIETVSRFFAETGYRYLTKVIVDLLVQKPDESQELIGRLTG 504 Query: 342 QWVSFD 347 ++ D Sbjct: 505 NFIPVD 510 >gi|227822445|ref|YP_002826417.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] gi|227341446|gb|ACP25664.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] Length = 361 Score = 249 bits (637), Expect = 3e-64, Method: Composition-based stats. Identities = 119/189 (62%), Positives = 147/189 (77%), Gaps = 1/189 (0%) Query: 163 LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDP 222 +PF L R PH G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +P Sbjct: 1 MPFADLIIERRPHQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANP 60 Query: 223 ESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK-SFSMLHYLDQELVDRTGISDISSGF 281 ESVLNP+FG+PIRV+ G+D R+ LG VP + K SF+ML YLDQE DRTGISD SSG Sbjct: 61 ESVLNPKFGQPIRVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGL 120 Query: 282 SPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 +P+ L NMTA AT+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR Sbjct: 121 APDALTNMTARATALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRG 180 Query: 342 QWVSFDPRY 350 QWV+FDPR+ Sbjct: 181 QWVTFDPRH 189 >gi|227821703|ref|YP_002825673.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] gi|227340702|gb|ACP24920.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] Length = 348 Score = 156 bits (395), Expect = 4e-36, Method: Composition-based stats. Identities = 75/165 (45%), Positives = 103/165 (62%), Gaps = 15/165 (9%) Query: 10 LIKDSDVEVLEH-SHRE--DGGEK-----VHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 L+ D DVEVLE S++E D + ++++RIRR G + AV +EFLIHPD+ Sbjct: 156 LVADDDVEVLEQESYQEQIDTPQGPQSVTLYNVRIRRTKEYGCTKLAAVPLEEFLIHPDA 215 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENT-----WKFPKNQYSD- 115 + I+ SPI G K L RSDL++MGYDRE ++ SS N E T + P ++ + Sbjct: 216 MSIDDSPITGIKTRLRRSDLVAMGYDREKVDKFATASSSNEEETEEFARRREPFDEKDEI 275 Query: 116 -KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 KAL+ ++YYELYV ID D DGIAELRR+ AGG + N+L +EE Sbjct: 276 IKALQEVDYYELYVKIDVDDDGIAELRRMCFAGGLAEVNLLDDEE 320 >gi|282598927|ref|YP_003358477.1| N4 gp59-like protein [Pseudomonas phage LIT1] gi|259048687|emb|CAZ66336.1| N4 gp59-like protein [Pseudomonas phage LIT1] Length = 726 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 62/318 (19%), Positives = 125/318 (39%), Gaps = 20/318 (6%) Query: 43 GKVCVDAVSPDEFLIHPDS-VDIEKSPIVGRKLYLTRSDLISMG----YDRESINNLPII 97 V + +I P D K+ + + ++L + G D+ + ++ Sbjct: 238 NHPTVQVCDYNNIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLL 297 Query: 98 SS-QNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILC 156 S + + DK+ + + +E + D GDG+ L ++ + Sbjct: 298 SEPDYTGPSEGVRNFDFQDKSRKRLVVHEYWGYYDIHGDGV--LHPIVATWVGAVMIRME 355 Query: 157 NEEW--NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 + +P+ + + GES A +I+ Q+I + R +D + Q V Sbjct: 356 ENPFPDKRIPYVVVNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGV 415 Query: 215 QEGSIIDPESVLNPQ---FGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ-ELVD 270 +G++ V N + G+ G D R+ + +H+ P I +S + L Q E Sbjct: 416 MKGAL----DVTNRRRFDRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAES 471 Query: 271 RTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQH 330 TG+ ++G S L TATA + + I+R L+ G+ + R ++ + + Sbjct: 472 MTGVKAFNAGISGAALG-DTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEF 530 Query: 331 QDKVRMVRLRDQ-WVSFD 347 D V +VR+ ++ +V Sbjct: 531 LDDVEVVRITNEHFVDIR 548 >gi|221271428|dbj|BAH15181.1| portal protein [Serratia phage KSP100] Length = 374 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 70/154 (45%), Gaps = 6/154 (3%) Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK 256 +R +DN+ N + +G D S+L+ + G + A I V P+ Sbjct: 1 VRGYIDNIMSANYGRFRAVKGQ-YDKRSLLDNRPGGVVEENA---IGMVDLFPHHPLPAG 56 Query: 257 SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQ- 314 S+L ++Q RTG++ I G SPE+ +N + AT ++ + ++ ++ R +AQ Sbjct: 57 VDSILEQIEQAKERRTGVTRIGMGLSPEVFKNDNSFATVDMMMSAAQNRMRMVARNVAQN 116 Query: 315 GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + LF + RL+ ++++ + + P Sbjct: 117 FMTQLFLAIYRLLKENENSTLPIEVNGAMKEVMP 150 >gi|326562389|gb|EGE12709.1| putative portal protein [Moraxella catarrhalis 103P14B1] Length = 806 Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 120/318 (37%), Gaps = 23/318 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 K VD + I P + E + V + S+L G NL + Q+ Sbjct: 292 NKPTVDICNLKNVFIDPTCKGNFENAQFVVHAYESSLSELKKQG----IYQNLGYLMEQH 347 Query: 102 I--ENTWKFPKNQ---YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILC 156 +N+ P + + D A + YE + D +G E ++ A D I+ Sbjct: 348 AQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNG--ETTAIVCA--WVGDTIIR 403 Query: 157 NEE----WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 EE +LPF + G A + + Q+I + R +D L QT Sbjct: 404 MEENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQT 463 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS-FSMLHYLDQELVDR 271 + + V G+ G D R + H+ P I S M+H ++ E Sbjct: 464 AFPKNFLDSANKVKYST-GQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESL 522 Query: 272 TGISDI-SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQH 330 +G+ S G S L +ATA + + + I+R +++G + R ++ + + Sbjct: 523 SGVKAFSSQGISASHLG-DSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEF 581 Query: 331 QDKVRMVRLRDQ-WVSFD 347 + +VR+ ++ +V+ Sbjct: 582 LSEKEIVRITNKEFVTIR 599 >gi|326567485|gb|EGE17600.1| putative portal protein [Moraxella catarrhalis BC1] Length = 806 Score = 93.2 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 119/318 (37%), Gaps = 23/318 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 K VD + I P + E + V + S+L G NL + Q Sbjct: 292 NKPTVDICNLKNVFIDPTCRGNFENAQFVVHAYESSLSELKKQG----IYQNLGYLMEQQ 347 Query: 102 --IENTWKFPKNQ---YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILC 156 +N+ P + + D A + YE + D +G E ++ A D I+ Sbjct: 348 SQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNG--ETTAIVCA--WVGDTIIR 403 Query: 157 NEE----WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 EE +LPF + G A + + Q+I + R +D L QT Sbjct: 404 MEENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQT 463 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS-FSMLHYLDQELVDR 271 + + V G+ G D R + H+ P I S M+H ++ E Sbjct: 464 AFPKNFLDSANKVKYST-GQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESL 522 Query: 272 TGISDI-SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQH 330 +G+ S G S L +ATA + + + I+R +++G + R ++ + + Sbjct: 523 SGVKAFSSQGISASHLG-DSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEF 581 Query: 331 QDKVRMVRLRDQ-WVSFD 347 + +VR+ ++ +V+ Sbjct: 582 LSEKEIVRITNKEFVTIR 599 >gi|326573143|gb|EGE23112.1| putative portal protein [Moraxella catarrhalis CO72] Length = 806 Score = 93.2 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 66/318 (20%), Positives = 119/318 (37%), Gaps = 23/318 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 K VD + I P + E + V + S+L G NL + Q Sbjct: 292 NKPTVDICNLKNVFIDPTCRGNFENAQFVVHAYESSLSELKKQG----IYQNLGYLMEQQ 347 Query: 102 --IENTWKFPKNQ---YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILC 156 +N+ P + + D A + YE + D +G E ++ A D I+ Sbjct: 348 SQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNG--ETTAIVCA--WVGDTIIR 403 Query: 157 NEE----WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 EE +LPF + G A + + Q+I + R +D L QT Sbjct: 404 MEENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQT 463 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS-FSMLHYLDQELVDR 271 + + V G+ G D R + H+ P I S M+H ++ E Sbjct: 464 AFPKNFLDSANKVKYST-GQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESL 522 Query: 272 TGISDI-SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQH 330 +G+ S G S L +ATA + + + I+R +++G + R ++ + + Sbjct: 523 SGVKAFSSQGISASHLG-DSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEF 581 Query: 331 QDKVRMVRLRDQ-WVSFD 347 + +VR+ ++ +V+ Sbjct: 582 LSEKEIVRITNKEFVTIR 599 >gi|319956914|ref|YP_004168177.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] gi|319419318|gb|ADV46428.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] Length = 561 Score = 89.4 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 54/305 (17%), Positives = 111/305 (36%), Gaps = 35/305 (11%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G++ ++ V + P++ ++ ++ T +L + N S Sbjct: 137 GQLRIERVKLKNMYLDPNASNVFDIQYCVHRVTTTIGNLRQQFGRKFKWKNYIGDSEDGT 196 Query: 103 ENTWKFPKNQYSDKALEMIEYYEL---YVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 S + + Y+ YV+ GD L +E Sbjct: 197 SYLSSADLGDASRIEVRDVYRYQSGKWYVSTVLPGDAFVRL----------------DEP 240 Query: 160 W-NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQP 210 + LPF G S +I +Q+ TV Q +D + Sbjct: 241 LKDGLPFIIGSVEPQFVRLDESNAVEAYGGSFIEPMIPLQEEYTVTRNQQIDAIAESLSK 300 Query: 211 QTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVD 270 + + + S ++ + +L+ + I V++ +++ + P I+ S + LD E+ + Sbjct: 301 RFLATKTSGLNEKDLLSNRTK--ISVSSLNEVKELQA----PRIDPSIFGIDRLDSEMQE 354 Query: 271 RTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQ 329 +GI+ + G + N TAT S++ + G + IVR L + E R ++RLI + Sbjct: 355 VSGITKYNQGLNDPHNLNQTATGVSILTEEGNAVIADIVRALNESFFEPAIRRMVRLIYK 414 Query: 330 HQDKV 334 + + Sbjct: 415 YGESP 419 >gi|307545235|ref|YP_003897714.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] gi|307217259|emb|CBV42529.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] Length = 749 Score = 88.2 bits (217), Expect = 1e-15, Method: Composition-based stats. Identities = 48/319 (15%), Positives = 104/319 (32%), Gaps = 30/319 (9%) Query: 44 KVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM----GYDRESINNLPIISS 99 + + VSP + PD+ I+ + + TRS L + Y ++I + Sbjct: 240 RPEFERVSPFDMYPSPDATSIDDGAFIIERARFTRSQLNQLIGVPSYSEDAIRQVLHQYG 299 Query: 100 QNIENTWKFPKNQYSDKALEM---------------------IEYYELYVTIDYDGDGIA 138 Q W + + ++ + + ++ D D +A Sbjct: 300 QGGLRDWLWSDGERAELEGRGHEWLTPGETIDGLIYSGGAQGVTLLQWGISPDEIEDPLA 359 Query: 139 ELR-RVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLL 197 E I+ G + + P+ P F G+ + + ++Q + Sbjct: 360 EYEVEAILIGQHVIRVRINRDPLERRPYHKSSFQPVPGSFWGQGIPELMADVQDVCNATA 419 Query: 198 RQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM---DIRSVLGIHSVPMI 254 R ++NL + PQ V E + E + K R A + + ++ Sbjct: 420 RGLVNNLAISSGPQVEVYEDRLQPQEDPTDIYPWKIWRTKASIETGNNPALRFFQPQSNA 479 Query: 255 EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ 314 + ++ + + T I G TA+ S++ +S ++ +R + + Sbjct: 480 SELLAVYEQFEYRADESTNIPRYMYGSDEAGGAGQTASGLSMLMESANKGIKDAIRHIDR 539 Query: 315 GL-EILFRGLLRLIIQHQD 332 G+ + L +Q D Sbjct: 540 GVLRRVIEALWLHNMQFSD 558 >gi|308516960|emb|CBW47065.1| structural protein, N4 gp59-like [Roseovarius sp. 217 phage 1] Length = 801 Score = 87.4 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 59/331 (17%), Positives = 119/331 (35%), Gaps = 24/331 (7%) Query: 35 RIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISM--------- 84 + + + V V+ + P D E+S + T+S+L++ Sbjct: 253 TVEERMVKNCPSVRIVNIANLFVDPSCEGDWEQSQYMVYTYEATKSELMAKKGTYQNLEN 312 Query: 85 -GYDRESINNL---PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 ++ I + P S N + +DK + + YE + D +GI Sbjct: 313 VNWESAKIQSNAGNPDHESNTPNNDMRTSGTGATDK--QKVLVYEYWGLYDIYDNGIMVP 370 Query: 141 RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 V G T + PF + M GE+ A+ + + Q+I + R Sbjct: 371 IVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLLQDNQRIIGAVTRGV 430 Query: 201 LDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS- 257 +D + QT +G + ++ +N G+ D ++ + P I +S Sbjct: 431 IDLMGRSANAQTGYAKGFLDPVNKRRFVN---GEDFEFNPNGDPKANIRQMEYPEIPRSA 487 Query: 258 FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLE 317 + + E TG+ S G S + AT S + I+R LA+G++ Sbjct: 488 HETIQMQNAEAEALTGVKSFSGGISGDAYG-SVATGIRGALDSAATREMSILRRLAKGMQ 546 Query: 318 ILFRGLLRLIIQHQDKVRMVRLRD-QWVSFD 347 + ++ + + + +VR+ + ++V Sbjct: 547 AIGTKMIAMNAKFLSEKEIVRVTNEEFVEVS 577 >gi|282599474|ref|YP_003358364.1| N4 gp59-like protein [Pseudomonas phage LUZ7] gi|259048573|emb|CAZ66223.1| N4 gp59-like protein [Pseudomonas phage LUZ7] Length = 720 Score = 87.0 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 59/338 (17%), Positives = 130/338 (38%), Gaps = 29/338 (8%) Query: 27 GGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMG 85 ++++ K + + +I P D+ K+ V + ++L + G Sbjct: 217 VPNGSEKVKVQ-KTIVNQPTLKVCDFRNIVIDPSCNGDMNKAKFVVESFESSYAELKADG 275 Query: 86 YDRESINNLPIISSQNIENTWKFPKN---------QYSDKALEMIEYYELYVTIDYDGDG 136 +NL I+ QN + + ++D++ + + +E + D GDG Sbjct: 276 ----RYSNLEKINEQNSDILSQPDYATGSESVRNFDFADRSRKRLVVHEYWGYYDIHGDG 331 Query: 137 IAELRRVIMAGGTGKDNILCNEEW--NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKT 194 EL ++ L + ++P+ + G+S + +I+ QKI Sbjct: 332 --ELHSIVATWVGQVLIRLELNPFPDGKIPYVVAAYLPVKDSVYGDSDGSLLIDNQKIVG 389 Query: 195 VLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQ---FGKPIRVAAGMDIRSVLGIHSV 251 + R +D + Q Q+G++ + N + G+ G + + + H+ Sbjct: 390 AISRGMIDIMAQSANGQVGFQKGAL----DITNRRRYERGETYEFNPGNNPATAIYTHTF 445 Query: 252 PMIEKSFS-MLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVR 310 I +S ML+ E TG+ ++G S + L TAT + + I+R Sbjct: 446 QEIPRSAEYMLNQQQLEAESMTGVKAFNTGISGQALG-DTATGIRGALDAASKRELGILR 504 Query: 311 TLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ-WVSFD 347 L+ L + R ++ + + D ++R+ ++ +V+ Sbjct: 505 RLSDCLIEVGRRVIAMNAEFLDDEEVIRITNEGFVTVR 542 >gi|316995429|gb|ADU79210.1| hypothetical protein EcP1_gp59 [Enterobacter phage EcP1] Length = 719 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 59/316 (18%), Positives = 120/316 (37%), Gaps = 24/316 (7%) Query: 42 QGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMG-YD-------RESIN 92 Q + V+ ++ + I P D++K+ V + + ++L G Y ++S Sbjct: 230 QNQPYVEVLNIENVYIDPSCQGDMDKATFVIHRFETSIAELKKSGNYKNLDKLTVKDSDE 289 Query: 93 NLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKD 152 +P IS I+ + N K+ + E + D D G+ L +++A G Sbjct: 290 LIPSISDDEIKTSTPTDYNISG-KSRKRFNVTEYWGYYDIDDSGV--LTPIVVA-YVGDV 345 Query: 153 NILCNE---EWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 I C+E + PF + + GE A I + Q I R +D + Sbjct: 346 KIRCSENPYPHGKPPFVVIPYLPMDSSVYGEPDAELIYDNQAIIGASTRAMIDLVARSAN 405 Query: 210 PQTIVQEGSI--IDPESVL--NPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLD 265 Q I+++ ++ + P+ V IR+V +I ++ + Sbjct: 406 GQNIIRKDVFDPVNYRKFMAGEDAQSNPLNVPLAEAIRTVTTPEVPSIIP---GLIQQQN 462 Query: 266 QELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLR 325 E +G+ S G S L ++ A + + + I+R L +G+ L R ++ Sbjct: 463 NEAESLSGVKAFSEGISSGSLGDVAA-GIRGVLDASSKREMSILRRLKKGMVDLGRMIIA 521 Query: 326 LIIQHQDKVRMVRLRD 341 + + ++R+ + Sbjct: 522 MNQEFLTDEEIIRITN 537 >gi|237651526|ref|YP_002898997.1| putative portal protein [Roseophage EE36P1] gi|220898158|gb|ACL81415.1| N4 gp59 protein [Sulfitobacter phage EE36phi1] Length = 800 Score = 85.5 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 55/331 (16%), Positives = 119/331 (35%), Gaps = 24/331 (7%) Query: 35 RIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISM--------- 84 I + + V ++ + P + EK+ + T S+L + Sbjct: 253 TIEERMVKNCPSVRIINVANLFVDPSCEGEWEKAQYMIYTYEATPSELKAKKDYYQNLDQ 312 Query: 85 -GYDRESINNL---PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 ++ I + P S+ N + +DK + + YE + D +G+ Sbjct: 313 VNWESAKIQSNHGNPDHESKTPNNDMRTSGTGSADK--QKVLVYEYWGLYDIYNNGVMVP 370 Query: 141 RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 V G T + PF + M GE+ A+ + + Q+I + R Sbjct: 371 IVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLLQDNQRIIGAVTRGV 430 Query: 201 LDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS- 257 +D + QT +G + ++ N G+ D ++ + P I +S Sbjct: 431 IDLMGRSANAQTGYAKGFLDPVNKRRFTN---GEDFEFNPNGDPKANIRQMEYPEIPRSA 487 Query: 258 FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLE 317 + + + E TG+ S G + + + AT S + I+R LA+G++ Sbjct: 488 HETIQWQNAEAEALTGVKSFSGGITGDAYGRV-ATGIRGALDSAAQREMSILRRLAKGIQ 546 Query: 318 ILFRGLLRLIIQHQDKVRMVRLRDQ-WVSFD 347 + ++ + + + ++R+ ++ +V Sbjct: 547 DIGMKMIAMNGKFLSEKEIIRVTNREFVEVS 577 >gi|237651609|ref|YP_002899079.1| putative portal protein [Roseophage DSS3P2] gi|220898079|gb|ACL81337.1| N4 94kDa-like protein [Silicibacter phage DSS3phi2] Length = 800 Score = 83.6 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 57/329 (17%), Positives = 118/329 (35%), Gaps = 20/329 (6%) Query: 35 RIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISM--------- 84 I + + V ++ + P + EK+ + T S+L + Sbjct: 253 TIEERMVKNCPSVRIINVANLFVDPSCEGEWEKAQYMIYTYEATPSELKAKKNYYQNLDK 312 Query: 85 -GYDRESINNL---PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 ++ I + P S N + +DK + + YE + D +G+ Sbjct: 313 VNWESAKIQSNHGNPDHESNTPNNDMRTSGTGSADK--QKVLVYEYWGLYDIYANGVMVP 370 Query: 141 RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 V G T + PF + M GE+ A+ + + Q+I + R Sbjct: 371 IVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLLQDNQRIIGAVTRGV 430 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS-FS 259 +D + QT +G + DP + G+ D ++ + P I +S Sbjct: 431 IDLMGRSANAQTGYAKGFL-DPVNKRRFTQGEDFEFNPNGDPKANIRQMEYPEIPRSAHE 489 Query: 260 MLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEIL 319 + + + E TG+ S G S + + AT S + I+R LA+G++ + Sbjct: 490 TIQWQNAEAEALTGVKSFSGGISGDAYGRV-ATGIRGALDSASQREMSILRRLAKGIQDI 548 Query: 320 FRGLLRLIIQHQDKVRMVRLRDQ-WVSFD 347 ++ + + + ++R+ ++ +V Sbjct: 549 GMKMISMNGKFLSEKEIIRVTNREFVEVS 577 >gi|228905598|ref|ZP_04069542.1| hypothetical protein bthur0014_66580 [Bacillus thuringiensis IBL 4222] gi|228854038|gb|EEM98752.1| hypothetical protein bthur0014_66580 [Bacillus thuringiensis IBL 4222] Length = 707 Score = 80.5 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 58/305 (19%), Positives = 116/305 (38%), Gaps = 15/305 (4%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIIS 98 K G++ P I P + E+ + + D I Y ++ + + Sbjct: 174 KLYTGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDI-DYIQERYGKDVAADENVGF 232 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVT-IDYDGDGIAELRRVIMAGGTGKDNILCN 157 + + T + N S K M E++V +G+ +V +AGG D + Sbjct: 233 AAAFDVTPQNGFNSTSKKRPNMAMVDEMWVKPCGKHPNGL----KVTIAGGQLLDI---D 285 Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 E ++PF + P E+ ++ IQ+ ++ + +V G Sbjct: 286 ENAGDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPMG 345 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFS-MLHYLDQELVDRTGISD 276 S +D + + N + G + R P I + +L+ D ++ D +G + Sbjct: 346 SSVDEDEITNEE-GGIVHYTPIEGARPER--VGAPDIPSFYDRILNNHDADIDDLSGARE 402 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 IS G P L T + SL+ + ++ + + G++ L + +L L+ +H + RM Sbjct: 403 ISQGRLPSGL--DTYSGLSLMVEQENEKLAVSSQNYEHGMKRLLQRVLMLMKKHYTEERM 460 Query: 337 VRLRD 341 R+ Sbjct: 461 ARILG 465 >gi|75761880|ref|ZP_00741807.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228905318|ref|ZP_04069295.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|228937950|ref|ZP_04100577.1| hypothetical protein bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970830|ref|ZP_04131470.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977404|ref|ZP_04137799.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|74490640|gb|EAO53929.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228782381|gb|EEM30564.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|228788955|gb|EEM36894.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228821741|gb|EEM67742.1| hypothetical protein bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228854317|gb|EEM98998.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|326938429|gb|AEA14325.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 707 Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 57/305 (18%), Positives = 117/305 (38%), Gaps = 15/305 (4%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIIS 98 + G++ P I P + E+ + + D I Y ++ + + Sbjct: 174 RLYTGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDI-DYIKERYGKDVAADENVGF 232 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVT-IDYDGDGIAELRRVIMAGGTGKDNILCN 157 + + T + N S K M E++V +G+ +V +AGG D + Sbjct: 233 AAAFDVTPQNGFNSTSKKRPNMAMVDEMWVKPCGKHPNGL----KVTIAGGQLLDI---D 285 Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 E ++PF + P E+ ++ IQ+ ++ + +V G Sbjct: 286 ENAGDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPMG 345 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFS-MLHYLDQELVDRTGISD 276 S +D + + N + G + +R P I + +L+ D ++ D +G + Sbjct: 346 SSVDEDEITNEE-GGIVHYTPIEGVRPER--VGAPDIPSFYDRILNNHDADIDDLSGARE 402 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 IS G P L T + SL+ + ++ + + G++ L + +L L+ +H + RM Sbjct: 403 ISQGRLPSGL--DTYSGLSLMVEQENEKLAVSSQNYEHGMKRLLQRVLLLMKKHYTEERM 460 Query: 337 VRLRD 341 R+ Sbjct: 461 ARILG 465 >gi|119952228|ref|YP_950537.1| 94 kDa protein [Enterobacteria phage N4] gi|117650947|gb|ABK54420.1| 94 kDa protein [Enterobacteria phage N4] Length = 763 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 60/357 (16%), Positives = 137/357 (38%), Gaps = 41/357 (11%) Query: 17 EVLEHSHR--EDGGEKVHDLRIRRKYSQ------GKVCVDAVSPDEFLIHPDSV-DIEKS 67 E ++ S R ++ G+ + ++ ++ V+ ++P+ +I P DI K+ Sbjct: 202 EAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKA 261 Query: 68 PIVGRKLYLTRSDLISM---GYDRESIN---NLPIISSQNIENTWKFPKNQYSDKALEMI 121 ++DL+ ++ I+ + P+ + T + Q SD + + Sbjct: 262 MFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQE--FQISDPMRKRV 319 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW--NELPFTCLRAMRAPHCFIG 179 YE + D +G+G+ L ++ L + +LPF + M G Sbjct: 320 VAYEYWGFWDIEGNGV--LEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYG 377 Query: 180 ESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAG 239 E A + + Q + ++R +D L Q + +G + LN + R G Sbjct: 378 EPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGML----DALNSR-----RYREG 428 Query: 240 MD---------IRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMT 290 D + ++ + + + +M +QE TG+ + G + E ++ Sbjct: 429 EDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVA 488 Query: 291 ATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-QWVSF 346 A + + + I+R LA+G+ + ++ + + +VR+ + ++V+ Sbjct: 489 A-GIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTI 544 >gi|167749268|ref|ZP_02421395.1| hypothetical protein EUBSIR_00219 [Eubacterium siraeum DSM 15702] gi|167657761|gb|EDS01891.1| hypothetical protein EUBSIR_00219 [Eubacterium siraeum DSM 15702] Length = 534 Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 107/310 (34%), Gaps = 11/310 (3%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G + + P DIE+S + + R L M E P + Sbjct: 140 GDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYP--ELCGEEPESVAGGT 197 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL-----RRVIMAGGTGKDNILCN 157 N K+ +D + +E + Y +G RVI + + Sbjct: 198 GNVEKYKTEDKTDDSA-KVEVVDWYYKKTINGRKQLCYCKFCGDRVIYSSEDDESCADGF 256 Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 + + PF G + + Q L + L++ ++ + +++ Sbjct: 257 YKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLEHTVMMSRKRYFIRQN 316 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDI 277 S ++ + + + + VA + + I + P+ + L + EL + +G D Sbjct: 317 SAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSFKIDELKETSGNRDF 375 Query: 278 SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMV 337 S G + A+A + ++++G +++ + + ++ LI Q D R Sbjct: 376 SQGSVSNGV--TAASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDTPRSF 433 Query: 338 RLRDQWVSFD 347 R+ + +FD Sbjct: 434 RITGGYDAFD 443 >gi|291529975|emb|CBK95560.1| hypothetical protein EUS_02210 [Eubacterium siraeum 70/3] Length = 534 Score = 77.8 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 107/310 (34%), Gaps = 11/310 (3%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G + + P DIE+S + + R L M E + + Sbjct: 140 GDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYP--ELCEDDTESVAGGT 197 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL-----RRVIMAGGTGKDNILCN 157 EN K+ +D + +E + Y +G RVI + + Sbjct: 198 ENVEKYKTEDKTDDSA-KVEVIDWYYKKTINGRKQLCYCKFCGDRVIYSSEDDESCADGF 256 Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 + + PF G + + Q L + L + ++ + +++ Sbjct: 257 YKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLAHTVMMSRKRYFIRQN 316 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDI 277 S ++ + + + + VA + + I + P+ + L + EL + +G D Sbjct: 317 SAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSFKIDELKETSGNRDF 375 Query: 278 SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMV 337 S G + A+A + ++++G +++ + + ++ LI Q D R Sbjct: 376 SQGSVSNGV--TAASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDTPRSF 433 Query: 338 RLRDQWVSFD 347 R+ + +FD Sbjct: 434 RITGGYDAFD 443 >gi|291556862|emb|CBL33979.1| hypothetical protein ES1_09090 [Eubacterium siraeum V10Sc8a] Length = 534 Score = 77.4 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 49/310 (15%), Positives = 106/310 (34%), Gaps = 11/310 (3%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G + + P DIE+S + + R L M E P + Sbjct: 140 GDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYP--ELCGEEPESVAGGT 197 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL-----RRVIMAGGTGKDNILCN 157 N K+ +D + +E + Y +G RVI + + Sbjct: 198 GNVEKYKTEDKTDDSA-KVEVVDWYYKKTINGRKQLCYCKFCGDRVIYSSEDDESCADGF 256 Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 + + PF G + + Q L + L++ ++ + +++ Sbjct: 257 YKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLEHTVMMSRKRYFIRQN 316 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDI 277 S ++ + + + + VA + + I + P+ + L EL + +G D Sbjct: 317 SAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSLKIDELKETSGNRDF 375 Query: 278 SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMV 337 S G + A+A + ++++G +++ + + ++ LI Q D R Sbjct: 376 SQGSVSNGV--TAASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDTPRSF 433 Query: 338 RLRDQWVSFD 347 R+ + +FD Sbjct: 434 RITGGYDAFD 443 >gi|257459274|ref|ZP_05624388.1| conserved hypothetical protein [Campylobacter gracilis RM3268] gi|257443287|gb|EEV18416.1| conserved hypothetical protein [Campylobacter gracilis RM3268] Length = 516 Score = 75.5 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 60/307 (19%), Positives = 121/307 (39%), Gaps = 33/307 (10%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESIN 92 ++ +S+ + +D VS + P + + + ++YL+ D++S G Sbjct: 124 SCAVKVYWSKDRAMIDEVSLQDLYFDPGARGLNDISYLVHRIYLSSEDILSYG------- 176 Query: 93 NLPIISSQNIENTWKFPKNQYSDKALEMIEYYEL-----YVTIDYDGDGIAELRRVIMAG 147 IEN F + ++ E+ E YEL YV+ Y+ EL R ++ Sbjct: 177 ---KRGIFRIENKEAFADKKPYER-FEIYEIYELRGGKWYVSSLYEN----ELLRDLIEL 228 Query: 148 GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQ 207 G+ I+ LP GE S++ +Q V D + Q Sbjct: 229 RDGQPFIVGYM----LPQIRCTDEEIYVSAYGEPALMSMLPLQNELNVNRNSITDVIRQQ 284 Query: 208 NQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE 267 P+ I+ + S+++ + G PI ++ + I + + L ++ E Sbjct: 285 VAPKIILGKASMVERGEL--ESVGTPIYADQPSAVQVL----PAGDIGGAMAALQVIENE 338 Query: 268 LVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRL 326 + + +G+S +G ++ TAT S++ G +++ +RT + E +F L L Sbjct: 339 MSEVSGVSPQQNG--ATTVRKETATMASIMANEGSVRLQGYIRTFNETFFEPIFERLAFL 396 Query: 327 IIQHQDK 333 + ++ D Sbjct: 397 VWKYADP 403 >gi|153951607|ref|YP_001398216.1| hypothetical protein JJD26997_1133 [Campylobacter jejuni subsp. doylei 269.97] gi|153952365|ref|YP_001397542.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] gi|152939053|gb|ABS43794.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei 269.97] gi|152939811|gb|ABS44552.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] Length = 507 Score = 74.7 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 54/302 (17%), Positives = 118/302 (39%), Gaps = 41/302 (13%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 +G ++ V D P++++ E + ++YLT +++ Y+R+ + Sbjct: 136 KGMPRIERVGIDSIFFDPNALNSEDVGYIVNEIYLTYNEI----YERQKL---------G 182 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW- 160 + PK + + ++ Y++Y + D ++ L ++++L NE Sbjct: 183 FYKKLETPKLLDEEDEYKKVKLYDIYERKNDDAWVVSTLF---------ENHLLRNEVIL 233 Query: 161 -NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 + PF + GE + AS + +Q + +D + P+ Sbjct: 234 QDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPK 293 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 ++ + + E + GKP+ ++ I P + + L L+ EL + Sbjct: 294 IMLPKSMGVSREDIETL--GKPLYTDDPKGVQ----ILPPPDVNSAGMNLQLLESELTEV 347 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQH 330 TG+S ++G + N TAT S+ Q G + +R + +E LF L+ ++ Sbjct: 348 TGVSPQNNG--AQTAHNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKY 405 Query: 331 QD 332 + Sbjct: 406 GE 407 >gi|315929405|gb|EFV08607.1| hypothetical protein CSS_1407 [Campylobacter jejuni subsp. jejuni 305] Length = 512 Score = 70.9 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 56/302 (18%), Positives = 119/302 (39%), Gaps = 41/302 (13%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 +G ++ V D P++++ E + ++YLT + + ++R+ + Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTYNQI----HERQKL-------GFY 186 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW- 160 + K ++ D + ++ Y++Y + D ++ L ++N+L NE Sbjct: 187 KKIEIKKLFDE--DDEYKKVKLYDIYERKNDDEWVVSTLF---------ENNLLRNEVTL 235 Query: 161 -NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 + PF + GE + AS + +Q + +D + P+ Sbjct: 236 QDGQPFIWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPK 295 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 ++ + + E + GKPI ++ I P + + L L+ EL + Sbjct: 296 IMMPKSMGVSREDIETL--GKPIYTDDPKGVQ----ILPPPNVNSAGMNLQLLESELTEV 349 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQH 330 TG+S ++G + QN TAT S+ Q G + +R + +E LF L+ ++ Sbjct: 350 TGVSPQNNG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKY 407 Query: 331 QD 332 + Sbjct: 408 GE 409 >gi|154174760|ref|YP_001409087.1| hypothetical protein CCV52592_0034 [Campylobacter curvus 525.92] gi|153793129|gb|EAU00312.2| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 554 Score = 70.1 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 60/306 (19%), Positives = 115/306 (37%), Gaps = 46/306 (15%) Query: 41 SQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL----ISMGYDRESINNLPI 96 +G ++ V D+ P++ D + ++ L+ DL YD+E+ N L Sbjct: 134 RKGLPVIEEVELDDIFFDPEAKDHDDIRYYVNRISLSYEDLGNLAKQKIYDKEATNEL-- 191 Query: 97 ISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILC 156 + + + ++ LE+ + YE + D + +A +L Sbjct: 192 ---ISRDEAKERRYDR-----LEIYDVYEC------END---KWYLSTIADNA----LLR 230 Query: 157 N--EEWNELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYW 206 + E + PF + F GE ASI+ +Q+ +D + Sbjct: 231 DKVELKDGCPFILGYMVPQVRDFSEQNFVCAYGEPPLASILPLQEEMNFARNSLIDAMNM 290 Query: 207 QNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ 266 +P+ IV + I + GKP+ I V P I + + +D Sbjct: 291 HLKPKAIVPLSANISRTDLET--IGKPVYAQTPAQITFV----PPPNIGSAQINISLIDN 344 Query: 267 ELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLR 325 E+ + +G+S +G + TAT S++ G +V+ VR+ + +E LF L Sbjct: 345 EMSEASGVSPQQNG--ATTPRKETATMASIMANEGSVRVQGYVRSFNETFIEPLFERLAM 402 Query: 326 LIIQHQ 331 L+ ++ Sbjct: 403 LVWKYG 408 >gi|283956319|ref|ZP_06373799.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] gi|283792039|gb|EFC30828.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] Length = 512 Score = 70.1 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 55/302 (18%), Positives = 113/302 (37%), Gaps = 41/302 (13%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 +G ++ V D P++++ E + ++YLT N + + Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTY-------------NQIHERQNLG 184 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW- 160 + K D + ++ Y++Y + D ++ L ++N+L N+ Sbjct: 185 FYKNIEIQKLFDEDDEYKKVKLYDIYERKNDDEWVVSTLF---------ENNLLRNKVTL 235 Query: 161 -NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 + PF + GE + AS + +Q + +D + P+ Sbjct: 236 QDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPK 295 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 ++ + + E + GKPI ++ I P + + L L+ EL + Sbjct: 296 IMMPKSMGVSREDIETL--GKPIYTDDPKGVQ----ILPPPNVNSAGMNLQLLESELTEV 349 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQH 330 TG+S ++G + QN TAT S+ Q G + +R + +E LF L+ ++ Sbjct: 350 TGVSPQNNG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKY 407 Query: 331 QD 332 + Sbjct: 408 GE 409 >gi|57237581|ref|YP_178595.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] gi|57166385|gb|AAW35164.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] Length = 512 Score = 68.9 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 56/302 (18%), Positives = 119/302 (39%), Gaps = 41/302 (13%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 +G ++ V D P++++ E + ++YLT + + ++R+ + Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTYNQI----HERQKL-------GFY 186 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW- 160 +N K ++ D + ++ Y++Y + D ++ L ++N+L NE Sbjct: 187 KKNEIKKLFDE--DDEYKKVKLYDIYERKNDDEWVVSTLF---------ENNLLRNEVTL 235 Query: 161 -NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 + PF + GE + AS + +Q + +D + P+ Sbjct: 236 QDGQPFIWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPK 295 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 ++ + + E + GKPI ++ I P + + L L+ EL + Sbjct: 296 IMMPKSMGVSREDIETL--GKPIYTDDPKGVQ----ILPPPNVNSAGMNLQLLESELTEV 349 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQH 330 G+S ++G + QN TAT S+ Q G + +R + +E LF L+ ++ Sbjct: 350 IGVSPQNNG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKY 407 Query: 331 QD 332 + Sbjct: 408 GE 409 >gi|113461527|ref|YP_719596.1| hypothetical protein HS_1384 [Haemophilus somnus 129PT] gi|112823570|gb|ABI25659.1| hemophilus-specific protein, uncharacterized [Haemophilus somnus 129PT] Length = 688 Score = 60.1 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/337 (13%), Positives = 104/337 (30%), Gaps = 49/337 (14%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 +++ V+ ++ D G + S+ V V P +F+ + ++ Sbjct: 123 ILRGPVVDTIDERIWSDDGMGNWSA---QTKSKIVPKVRLVLPWDFVPDMTAPTLKDCQF 179 Query: 70 VGRKLYLTRSDLISM----GYDRESINNLPIISSQNIENTWKFPK------------NQY 113 V + YLT+ L ++ Y +++ L + + + Sbjct: 180 VFERSYLTKKQLQNLLNNPYYLADTVQALIESEASETHTSSSDMDGYLDTLRTLSGLEKA 239 Query: 114 SDKALEMIEYYELYVTIDY-------------------DGDGIAELRRVIMAGGTGKDNI 154 S+ + Y + + AE+ VI+ G GK Sbjct: 240 SNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQKSEKAEIDGVIVMTGNGKILS 299 Query: 155 LCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 + + E P++ C G + + Q+I R +DN Q Sbjct: 300 VNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEILNTAWRGMIDNGVLTIGSQI 359 Query: 213 IVQEGSIIDPESVLNPQFGKPIRV------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLD 265 +V + + + K R A + + G+ + ++ +++ Sbjct: 360 VVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRAFGVFNFESRQQELANIIQLAK 419 Query: 266 QELVDRTGISDISSGFSPEILQNMTATATSLIEQSGV 302 + + +G+ I+ G ++ T S++ + Sbjct: 420 SFMDEESGLPMIAQGEQGQVTP--TLGGMSMLMNAAN 454 >gi|170719076|ref|YP_001784230.1| hypothetical protein HSM_0898 [Haemophilus somnus 2336] gi|168827205|gb|ACA32576.1| Haemophilus-specific protein, uncharacterized [Haemophilus somnus 2336] Length = 725 Score = 59.3 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 44/337 (13%), Positives = 104/337 (30%), Gaps = 49/337 (14%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 +++ V+ ++ D G + S+ V V P +F+ + ++ Sbjct: 160 ILRGPVVDTIDERIWSDDGMGNWSA---QTKSKIVPKVRLVLPWDFVPDMTAPTLKDCQF 216 Query: 70 VGRKLYLTRSDLISM----GYDRESINNLPIISSQNIENTWKFPK------------NQY 113 V + YLT+ L ++ Y +++ L + + + Sbjct: 217 VFERSYLTKKQLQNLLNNPYYLADTVQALIESEASETHTSSSDMDGYLDTLRTLSGLEKA 276 Query: 114 SDKALEMIEYYELYVTIDY-------------------DGDGIAELRRVIMAGGTGKDNI 154 S+ + Y + + AE+ VI+ G GK Sbjct: 277 SNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQKSEKAEIDGVIVMTGNGKILS 336 Query: 155 LCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 + + E P++ C G + + Q+I R +DN Q Sbjct: 337 VNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEILNTAWRGMIDNGVLTIGSQI 396 Query: 213 IVQEGSIIDPESVLNPQFGKPIRV------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLD 265 +V + + + K R A + + G+ + ++ +++ Sbjct: 397 VVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRAFGVFNFESRQQELANIIQLAK 456 Query: 266 QELVDRTGISDISSGFSPEILQNMTATATSLIEQSGV 302 + + +G+ I+ G ++ T S++ + Sbjct: 457 SFMDEESGLPMIAQGEQGQVTP--TLGGMSMLMNAAN 491 >gi|171914969|ref|ZP_02930439.1| hypothetical protein VspiD_27370 [Verrucomicrobium spinosum DSM 4136] Length = 711 Score = 57.8 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/240 (15%), Positives = 82/240 (34%), Gaps = 9/240 (3%) Query: 76 LTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM--IEYYELYVTIDYD 133 LT ++ + + + + S E+ P + ++ + E ++ +D Sbjct: 312 LTDAETRELYHAAKERSESLKPESARSESAPADPDQDDPNGSIPNLPVRLIEGFMRVDAL 371 Query: 134 GDGIAELRRVIMAGG----TGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEI 189 G G A ++ A D + +LP R P +G ++ Sbjct: 372 GKGQASNIYMVFAPQCEMCLKLDYLGNITPKGKLPVHAHTINRLPWRIVGRGFFERFDKV 431 Query: 190 QKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFG--KPIRVAAGMDIRSVLG 247 Q L + + + P T + + + + F KP+ + + + Sbjct: 432 QTFVDDLFNRINWHDRKSSDPITGFDKSKLAQEDEEEDEPFNSEKPLNLKPDSKLDEAIQ 491 Query: 248 IHSVPMIE-KSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 ++P + ++ ML + Q + RTGI+ + G + + TAT + ++ Sbjct: 492 FKALPDLNDRTKEMLQMMVQMVQLRTGITAANQGDVAGLPEASTATGIKQLMSRAAVLLK 551 >gi|115304377|ref|YP_762669.1| PfWMP4_39 [Cyanophage Pf-WMP4] gi|113201871|gb|ABI33183.1| PfWMP4_39 [Phormidium phage Pf-WMP4] Length = 641 Score = 57.0 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 47/326 (14%), Positives = 104/326 (31%), Gaps = 18/326 (5%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGY-DRESI 91 D+ + R + ++ ++ +SP + + + + R +L++ GY D + Sbjct: 186 DVAVNR--QRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLT 243 Query: 92 NNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGK 151 + + + N ++IEY Y + +G + V G Sbjct: 244 QVEQYVDYKFADPDTPKDVNGTDTSGWDIIEY---YGPLLVEGVQFWCVHAVFY--GKQL 298 Query: 152 DNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 + ++ W PF + G S+ + + VL LDNL Sbjct: 299 IRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKM 358 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 + E I+ E V + G +VA ++ + ++ + + Sbjct: 359 WTLVEDGILKREDV-KAKPGAVFKVAQHGSLQPID--MGRQDFVVTYQEAQVQESSVYRN 415 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLE-ILFRGLLRLIIQH 330 T + +P + +TA + +G ++ + + L + L+ Q Sbjct: 416 TSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQF 475 Query: 331 QDKVRMVRL------RDQWVSFDPRY 350 +R+ D + P Y Sbjct: 476 YVTPETIRMYVPEEQMDGFFEVSPEY 501 >gi|291334833|gb|ADD94473.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 110 Score = 56.6 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 32/73 (43%), Gaps = 6/73 (8%) Query: 10 LIKDSDV--EVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD--IE 65 L+ D +V E++E S E E ++ + G + ++ V P+EF I ++ +E Sbjct: 38 LLSDPNVQREIIEDSVEET--EFGLNVEFKVIEKMGSIRIEPVPPEEFGIARNARSPYVE 95 Query: 66 KSPIVGRKLYLTR 78 + + + Sbjct: 96 DTNFCYHRTLKSF 108 >gi|313113989|ref|ZP_07799544.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] gi|310623691|gb|EFQ07091.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] Length = 649 Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 46/324 (14%), Positives = 110/324 (33%), Gaps = 37/324 (11%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G++C+ +V+ P DI+ +P + + D + + + + Sbjct: 172 GEICIRSVNLLMLYWEPGVEDIQDTPHLFSLSLM----------DNDQLEGRYPQMAGHT 221 Query: 103 ENTWKFPKNQYSDKAL--EMIEYYELYVTIDYDGDGIAELR-----RVIMAGGTGKDNIL 155 ++ K + D + + Y +G V++ Sbjct: 222 GSSMDVAKYIHDDSIDTGDKSVVVDWYYKKALEGGQTVLHYCKYCNGVVLYASENDPQYA 281 Query: 156 CNEEWNE--LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 ++ PF R G + + Q + +N+ + + + Sbjct: 282 QRGFYDHGKYPFVFDPLFREEDSPAGFGYIDVMKDTQTAIDEMNHAMDENVKLAAKARYV 341 Query: 214 VQEGSIIDPESVLNPQFGK-PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRT 272 + + + ++ E + + FGK + V + S + + + S EL + + Sbjct: 342 LSDTAGVNEEELAD--FGKDIVHVVGRLTDDSFRPLQTNVLSGNCISYRDARVSELKEIS 399 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSG----VGQVELIVRTLAQGLEILFRGLLRLII 328 G D+S G + L A+A + ++++G ++ RT A + + + L+ Sbjct: 400 GNRDVSQGGTTSGL--TAASAIAALQEAGSKLSRDMLKSAYRTFA---KECYLVI-ELMR 453 Query: 329 QHQDKVRMVRLRD-----QWVSFD 347 Q D+ R+ R+ ++V F Sbjct: 454 QFYDEERVYRITGESGGVEYVPFS 477 >gi|75760981|ref|ZP_00740986.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74491524|gb|EAO54735.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 304 Score = 52.4 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 71/195 (36%), Gaps = 11/195 (5%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIIS 98 K G++ P I P + E+ + + D I Y ++ + + Sbjct: 113 KLYTGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDI-DYIQERYGKDVAADENVGF 171 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVT-IDYDGDGIAELRRVIMAGGTGKDNILCN 157 + + T + N S K M E++V +G+ +V +AGG +L Sbjct: 172 AAAFDVTPQNGFNSTSKKRPNMAMVDEMWVKPCGKHPNGL----KVTIAGGQ----LLDI 223 Query: 158 EE-WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 +E ++PF + P E+ ++ IQ+ ++ + +V Sbjct: 224 DENAGDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPM 283 Query: 217 GSIIDPESVLNPQFG 231 GS +D + + N + G Sbjct: 284 GSSVDEDEITNEEGG 298 >gi|153212119|ref|ZP_01947936.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] gi|124116915|gb|EAY35735.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] Length = 740 Score = 52.4 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 57/318 (17%), Positives = 107/318 (33%), Gaps = 48/318 (15%) Query: 25 EDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM 84 + G + + I R S AV P +F+ + I+ S + YLTR L+ Sbjct: 197 DQTGIEQWEAVIERSAS---PSARAVMPWDFVPDMSATSIDDSEFTFERSYLTRKKLLKT 253 Query: 85 -----GY----DRESINNLPIISSQNIENTWKFPKNQYSDKAL------EMIEYYELYVT 129 GY RE P S E+ + L E +E + Sbjct: 254 MTEQAGYVAKNVRELAEKEPRDSHALTEDVLGTINQIRALNGLQPTYKDRRYEIWEYHGP 313 Query: 130 IDYD-------------GDGIAELRRVIMAGGTG-----KDNILCNEEWNELPFTCLRAM 171 I + +E+ VI+ G G N EEW P++ A Sbjct: 314 IPREVLQEAGLLTEEEFESTPSEVDGVIVMSGCGLILKAGINPFDTEEW---PYSVYCAE 370 Query: 172 RAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFG 231 C G + + Q I R +DN Q +V + +++ ++ + Sbjct: 371 EDVSCIFGYGIPHLCSDAQSILNTAWRAMIDNGVATVGDQIVVNQSALMPADNDWSFSPL 430 Query: 232 KPIR------VAAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSGFSPE 284 K + V+A + + G+ S+ + + +++ + + +G+ IS G + Sbjct: 431 KVWKTTDKASVSAQFEAQKAFGVFSLQNRQAEYANIISMAKAFMDEESGLPMISQGEQGQ 490 Query: 285 ILQNMTATATSLIEQSGV 302 + T S++ + Sbjct: 491 VTP--TLGGMSMLMNAAN 506 >gi|330958837|gb|EGH59097.1| hypothetical genomic island protein [Pseudomonas syringae pv. maculicola str. ES4326] Length = 699 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 78/229 (34%), Gaps = 22/229 (9%) Query: 132 YDGDGIAELRRVIMAGG--TGKDNILCNEEWNELPFTCLRAMRAPHCF---IGESLAASI 186 + GI RR+ + K+ +L + + PF + F L Sbjct: 282 IENGGIQSKRRIKRVRWLVSTKETVLHD---DWSPFNHFTVVPFFPTFRRGHTRGLVDDA 338 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK------PIRVAAGM 240 I Q++ + Q L L I G++ + G I+ + Sbjct: 339 IGPQQLLNKAMSQYLHVLNTSANSGWITVAGTLANMRDEELANRGSETGLHLMIKSKTPV 398 Query: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 + R I + ++ L TGI++ SG + + A + + Sbjct: 399 EDRP-QKIQPNQVPTGIDRLIDRAGALLEQSTGINEAMSGNQGNEV---SGIAIQTRQFA 454 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 Q+ + + LA+ ++L +L +I D+ R++R+ DPR Sbjct: 455 AQQQLAVPLDNLARTRQMLATRMLEMIQVFYDQPRIIRIT----ETDPR 499 >gi|196232183|ref|ZP_03131038.1| hypothetical protein CfE428DRAFT_4204 [Chthoniobacter flavus Ellin428] gi|196223905|gb|EDY18420.1| hypothetical protein CfE428DRAFT_4204 [Chthoniobacter flavus Ellin428] Length = 899 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 56/344 (16%), Positives = 111/344 (32%), Gaps = 54/344 (15%) Query: 36 IRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSD--LISMGYDRESIN- 92 I Q + + ++FL + SP+ +Y D L + G+ + Sbjct: 389 IGTDTIQDCLVGAGLQVEDFLYDIKQPKLMDSPLQVH-VYDDELDRVLAAYGHSSDPAEK 447 Query: 93 --------NLPIISSQNIENTWKFPKNQYSD-------KALEMIEYYELYVTIDYDGDGI 137 N P + ++ PK ++ + +E Y + D Sbjct: 448 YAPLKAGLNFPAPEEDSPKSGASMPKREHGEYYNTQGSSIRRKCNVHECYYRVRLHPDDA 507 Query: 138 AELRRVIMAGGTGKDNI----LCNEEWNELPFTCLRAM-RAPHCFIGESLAASIIEIQKI 192 E ++ + + L N PF LR + P G + E Sbjct: 508 NETWLFLVLDYARQVPLHAEYLGNMRMKSPPFRILRGLESVPSRAYGVGIYQKFHE---- 563 Query: 193 KTVLLRQTLDNLYWQNQPQTIVQEGSIID---PESVLNPQFGKPIRVAAGMDIRSVLGIH 249 R + YW N+ + + + D ++ + Q + R+ R ++ Sbjct: 564 -----RNLAVD-YWFNRAKLKTTKNASADFMKVDAFIEAQDAQEFRMGGTEIYRVKSELN 617 Query: 250 SVPMIEKS---------------FSMLHYLDQELVDRTGISDISSGFS--PEILQNMTAT 292 P+ + + M+ + Q GI+DI +I ++ TAT Sbjct: 618 DRPLGPSNPVIWRENLHDQNQLEWEMIQKIIQTGQIEFGIADIGQLQDEGAQIAKDATAT 677 Query: 293 ATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 AT +E++G +A+ L L + LI+++ D + Sbjct: 678 ATRNVERTGNTLQRATENMMAEDLTGLCELAIDLILENADLETL 721 >gi|326203482|ref|ZP_08193346.1| hypothetical protein Cpap_1526 [Clostridium papyrosolvens DSM 2782] gi|325986302|gb|EGD47134.1| hypothetical protein Cpap_1526 [Clostridium papyrosolvens DSM 2782] Length = 660 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 73/211 (34%), Gaps = 18/211 (8%) Query: 143 VIMAGGTGKDNILCNEEWN---------ELPFTCLRAMRAPHCFIGESLAASIIEIQKIK 193 +IMAG DN+L E+ P + P F S+ +I IQ+ Sbjct: 293 IIMAG----DNLLHYGEFIYRVGNDGKYGFPLVMQVCVETPGRFWPVSIIERLIPIQRSF 348 Query: 194 TVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNP--QFGKPIRVAAGMDIRSVLGIHSV 251 L + D L + V++ +D + + GK + G + I + Sbjct: 349 NALKNRKKDILNRKAIGNWAVEDDGNVDVDDLEEEGFYPGKIHFYSRG--GKPPQEIQNR 406 Query: 252 PMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT 311 I L E +G+S +S P N AT I++S ++ L Sbjct: 407 SSITDFDVEEQRLLDEFTTISGVSPFASQSLPPTGSNSGATLEK-IKESDDTRIGLTAEN 465 Query: 312 LAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 + ++ LR+ Q R++R + Sbjct: 466 INIAAIASYKIDLRMYRQFAKTPRLLRHVGK 496 >gi|319776214|ref|YP_004138702.1| hypothetical protein HICON_18250 [Haemophilus influenzae F3047] gi|317450805|emb|CBY87027.1| Putative uncharacterized protein [Haemophilus influenzae F3047] Length = 731 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 52/342 (15%), Positives = 113/342 (33%), Gaps = 57/342 (16%) Query: 10 LIKDSDVEVLEHS--HREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKS 67 +++ V+V+E ++ G V ++ + V V P +F+ + ++ Sbjct: 163 ILRAPVVDVVESKAWKQDSLGNWVGEIVNKTI-----PAVRLVLPWDFVPDMTAPTLKDC 217 Query: 68 PIVGRKLYLTRSDLISM----GYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEY 123 V + ++T+ L ++ Y +ES+ L + T + Y D + Sbjct: 218 QFVFERSHVTKKQLQALAKNPYYLKESVLELCELDG-GDTRTASNDMDGYVDTLRTLSGL 276 Query: 124 --------YELYV---------------------TIDYDGDGIA-----ELRRVIMAGGT 149 YEL+ ++ D + E+ VI+ G Sbjct: 277 ETQSKDNRYELWTYHGGIPLNVLSGANELLGEDNKLNIPDDEESRAANLEIEGVIVMAGN 336 Query: 150 GKDNILCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQ 207 GK + + E P++ C G + + Q+I R +DN Sbjct: 337 GKILSVNLNPLDTAEFPYSVYTCEPDVCCLFGFGIPYLCRDAQEILNTAWRGMIDNGILG 396 Query: 208 NQPQTIVQEGSIIDPESVLNPQFGKPIR------VAAGMDIRSVLGIHSVPMIEKSFSML 261 PQ +V + + K + V A + + GI + ++ + + Sbjct: 397 IGPQAVVNSSVLTPVDGNWELAPYKLWKTNDRATVNAQFEAQRAFGIFDIGSRQQELANI 456 Query: 262 HYL-DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGV 302 L + + +G+ I+ G ++ T S++ + Sbjct: 457 IQLSKSFMDEESGLPMIAQGEQGQVTP--TLGGMSMLMNAAN 496 >gi|148747833|ref|YP_001285799.1| portal protein [Phormidium phage Pf-WMP3] gi|146230066|gb|ABQ12474.1| portal protein [Phormidium phage Pf-WMP3] Length = 651 Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 48/324 (14%), Positives = 113/324 (34%), Gaps = 17/324 (5%) Query: 29 EKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSD---LISMG 85 E ++ + + + + + P+ D + + RKL T++D L+S G Sbjct: 181 EPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFI-RKLTKTKADILNLLSEG 239 Query: 86 Y-----DRESINNLPIISSQNIENTWKFPKNQYSD--KALEMIEYYELYVTIDYDGDGIA 138 Y + + + +S ++ + + + +E E + D + Sbjct: 240 YYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYW--GDIHLENKT 297 Query: 139 ELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLR 198 V+ G N W PF + + + ++ Sbjct: 298 YHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITN 357 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSF 258 Q LDNL ++ ++ PE V + GK V+ D++ + + ++ Sbjct: 358 QRLDNLELAIDQMYTLRSDGLLQPEDVYT-EPGKVFLVSDHGDLQPL--ANQSSNFSITY 414 Query: 259 SMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLE 317 +L+ + G + + + +TA + + ++G ++ I + + + L Sbjct: 415 QESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLL 474 Query: 318 ILFRGLLRLIIQHQDKVRMVRLRD 341 +L ++ L+ Q D+ MVR+ Sbjct: 475 VLLEKVMHLVQQFTDQPGMVRVAG 498 >gi|225155390|ref|ZP_03723882.1| hypothetical protein ObacDRAFT_9438 [Opitutaceae bacterium TAV2] gi|224803846|gb|EEG22077.1| hypothetical protein ObacDRAFT_9438 [Opitutaceae bacterium TAV2] Length = 672 Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/330 (12%), Positives = 94/330 (28%), Gaps = 29/330 (8%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLI------SMGYDRESIN 92 + S+ ++ ++A++ + ++ + + + ++ + D ++IN Sbjct: 126 ETSRKRLKIEAINRYDVIVPNWTGRLADCDWIVHVQRFSKHAFRRLVKRMAWTIDDDTIN 185 Query: 93 NLPIISSQNIENTWKFPKNQ-----YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMA- 146 L + N S + I +E+Y D DG I + + Sbjct: 186 ALAGQDATNTGAASAEQSKFQRQGITSPSKDDEIVLWEVYSRND-DGAWIIKTYSPVRPE 244 Query: 147 -------GGTGKDNILCNEEWNELPFTCLRAMRAPHCFI-GESLAASIIEIQKIKTVLLR 198 G + + PF + + + + + Sbjct: 245 QVLRPDFGLPYNQGVFADS-LPPPPFFEISCELKDRGYYDSRGIVKRVAPFEASLCKDWN 303 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSF 258 D + P S + S L Q G+ + + + + + Sbjct: 304 TVKDYQTLTSTPILTASARSDVGNNSTLRFQPGQVLPF-------PLSAVQMPTLPVDTQ 356 Query: 259 SMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEI 318 + Q G+ D +G + TA SLI V++ R + L Sbjct: 357 QGMLGTRQTAEQLVGVPDFGTGSQQPSGERKTAKEVSLIANVMGQSVDMRARIFRKELAH 416 Query: 319 LFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + ++ Q+ + + D + P Sbjct: 417 GLAIMWAILSQYAREELDYFVLDNLIQIPP 446 >gi|281357154|ref|ZP_06243643.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] gi|281316185|gb|EFB00210.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] Length = 752 Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/157 (23%), Positives = 65/157 (41%), Gaps = 7/157 (4%) Query: 176 CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ-EGSIIDPESVLNPQFGKPI 234 GE +A + +Q+ L+R +NL PQ I+ + + P L K Sbjct: 432 SIWGEGIADLLHHVQRSVNSLMRSRNNNLALAGAPQVIINTDAVRLKPGEPLQITPFKQW 491 Query: 235 RVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELV--DR-TGISDISSGFSPEILQNM-- 289 V+ + + + + S S+ L++ELV DR +GI + S G S Sbjct: 492 FVSGSGYYGAQKPFELMQIPDVSDSLSRELEKELVFADRISGIPEYSQGVSKGAENGAAG 551 Query: 290 TATATSLIEQSGVGQVELIVRTLAQGLEI-LFRGLLR 325 TA+ S++ + Q++ + + +GL L R L Sbjct: 552 TASGLSMLLDAASNQIKDPINNIDEGLYEPLIRDLYY 588 >gi|239907145|ref|YP_002953886.1| hypothetical protein DMR_25090 [Desulfovibrio magneticus RS-1] gi|239797011|dbj|BAH76000.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 682 Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/304 (12%), Positives = 88/304 (28%), Gaps = 26/304 (8%) Query: 42 QGKVCVDAVSPDEFLIHPDS-VDIEKSPIVGRKLYLTRSDLISM----GYDRESIN---- 92 + + VSP F + + + + D++ + G+D + + Sbjct: 208 ERRPYYRRVSPWSFYWDQSANRRMGDCRYGYEEYRMVYGDVLELAGRTGFDGDVVRAYLA 267 Query: 93 --NLPIISSQNIENTWKFPKNQYSDKALEMIE-YYELY--VTID-YDGDGIAELRRVIMA 146 + + E+ + + L+ E Y + D + G+ + A Sbjct: 268 EKRDGDATEYDFESQLRSINGGTPEPQLQGRWRVLERYGWLRGDELEECGVDLGNDPVQA 327 Query: 147 GGTGKDNILCNEEWN---------ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLL 197 +L + E PF R G + + Q ++ Sbjct: 328 DYFCNVWMLGGKIIKAVRAPIRGVEFPFQIFPMFRDDSSLCGLGVTGVYRDAQSAINAVV 387 Query: 198 RQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDI-RSVLGIHSVPMIEK 256 R +DN P V ++ N + G ++ G D+ +++ + Sbjct: 388 RAMMDNARMSLGPIGGVNVPALQQTLDADNIRGGTWLKFDTGEDMSKAITFWQASSHTSD 447 Query: 257 SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGL 316 ++ Y D + T + G T S++ + + +V+ + Sbjct: 448 YLALAKYFDDMGDELT-VPRWVHGDGNVSDAARTLGGLSMLMNAMSINLAEMVKIFDDEV 506 Query: 317 EILF 320 F Sbjct: 507 TSQF 510 >gi|225158777|ref|ZP_03725094.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] gi|224802612|gb|EEG20867.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] Length = 562 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 39/303 (12%), Positives = 92/303 (30%), Gaps = 34/303 (11%) Query: 38 RKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPII 97 R G + V F+I D+ + + + + Sbjct: 149 RGQRGGLLNFTNVPVGTFVIEEDAEGLVDTVFREFR------------FTARQCAQKWGE 196 Query: 98 SSQNIENTWKFPKNQYSDKALEMIEYYELYVTID-YDGDGIAELRRVIMAG-GTGKDNIL 155 + SD+ + +Y D G GI + R + +++ Sbjct: 197 DKLSKPMLDALNSKTASDRDKRFQIIHAVYPRRDGKQGPGIGKKRPIASVYVDKQAIHVI 256 Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 + E+P R +R + G ++ K+ + R L +L Q P + Sbjct: 257 EEGGFYEMPIAVARLLRGNNEIYGRGPGDQVMPEIKLVNRMERDLLLSLEQQVNPPWLAP 316 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 + S P+ + G + ++ P + + L D+ L D+ + Sbjct: 317 QDSSWRPD----NRPGGVF------YWDASNP-NNKPERLRDTARLDIGDKVLNDKREVI 365 Query: 276 DIS-------SGFSPEILQN-MTATATSLIEQSGVGQVE-LIVRTLAQGLEILFRGLLRL 326 + +P+ ++ TA + + Q + + R + L + + + Sbjct: 366 RRAWFVDMFKMLSNPDAMKRDKTAFEVAQLMQEKLVLFHPMFARITQEKLNPVLERVFNI 425 Query: 327 IIQ 329 +++ Sbjct: 426 LMR 428 >gi|288957023|ref|YP_003447364.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] gi|288909331|dbj|BAI70820.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] Length = 534 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 45/273 (16%), Positives = 85/273 (31%), Gaps = 30/273 (10%) Query: 41 SQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQ 100 + + AV + ++ + R+ T + + + P Sbjct: 161 AVSSLRFTAVPLADAVLEEGPDGRLDATF--RRSEATLAQI---------LQRFPGAG-- 207 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 + + ++ E V DG A V++ G + L + Sbjct: 208 ----LPDELRRRAAEDPDHRFPLVEAVVP-----DGAAYRWGVVLDSGLADPSWLAQGRF 258 Query: 161 NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 + PF R ++AP G S + K ++ L N + ++ Sbjct: 259 AQSPFVNFRWLKAPGETYGRSPVMKALPDIKTANKVVELVLKNASIAVTGIWQADDDGVL 318 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE-----LVDRTGIS 275 +P ++ G I A G + + + + S +L L LVDR G Sbjct: 319 NPSTI-RLVPGTIIPKAVGSAG--LTPLANPGRFDVSQLVLDDLRGRIRHALLVDRLGPV 375 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELI 308 D + + E+L+ A L G Q EL+ Sbjct: 376 DSARMTATEVLERSVEMARLLGATYGRLQAELM 408 >gi|295425358|ref|ZP_06818061.1| leucine--tRNA ligase [Lactobacillus amylolyticus DSM 11664] gi|295065134|gb|EFG56039.1| leucine--tRNA ligase [Lactobacillus amylolyticus DSM 11664] Length = 804 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFS-MLHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + M++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLMVNDLDLKAIPQEKIVDENDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYNETVKKVTEDFDALHFNTAISQMMVFANAAQKAKTIPREYAEGFVKLLAPIAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + + +W ++DP Sbjct: 715 -----EIWSVFGHDESITYAKWPTYDP 736 >gi|218781805|ref|YP_002433123.1| hypothetical protein Dalk_3969 [Desulfatibacillum alkenivorans AK-01] gi|218763189|gb|ACL05655.1| hypothetical protein Dalk_3969 [Desulfatibacillum alkenivorans AK-01] Length = 643 Score = 47.4 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 47/350 (13%), Positives = 116/350 (33%), Gaps = 43/350 (12%) Query: 34 LRIRRKYSQGKVCVDAVSPDEFLIHP--DSVDIEKSPIVGRKLYLTRSDLISMGY--DRE 89 + R G + + + FL+ P D+ + R+ LT+ + S+ R Sbjct: 132 VDYSRDPLNGDIKIKRAPHNRFLLDPAFSERDLSDCNYICRRELLTKDAVRSILPAGKRH 191 Query: 90 SINNL-PIISSQNIENTWKFPKNQYSDKALE--MIEYYEL---YVTID--------YDGD 135 I+N+ P + +N D L + +D + GD Sbjct: 192 LIDNISPRGVADGKYEYGALARNPAGDNLLRYDEFWTRTHKPLTILMDPAAGLSLPWKGD 251 Query: 136 GIA---------------ELRRVIMAGGTGKDNILCNEE----WNELPFTCLRAMRAPH- 175 +RR + +D ++ + PF + P Sbjct: 252 AQTLERLLSAHPGLKIFETMRRSVELAVFVEDQLIHQGPEPGGLEDFPFVPVLGFFEPEY 311 Query: 176 ---CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 + + + + + Q + LD + Q +E S+++PE++ G+ Sbjct: 312 DQASWKLQGIVRCMRDPQTEVNKRRSKMLDIIDSQISTGWKARENSVVNPEALYQSGQGQ 371 Query: 233 PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTAT 292 + + D+ + + + LD+++++ G + G + + Sbjct: 372 VVWMKG--DMDQAQRLLPPDIPAGLMQLSEILDKDIMEIPGANSELFGMAESDHMQIAGI 429 Query: 293 ATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 + + +G+ ++ I +++ + L++LI + ++ R+ Q Sbjct: 430 LAKMRQAAGLTILQDIFDNYRLAKKLVGQKLIKLIQANYTPQKVSRILGQ 479 >gi|209544596|ref|YP_002276825.1| hypothetical protein Gdia_2465 [Gluconacetobacter diazotrophicus PAl 5] gi|209532273|gb|ACI52210.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 730 Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 52/319 (16%), Positives = 96/319 (30%), Gaps = 34/319 (10%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRS-DLISMGYDRESINNLPIISSQNIENTW 106 + V P I+ + + +T DL Y E + + + Sbjct: 235 ETVDPLRLCINYKAKSFATA------ARMTEEIDL----YPWEIEERIRAGLFLDEDYGT 284 Query: 107 KFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGK--------------- 151 + A + + E + D DGDG AE V +A +G+ Sbjct: 285 NHDDGSQDEDA--PVTFLEQHRRWDLDGDGYAEPYIVTIARDSGQLARIVAGFDADGVMF 342 Query: 152 -DNILCNEEWNELPFTC-LRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNLYWQN 208 + +P+ + + +P + + + L Q D + N Sbjct: 343 DPVTHRIRKIEAVPYYTRFQFIPSPQSAIYAMGFGSLLYPLNGAINTSLNQMFDAGHLAN 402 Query: 209 QPQTIVQEGSIIDPESV-LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE 267 + G ++ SV K + +++ + F +L YL + Sbjct: 403 AGGGFIGSGMSLNTGSVRFQVGEYKVVNTPGATLRENMVPLQFPGPSPALFQLLQYLVEA 462 Query: 268 LVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLI 327 + I DI SG P N + Q G+ I + + + L F L RL Sbjct: 463 GREIASIKDILSGAMPGG--NTPGILGLAVIQQGMKVFSAIFKRVHRALGAEFDKLYRLN 520 Query: 328 IQHQDKVRMVRLRDQWVSF 346 + RL +Q+ Sbjct: 521 RLYLPDDAGYRLGEQYFEV 539 >gi|225155663|ref|ZP_03724152.1| hypothetical protein ObacDRAFT_9274 [Opitutaceae bacterium TAV2] gi|224803636|gb|EEG21870.1| hypothetical protein ObacDRAFT_9274 [Opitutaceae bacterium TAV2] Length = 657 Score = 46.2 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 44/298 (14%), Positives = 94/298 (31%), Gaps = 42/298 (14%) Query: 46 CVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRES--INNLPIISSQNIE 103 + V D FL + + + L++ YD++ I ++ I + Sbjct: 248 RSEIVPSDRFLCPVNVASPDDAK------------LVAELYDKDIAWIEDMWI---ERPW 292 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMA---------GGTGKDNI 154 W+ K +++ + E D D LR++I G + I Sbjct: 293 AIWEEVKGEFTQSGADEKTEGESKAKEDATHDDKESLRKIIECWGRRDVLGLEGPQEFVI 352 Query: 155 LCNEEWNE---------------LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQ 199 +E+ PFT + R + G+S+ + + Q+ Sbjct: 353 FIDEDSERAVFYEFTAKVCPDFKRPFTTIAVGRTRRRWWGKSIPEKVAQYQEKIDENFNG 412 Query: 200 TLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIE-KSF 258 P V + ++ E L K + + + +P + K+ Sbjct: 413 EAYRNLMNANPLKGVNPDATVEEEEDLVFDPEKVYHLKLNKKMEDFVSFAKLPDADFKTR 472 Query: 259 SMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGL 316 + ++ + ISD+ +G + +N TAT + ++ R + +G Sbjct: 473 DIAQFVFWFVQRWLHISDVGTGDYEALPENNTATGIEINREASQSTSRRWNRRINEGY 530 >gi|239787361|emb|CAX83837.1| Head-to-tail joining protein [uncultured bacterium] Length = 524 Score = 45.8 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 84/273 (30%), Gaps = 29/273 (10%) Query: 41 SQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQ 100 S A+ + + + R +T S I + + + S+ Sbjct: 151 SASAFRFTAIPLAQLALEESVEGRLDTTF--RSSEMTISA-IRERFPKAQLPESMGRKSK 207 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 + +F + Y+ + DG+G TG L + Sbjct: 208 -DDADARFKVVEAVLPERHGYAYHAI-----LDGEG------------TGGAETLAEGRF 249 Query: 161 NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 PF R ++AP G S + K ++ L N + ++ Sbjct: 250 EMSPFINFRWLKAPGEVYGRSPVMKSLPDIKTANKVVELVLKNATIAVTGIWQADDDGVL 309 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE-----LVDRTGIS 275 +P ++ G I A G + + + + S ML L Q L DR G Sbjct: 310 NPANI-KLVPGTIIPKAVGSAG--LTPLETPGRFDISQLMLTDLRQRISHALLADRLGQI 366 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELI 308 D + + E+L+ A L G Q EL+ Sbjct: 367 DAPNMTATEVLERSAEMARLLGATYGRLQSELL 399 >gi|218782387|ref|YP_002433705.1| hypothetical protein Dalk_4559 [Desulfatibacillum alkenivorans AK-01] gi|218763771|gb|ACL06237.1| hypothetical protein Dalk_4559 [Desulfatibacillum alkenivorans AK-01] Length = 704 Score = 45.4 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 66/183 (36%), Gaps = 15/183 (8%) Query: 166 TCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESV 225 R + G + + + Q+ Q L+ L QP T +++G++ D + Sbjct: 340 ANTDPSRRSNNHFG--VIRLMKDPQREINKRWSQALNLLNNMVQPGTDIEDGAVPDIDQY 397 Query: 226 LNPQ--FGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSP 283 + G V++G + S P + + + Q+ I +G +P Sbjct: 398 SEARKTPGGVGIVSSGALRDGKIKERSAPQFPSAPMQMEQMSQD------IIRKITGINP 451 Query: 284 EILQNMTATATSLIEQSGVGQVELI--VRTLAQG---LEILFRGLLRLIIQHQDKVRMVR 338 ++L + + + LI + + +F+ ++ +I ++ +++R Sbjct: 452 DLLGQDSGRQEPGVVVQTRQRQGLILLQKLFKEHKRVRREIFKRVIAIISKYMPDGQILR 511 Query: 339 LRD 341 + Sbjct: 512 ILG 514 >gi|256843734|ref|ZP_05549222.1| leucyl-tRNA synthetase [Lactobacillus crispatus 125-2-CHN] gi|256615154|gb|EEU20355.1| leucyl-tRNA synthetase [Lactobacillus crispatus 125-2-CHN] Length = 804 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSGF-- 281 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDENDGTLD 654 Query: 282 ------------SPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|83313332|ref|YP_423596.1| hypothetical protein amb4233 [Magnetospirillum magneticum AMB-1] gi|82948173|dbj|BAE53037.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 545 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 57/167 (34%), Gaps = 8/167 (4%) Query: 147 GGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYW 206 G D +L +++ PF R ++AP G S + K ++ L N Sbjct: 254 DDDGSDLVLGRGQFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNATI 313 Query: 207 QNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ 266 + +++P ++ G I A G + + + + S +L L Sbjct: 314 AVTGIWQADDDGVLNPANI-KLVPGTIIPKAVGSAG--LQPLTAPGRFDTSQLVLDDLRG 370 Query: 267 ELVDRTGISDISSGFSP-----EILQNMTATATSLIEQSGVGQVELI 308 + +S SP E+LQ A L G Q EL+ Sbjct: 371 RIRHALMGDKLSQPASPALTATEVLQRADDMARLLGATYGRLQSELL 417 >gi|209966578|ref|YP_002299493.1| hypothetical protein RC1_3320 [Rhodospirillum centenum SW] gi|209960044|gb|ACJ00681.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 521 Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 56/175 (32%), Gaps = 18/175 (10%) Query: 152 DNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 +L + E PF R M+AP G S + + ++ L N Sbjct: 252 PVLLAEGRFAEPPFIAFRWMKAPGEVYGRSPVMKALPDIRTANKVVELVLKNASVAVTGI 311 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDI-RSVLGIHSVPM-IEKSFSMLHYLDQELV 269 + +++P IR+ G I ++V P+ F + + +L Sbjct: 312 WQADDDGVLNP---------GTIRLVPGAIIPKAVGSAGLTPLASPGRFDVSQLVLDDLR 362 Query: 270 DRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 + ++ P MTAT +E+S R L L LL Sbjct: 363 AHIRHALLADRLGPVQGPRMTATEV--LERSAEM-----ARMLGATYGRLQSELL 410 >gi|58337877|ref|YP_194462.1| leucyl-tRNA synthetase [Lactobacillus acidophilus NCFM] gi|227904527|ref|ZP_04022332.1| leucyl-tRNA synthetase [Lactobacillus acidophilus ATCC 4796] gi|81311412|sp|Q5FIP3|SYL_LACAC RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|58255194|gb|AAV43431.1| leucyl-tRNA synthetase [Lactobacillus acidophilus NCFM] gi|227867736|gb|EEJ75157.1| leucyl-tRNA synthetase [Lactobacillus acidophilus ATCC 4796] Length = 804 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDENDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESISYAEWSTYDP 736 >gi|23015763|ref|ZP_00055531.1| hypothetical protein Magn03010200 [Magnetospirillum magnetotacticum MS-1] Length = 543 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 61/180 (33%), Gaps = 8/180 (4%) Query: 147 GGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYW 206 D +L ++ PF R ++AP G S + K ++ L N Sbjct: 254 DDESSDVVLGRGSFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNATI 313 Query: 207 QNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ 266 + +++P ++ G I A G + + + + S +L L Sbjct: 314 AVTGIWQADDDGVLNPANI-KLVPGTIIPKAVGSAG--LQPLTAPGRFDTSQLVLDDLRG 370 Query: 267 ELVDRTGISDISSGFSP-----EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFR 321 + +S SP E+LQ A L G Q EL+ + + + IL R Sbjct: 371 RIRHALMGDKLSQPASPSLTATEVLQRSDDMARLLGATYGRLQSELLTPLIMRAIHILRR 430 >gi|227876926|ref|ZP_03995022.1| leucyl-tRNA synthetase [Lactobacillus crispatus JV-V01] gi|256850182|ref|ZP_05555612.1| leucyl-tRNA synthetase [Lactobacillus crispatus MV-1A-US] gi|262047820|ref|ZP_06020770.1| leucyl-tRNA synthetase [Lactobacillus crispatus MV-3A-US] gi|227863483|gb|EEJ70906.1| leucyl-tRNA synthetase [Lactobacillus crispatus JV-V01] gi|256713154|gb|EEU28145.1| leucyl-tRNA synthetase [Lactobacillus crispatus MV-1A-US] gi|260571877|gb|EEX28448.1| leucyl-tRNA synthetase [Lactobacillus crispatus MV-3A-US] Length = 804 Score = 44.3 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|293379839|ref|ZP_06625961.1| leucyl-tRNA synthetase [Lactobacillus crispatus 214-1] gi|290923611|gb|EFE00492.1| leucyl-tRNA synthetase [Lactobacillus crispatus 214-1] Length = 804 Score = 44.3 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|162149432|ref|YP_001603893.1| hypothetical protein GDI_3670 [Gluconacetobacter diazotrophicus PAl 5] gi|161788009|emb|CAP57613.1| hypothetical protein GDI3670 [Gluconacetobacter diazotrophicus PAl 5] Length = 907 Score = 43.9 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 86/304 (28%), Gaps = 49/304 (16%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 + V P I ++ +P ++ I + Y E + + E Sbjct: 213 ETVDPLRLCIDYNAKSFAAAP--------RITEEIDL-YPWEVEEKIRAGLFLDDEYGCN 263 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGK---------------- 151 + D + + E + D DGDG AE V +A +G+ Sbjct: 264 ---HDAGDDEDAPVTFLEQHRRYDLDGDGYAEPYIVTIARDSGRLARIVAGFESEGVIFG 320 Query: 152 ------DNILCNEEWNELPFTCLRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNL 204 I + + PF + +P + + L Q D Sbjct: 321 AADHRIRRIDAVAYYTKFPF-----IPSPDSAIYDIGFGTLLHPLNAAVNTSLNQMFDAA 375 Query: 205 YWQNQPQTIVQEGSIIDPESV-LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHY 263 + N + G ++ SV K + +++ + F +L + Sbjct: 376 HLANAGGGFIGSGMSLNSGSVRFQIGEYKVVNTPGATLRENLVPMQFSGPNPVLFQLLGF 435 Query: 264 LDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQV-----ELIVRTLAQGLEI 318 L + + DI SG P L +V + I R+L Sbjct: 436 LVDAGREIASVKDILSGAMP---GGNVPGVLGLAVIQQGLKVFSAIFKRIHRSLGMEFRK 492 Query: 319 LFRG 322 L+R Sbjct: 493 LYRL 496 >gi|209544682|ref|YP_002276911.1| hypothetical protein Gdia_2553 [Gluconacetobacter diazotrophicus PAl 5] gi|209532359|gb|ACI52296.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 707 Score = 43.1 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 86/304 (28%), Gaps = 49/304 (16%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 + V P I ++ +P ++ I + Y E + + E Sbjct: 202 ETVDPLRLCIDYNAKSFAAAP--------RITEEIDL-YPWEVEEKIRAGLFLDDEYGCN 252 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGK---------------- 151 + D + + E + D DGDG AE V +A +G+ Sbjct: 253 ---HDAGDDEDAPVTFLEQHRRYDLDGDGYAEPYIVTIARDSGRLARIVAGFESEGVIFG 309 Query: 152 ------DNILCNEEWNELPFTCLRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNL 204 I + + PF + +P + + L Q D Sbjct: 310 AADHRIRRIDAVAYYTKFPF-----IPSPDSAIYDIGFGTLLHPLNAAVNTSLNQMFDAA 364 Query: 205 YWQNQPQTIVQEGSIIDPESV-LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHY 263 + N + G ++ SV K + +++ + F +L + Sbjct: 365 HLANAGGGFIGSGMSLNSGSVRFQIGEYKVVNTPGATLRENLVPMQFSGPNPVLFQLLGF 424 Query: 264 LDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQV-----ELIVRTLAQGLEI 318 L + + DI SG P L +V + I R+L Sbjct: 425 LVDAGREIASVKDILSGAMP---GGNVPGVLGLAVIQQGLKVFSAIFKRIHRSLGMEFRK 481 Query: 319 LFRG 322 L+R Sbjct: 482 LYRL 485 >gi|144899435|emb|CAM76299.1| head-to-tail joining protein [Magnetospirillum gryphiswaldense MSR-1] Length = 502 Score = 43.1 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 46/284 (16%), Positives = 87/284 (30%), Gaps = 36/284 (12%) Query: 46 CVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIEN- 104 AV + ++ S R+ LT + L + ++ P + + E+ Sbjct: 160 RFTAVPLAQAVLEEGPDGKLDSSF--RRSELTLAALRQR-FPAAQLD--PSLIRRGEEDP 214 Query: 105 TWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELP 164 +F + Y + D D A +L + + P Sbjct: 215 QARFAVIEAVIPNQRGHYDYAA-ILEDATDDDEA---------------LLAEGRFGQSP 258 Query: 165 FTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPES 224 F R ++AP G S + K ++ L N + +++P + Sbjct: 259 FINFRWLKAPGEIYGRSPVMKALPDIKTANKVVELVLKNATIAVTGIWQADDDGVLNPAN 318 Query: 225 VLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPE 284 + G I A G + + S + S +L L + + +P+ Sbjct: 319 I-KLIPGTIIPKAVGSAG--LQPLESPGRFDISQLVLDDLRGRIRHALLADKLGQADNPK 375 Query: 285 ILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII 328 + TAT ++E+S R L L LL +I Sbjct: 376 M------TATEVLERSADM-----ARLLGATYGRLQSELLTPLI 408 >gi|300362219|ref|ZP_07058396.1| leucine--tRNA ligase [Lactobacillus gasseri JV-V03] gi|300354838|gb|EFJ70709.1| leucine--tRNA ligase [Lactobacillus gasseri JV-V03] Length = 804 Score = 43.1 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVLGHDESITYAKWPTYDP 736 >gi|327184092|gb|AEA32539.1| leucyl-tRNA synthetase [Lactobacillus amylovorus GRL 1118] Length = 804 Score = 43.1 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQIFGHDESITYAKWPTYDP 736 >gi|325957385|ref|YP_004292797.1| leucyl-tRNA synthetase [Lactobacillus acidophilus 30SC] gi|325333950|gb|ADZ07858.1| leucyl-tRNA synthetase [Lactobacillus acidophilus 30SC] Length = 804 Score = 43.1 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|315038959|ref|YP_004032527.1| leucyl-tRNA synthetase [Lactobacillus amylovorus GRL 1112] gi|312277092|gb|ADQ59732.1| leucyl-tRNA synthetase [Lactobacillus amylovorus GRL 1112] Length = 804 Score = 43.1 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|169334552|ref|ZP_02861745.1| hypothetical protein ANASTE_00955 [Anaerofustis stercorihominis DSM 17244] gi|169259269|gb|EDS73235.1| hypothetical protein ANASTE_00955 [Anaerofustis stercorihominis DSM 17244] Length = 648 Score = 43.1 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 57/298 (19%), Positives = 117/298 (39%), Gaps = 32/298 (10%) Query: 35 RIRRKYSQGKVCVDAVSPDEFL-IHPDSVDIEKSPIVGRKLYLTRSDLISM------GYD 87 +I +G + + +SP +F + + DIE ++ ++ +M GY+ Sbjct: 183 KINVAIKEGGINYEIISPFDFFPSNVYAKDIESLDYAIWYKVMSVKEIENMFNITVEGYE 242 Query: 88 RESIN------NLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELR 141 ++ N+ + S+ + ++ D + E+I Y+E G I + Sbjct: 243 NNVVSYSKSKTNVGGLGSKGHGYS---ESSKNIDLSAEVISYFEKPTNRYPKGRYIVCTK 299 Query: 142 R-VIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 V+ G N E ELPF +++ F GES+ +I +Q+ + + Sbjct: 300 DNVLHMGDLPYINAEDGER--ELPFVIQKSLDY-GEFFGESIINRLIPLQRRFNNIKNRK 356 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLN--PQFGKPIRVAAGMDIRSVLGIHSVP------ 252 + L Q ++GSI D + V++ G I G + S L ++P Sbjct: 357 QEYLNRVAIGQITYEKGSI-DEDDVIDMGLAPGAVIPRRQGSEEPSYLRTPALPSTILSD 415 Query: 253 --MIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSL-IEQSGVGQVEL 307 E+ F L + + + +++SG + +LQ+ T +L E ++++ Sbjct: 416 EKATEELFITLSGVSEMSRNSYNPKNVTSGVALSLLQDQDDTRLALNYENMYDTRIKI 473 >gi|238852506|ref|ZP_04642918.1| leucine--tRNA ligase [Lactobacillus gasseri 202-4] gi|238834853|gb|EEQ27078.1| leucine--tRNA ligase [Lactobacillus gasseri 202-4] Length = 804 Score = 42.7 bits (99), Expect = 0.073, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W +DP Sbjct: 715 -----EIWQVFGHDESISYAKWPEYDP 736 >gi|282851118|ref|ZP_06260492.1| leucine--tRNA ligase [Lactobacillus gasseri 224-1] gi|282558070|gb|EFB63658.1| leucine--tRNA ligase [Lactobacillus gasseri 224-1] Length = 804 Score = 42.7 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W +DP Sbjct: 715 -----EIWQVFGHDESISYAKWPEYDP 736 >gi|116629123|ref|YP_814295.1| leucyl-tRNA synthetase [Lactobacillus gasseri ATCC 33323] gi|311111095|ref|ZP_07712492.1| leucine--tRNA ligase [Lactobacillus gasseri MV-22] gi|122273879|sp|Q045L5|SYL_LACGA RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|116094705|gb|ABJ59857.1| leucyl-tRNA synthetase [Lactobacillus gasseri ATCC 33323] gi|311066249|gb|EFQ46589.1| leucine--tRNA ligase [Lactobacillus gasseri MV-22] Length = 804 Score = 42.7 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W +DP Sbjct: 715 -----EIWQVFGHDESISYAKWPEYDP 736 >gi|212696605|ref|ZP_03304733.1| hypothetical protein ANHYDRO_01145 [Anaerococcus hydrogenalis DSM 7454] gi|212676336|gb|EEB35943.1| hypothetical protein ANHYDRO_01145 [Anaerococcus hydrogenalis DSM 7454] Length = 2189 Score = 42.7 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 46/107 (42%), Gaps = 1/107 (0%) Query: 29 EKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR 88 + V ++ I + V+A P+ F ++ +S +I++S + + + +L+ + ++ Sbjct: 1578 QDVENVVITKYRLYNNNKVEADMPESFGVNENSSNIKRSGYGVSRQSMDKGELLELPFEN 1637 Query: 89 ESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTID-YDG 134 I + K + ++ L+ YY + +D YDG Sbjct: 1638 GMSPRDSYIVKVTGRVSGKDKSSYQAESVLQTYNYYSGKLFVDRYDG 1684 >gi|268319013|ref|YP_003292669.1| hypothetical protein FI9785_522 [Lactobacillus johnsonii FI9785] gi|262397388|emb|CAX66402.1| leuS [Lactobacillus johnsonii FI9785] Length = 804 Score = 42.7 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 75/205 (36%), Gaps = 40/205 (19%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + +G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSKGNVVNPDEVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FSP- 283 R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 597 RMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQEKIVDENDGELDKV 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQ 331 E L TA + ++ + + ++I R A+G L + +++ Sbjct: 657 YAETVKKVTEDFEALHFNTAISQMMVFMNAAQKAKIIPREYAEGFVQLLAPVAPHMME-- 714 Query: 332 DKVRMVRLRD--------QWVSFDP 348 + + +W +DP Sbjct: 715 ---EIWSVFGHDESIAYAKWPEYDP 736 >gi|227892921|ref|ZP_04010726.1| leucine--tRNA ligase [Lactobacillus ultunensis DSM 16047] gi|227865259|gb|EEJ72680.1| leucine--tRNA ligase [Lactobacillus ultunensis DSM 16047] Length = 804 Score = 42.7 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 74/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDDVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R A+G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFVNAAQKAKSIPREYAEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|312978116|ref|ZP_07789861.1| leucine--tRNA ligase [Lactobacillus crispatus CTV-05] gi|310895091|gb|EFQ44160.1| leucine--tRNA ligase [Lactobacillus crispatus CTV-05] Length = 804 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 76/205 (37%), Gaps = 40/205 (19%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + +G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSKGNVVNPDEVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FSP- 283 R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 597 RMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELDKV 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQ 331 E L TA + ++ + + + I R A+G L + +++ Sbjct: 657 YAETVKKVTEDFESLHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME-- 714 Query: 332 DKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 ---EIWQVFGHDESITYAKWPTYDP 736 >gi|295693491|ref|YP_003602101.1| leucyl-tRNA synthetase [Lactobacillus crispatus ST1] gi|295031597|emb|CBL51076.1| Leucyl-tRNA synthetase [Lactobacillus crispatus ST1] Length = 804 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 76/205 (37%), Gaps = 40/205 (19%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + +G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSKGNVVNPDEVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FSP- 283 R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 597 RMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELDKV 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQ 331 E L TA + ++ + + + I R A+G L + +++ Sbjct: 657 YAETVKKVSEDFESLHFNTAISQMMVFMNAAQKAKTIPREYAEGFVKLLAPVAPHMME-- 714 Query: 332 DKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 ---EIWQVFGHDESITYAKWPTYDP 736 >gi|42518602|ref|NP_964532.1| leucyl-tRNA synthetase [Lactobacillus johnsonii NCC 533] gi|81668177|sp|Q74KS1|SYL_LACJO RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|41582887|gb|AAS08498.1| leucyl-tRNA synthetase [Lactobacillus johnsonii NCC 533] gi|329666886|gb|AEB92834.1| leucyl-tRNA synthetase [Lactobacillus johnsonii DPC 6026] Length = 804 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 74/205 (36%), Gaps = 40/205 (19%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + +G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSKGNVVNPDEVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FSP- 283 R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 597 RMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQEKIVDENDGELDKV 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQ 331 E L TA + ++ + + + I R A+G L + +++ Sbjct: 657 YAETVKKVTEDFEALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVQLLAPVAPHMME-- 714 Query: 332 DKVRMVRLRD--------QWVSFDP 348 + + +W +DP Sbjct: 715 ---EIWSVFGHDESIAYAKWPEYDP 736 >gi|227890482|ref|ZP_04008287.1| leucyl-tRNA synthetase [Lactobacillus johnsonii ATCC 33200] gi|227849051|gb|EEJ59137.1| leucyl-tRNA synthetase [Lactobacillus johnsonii ATCC 33200] Length = 804 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 74/205 (36%), Gaps = 40/205 (19%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + +G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSKGNVVNPDEVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FSP- 283 R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 597 RMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQEKIVDENDGELDKV 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQ 331 E L TA + ++ + + + I R A+G L + +++ Sbjct: 657 YAETVKKVTEDFEALHFNTAISQMMVFMNAAQKAKTIPREYAEGFVQLLAPVAPHMME-- 714 Query: 332 DKVRMVRLRD--------QWVSFDP 348 + + +W +DP Sbjct: 715 ---EIWSVFGHDESIAYAKWPEYDP 736 >gi|237736163|ref|ZP_04566644.1| D-ribose transporter ATP binding protein [Fusobacterium mortiferum ATCC 9817] gi|229421716|gb|EEO36763.1| D-ribose transporter ATP binding protein [Fusobacterium mortiferum ATCC 9817] Length = 498 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 109/294 (37%), Gaps = 38/294 (12%) Query: 14 SDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRK 73 + ++ + E+ +++ R + V+ + + S + + I+G Sbjct: 231 DEEFIIRNMVGRTLDEQFPRVKVERGKEI--LRVENLK--NSYVDGVSFSLHEGEILGIS 286 Query: 74 LYLT--RSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTID 131 + RS+L+ Y ++ + E + A E IE YV+ D Sbjct: 287 GLMGAGRSELVKTIYGHLKKDS-GKVFIDGEEKNIR--------SAKEGIECGIAYVSED 337 Query: 132 YDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQK 191 GDG +++ ++ L L F + H S+ I + + Sbjct: 338 RKGDG------LVLGMSVKENMTLS-----ALNFFS-TLFKLEHKKEEASVEEYIGKF-R 384 Query: 192 IKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSV 251 +KT + Q + NL NQ + + + + +P+ ++ + + + V A +I + Sbjct: 385 VKTPSMEQKIKNLSGGNQQKVAIAKALLTNPKILILDEPTRGVDVGAKKEIYDFI----N 440 Query: 252 PMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQV 305 + K S++ + E+ + G+SD ++ N T L E++ ++ Sbjct: 441 DLKTKGLSII-MISSEMPEIMGLSDRIM-----VIHNGKVTGEFLAEEATQEKI 488 >gi|226227231|ref|YP_002761337.1| hypothetical protein GAU_1825 [Gemmatimonas aurantiaca T-27] gi|226090422|dbj|BAH38867.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 799 Score = 41.6 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 72/192 (37%), Gaps = 7/192 (3%) Query: 152 DNILCNEEWNELPFTCLRAMRAPHCF--IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 D EE +LP R P G S + + + L+ LY N Sbjct: 374 DGEDEREEPLDLPVAQCRFFEDPADQDPYGLSPVEWLAPMDEAVATQTIAWLEYLYRFNH 433 Query: 210 PQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELV 269 P T + GS+I P + N + G PIR A I + P +S +++ + + Sbjct: 434 PNTFLPLGSVIQPGQL-NIRDGTPIRYNAAAGKLEYESIPTFP--SESTALIDKYEAWMR 490 Query: 270 DRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 +G+ + + G + + + I + + + +V + L R L+LI Sbjct: 491 TLSGLENAARGVADPSV--KSGIHAERIIEQALVALTQVVSNVQDFLLRRGRIRLQLIAT 548 Query: 330 HQDKVRMVRLRD 341 H R++R+ Sbjct: 549 HYTAPRLLRING 560 >gi|319783517|ref|YP_004142993.1| hypothetical protein Mesci_3826 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169405|gb|ADV12943.1| hypothetical protein Mesci_3826 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 784 Score = 41.2 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 44/317 (13%), Positives = 98/317 (30%), Gaps = 41/317 (12%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIIS 98 + + ++ +AVS +F H + P + + ++R + +D I Sbjct: 198 RLANERIEFEAVSWRDFR-HGPAKRWSDCPWLSFRFVVSREN-EDTAFDSAMI------G 249 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 Q + K +D I +V TG ++ Sbjct: 250 MQTSDQEKKARGESDNDLTGWEIWCKTS--------------LKVYFVDDTGVILKTVDD 295 Query: 159 EWNELPFTCLRAMRAPHCFIGE-------SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 F + P G S+ + + + + T + ++ + + Sbjct: 296 PLGLTNFFPIATPVQPVELTGRLMPVNPFSIYSKLADELDLTTKRINIITNH--MKVKGW 353 Query: 212 TIVQEGSIIDPESVLNPQF-----GKPIRVAAGMDIRSVL-GIHSVPMI-EKSFSMLHYL 264 G I + + + +F G+ + ++ + ++ Sbjct: 354 FSGDAGDIANMLAADDTEFVPIGNADIWAANGGLAGAVAFWPVEKFILVLRELYNAREQT 413 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 Q + + TGISDI G S N T A + Q G +++ + R + +G LF + Sbjct: 414 KQAIYEITGISDIVRGASK---SNETLGAQQIKTQWGSLRIQKMQRMMERGARDLFVMMS 470 Query: 325 RLIIQHQDKVRMVRLRD 341 +I + ++ Sbjct: 471 EIIPAKFSHETLQQMTG 487 >gi|169795385|ref|YP_001713178.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148312|emb|CAM86177.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 547 Score = 40.8 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 3/131 (2%) Query: 98 SSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAG---GTGKDNI 154 + + D ++++ E T GD + + A + I Sbjct: 192 GENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIKGDRQLMPKEMPFASYHVEVDEKII 251 Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 L +NE PF R + PH G + + K L+R TL + Sbjct: 252 LRETGYNEFPFVIPRFRKIPHSVYGTGQVSIALPDAKTANKLMRDTLRSAEISTLGMYAG 311 Query: 215 QEGSIIDPESV 225 + +P +V Sbjct: 312 VDDGTFNPRTV 322 >gi|296537022|ref|ZP_06899017.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296262651|gb|EFH09281.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 368 Score = 40.0 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 59/184 (32%), Gaps = 8/184 (4%) Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 L + + PF R ++AP G + + ++ L N Sbjct: 158 LAEGRFQDSPFIAFRWLKAPGEAYGRGPVMKALPDIRTANKVVELVLKNASIAATGIWQA 217 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 ++ +++P +V G I A G + L +F + + +L R Sbjct: 218 EDDGVLNPATV-RLVPGAIIPKAPGSSGLTPLAAPG------NFDVSQLVLDDLRGRIRA 270 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQV-ELIVRTLAQGLEILFRGLLRLIIQHQDK 333 + ++ P MTAT + R A+ L L L ++ + + Sbjct: 271 ALLADRLGPPGTAAMTATEVLERSAQTARLLGATYGRLQAELLTPLIGRCLSILRRRGEV 330 Query: 334 VRMV 337 ++ Sbjct: 331 PPLL 334 >gi|228469904|ref|ZP_04054843.1| cytosine-specific methyltransferase NlaX [Porphyromonas uenonis 60-3] gi|228308539|gb|EEK17327.1| cytosine-specific methyltransferase NlaX [Porphyromonas uenonis 60-3] Length = 395 Score = 40.0 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 65/182 (35%), Gaps = 23/182 (12%) Query: 28 GEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYD 87 G ++D ++ ++ + D P I D I +G + R+ L G+ Sbjct: 227 GNTIYDKQMAKRCAIRTKVSDIFEPQS--IERDGFTISDRMWLGHQTRKKRNRLNGKGFG 284 Query: 88 RESINNLPIISSQNIENTWKFPKNQYSDKAL-----EMIEYYELYVTIDYDGDGIAELRR 142 + +S WK D++ M+ E G + R Sbjct: 285 YSKFTGNSVYTSTISARYWKDGSEILIDQSNRGLNPRMLTPVEA---------GRLQGYR 335 Query: 143 VIMAG---GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIE--IQKIKTVLL 197 VI +G N+ NE ++ F + + + + G S+A +I+ Q+I L+ Sbjct: 336 VIGSGWEHTESASNLAYNE--SDPEFRIVVSKKEAYRQFGNSVAIPVIKRLSQEIIKQLI 393 Query: 198 RQ 199 RQ Sbjct: 394 RQ 395 >gi|161507942|ref|YP_001577906.1| leucyl-tRNA synthetase [Lactobacillus helveticus DPC 4571] gi|260103261|ref|ZP_05753498.1| leucine-tRNA ligase [Lactobacillus helveticus DSM 20075] gi|229470715|sp|A8YWU4|SYL_LACH4 RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|160348931|gb|ABX27605.1| Leucyl-tRNA synthetase [Lactobacillus helveticus DPC 4571] gi|260082930|gb|EEW67050.1| leucine-tRNA ligase [Lactobacillus helveticus DSM 20075] gi|328464566|gb|EGF35935.1| leucyl-tRNA synthetase [Lactobacillus helveticus MTCC 5463] Length = 804 Score = 39.7 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDEVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R +G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYVEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|323465976|gb|ADX69663.1| Leucyl-tRNA synthetase [Lactobacillus helveticus H10] Length = 804 Score = 39.7 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 73/207 (35%), Gaps = 38/207 (18%) Query: 175 HCFIGESLAASIIEIQKIKTV--LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 H + ++ + T R L +N + +G++++P+ V++ Sbjct: 535 HLLYARFWHKVLYDLGVVPTKEPFQRLYNQGLILKNHEKMSKSKGNVVNPDEVIDEYGAD 594 Query: 233 PIRV--------AAGMDIRSVLGIHSVPMIEKSFSM-LHYLDQELVDRTGISDISSG-FS 282 +R+ A +D + +++ + + ++ LD + + + I D + G Sbjct: 595 SLRMYEMFMGPLDASIDWDDNGPASTKKFLDRVWRLFVNDLDLKAIPQERIVDKNDGELD 654 Query: 283 P-------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + L TA + ++ + + + I R +G L + +++ Sbjct: 655 KVYAETVKKVTEDFDALHFNTAISQMMVFINAAQKAKTIPREYVEGFVKLLAPVAPHMME 714 Query: 330 HQDKVRMVRLRD--------QWVSFDP 348 + ++ +W ++DP Sbjct: 715 -----EIWQVFGHDESITYAKWPTYDP 736 >gi|145642402|ref|ZP_01797960.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae R3021] gi|145272901|gb|EDK12789.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae 22.4-21] Length = 313 Score = 39.7 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 58/168 (34%), Gaps = 10/168 (5%) Query: 143 VIMAGGTGKDNI-LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTL 201 ++MAG ++ L + E P++ C G + + Q+I R + Sbjct: 27 IVMAGNGKILSVNLNPLDTAEFPYSVYTCEPDVCCLFGFGIPYLCRDAQEILNTAWRGMI 86 Query: 202 DNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV------AAGMDIRSVLGIHSVPMIE 255 DN PQ +V + + K + A + + GI + + Sbjct: 87 DNGILGIGPQAVVNSSVLTPVDGNWELAPYKLWKTNDRATANAQFEAQRAFGIFDIGSRQ 146 Query: 256 KSFSMLHYL-DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGV 302 + + + L + + +G+ I+ G ++ T S++ + Sbjct: 147 QELANIIQLSKSFMDEESGLPMIAQGEQGQVTP--TLGGMSMLMNAAN 192 >gi|254720050|ref|ZP_05181861.1| hypothetical protein Bru83_11004 [Brucella sp. 83/13] gi|265985058|ref|ZP_06097793.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306839398|ref|ZP_07472212.1| Hypothetical protein BROD_2254 [Brucella sp. NF 2653] gi|264663650|gb|EEZ33911.1| conserved hypothetical protein [Brucella sp. 83/13] gi|306405521|gb|EFM61786.1| Hypothetical protein BROD_2254 [Brucella sp. NF 2653] Length = 678 Score = 39.3 bits (90), Expect = 0.82, Method: Composition-based stats. Identities = 40/306 (13%), Positives = 100/306 (32%), Gaps = 49/306 (16%) Query: 60 DSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALE 119 + + P + + Y+ R DL D E ++ + ++ + DK + Sbjct: 230 KAKRWKDVPWIAFRHYMPREDLEDF-RDAELYSSQAQPTDSADDDEDDIQVWEIWDKRTK 288 Query: 120 MIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIG 179 + + ID + +G+ + + F + P G Sbjct: 289 QVWF------IDAN---------------SGRVQKISEDPLGLPGFFPTPEIVQPITLTG 327 Query: 180 E-------SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLN----- 227 ++ + + + T + + L + + GS D + + + Sbjct: 328 NMTPVCPFTVYKKLADELDLCTKRINAIMKGLKVR-----GIILGSAEDIKRLADAGDNE 382 Query: 228 ----PQFGKPIRVAAGMDIRSVLGIH-SVPMIEKSFSMLHYLDQELVDRTGISDISSGFS 282 + ++ A + I ++ ++++ + + + + TGISDI G S Sbjct: 383 LIPVSDVEQIVQTAGLEKAIAWWPIQQAIAVLQQLYQQRGEIKAAIYEITGISDIVRGAS 442 Query: 283 PEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 N T A + Q G +++ + R + + + +F + +II + + Sbjct: 443 N---ANETLGAQQIKTQWGSLRIQKMQRMIERLVRDIFGMMADIIITKFSPETLQEMTG- 498 Query: 343 WVSFDP 348 + P Sbjct: 499 -IEITP 503 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 39.3 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 37/102 (36%), Gaps = 8/102 (7%) Query: 183 AASIIEIQKIKTVLLRQTLDNLYWQNQPQT---IVQEGSIIDPESVLNPQFGKPIRVAAG 239 I ++Q + + LDN N P+ IV ++ + + F + A G Sbjct: 170 TDKIGDLQAAASTAVENLLDNNIDPNNPRVRVAIVPYAEAVNTGGLADSVF---VEQAGG 226 Query: 240 MDIRSVLGIHSVPMIEKSFSML--HYLDQELVDRTGISDISS 279 ++ + P+ S L E D+ G +D SS Sbjct: 227 SNLPPPVPSAGAPIPVGSSVTLRPDKCATERKDKDGYADYSS 268 >gi|117924319|ref|YP_864936.1| hypothetical protein Mmc1_1012 [Magnetococcus sp. MC-1] gi|117608075|gb|ABK43530.1| conserved hypothetical protein [Magnetococcus sp. MC-1] Length = 671 Score = 39.3 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 37/81 (45%), Gaps = 3/81 (3%) Query: 262 HYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFR 321 + + + TGISDI G S TATA S+ Q G +++ + + + LFR Sbjct: 388 ESIKSVIYEITGISDIVRGVSKAS---ETATAQSIKSQWGGRRLQERQSQVQRFVRDLFR 444 Query: 322 GLLRLIIQHQDKVRMVRLRDQ 342 + ++++H + ++ Sbjct: 445 MVGEIMVEHFQPQTIAKMVGA 465 >gi|317152045|ref|YP_004120093.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] gi|316942296|gb|ADU61347.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] Length = 603 Score = 39.3 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 91/228 (39%), Gaps = 15/228 (6%) Query: 107 KFPKNQYSDKALEMIEY-YELYVTIDYD--GDGIAELRRVIMAGGTGKDNILCNEEWNEL 163 + + ++ + +E + ++ D D G G A + TG +++L + E+ Sbjct: 197 GETRRKAEERPDDTVEILHAVFPRTDRDPYGVGAAHFPFASVYVETGAEHVLEESGYLEM 256 Query: 164 PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPE 223 P+ R +A G + + ++ + R L + P +V + + P Sbjct: 257 PYLVPRWAKAAGETYGRGPGQTALSDTRVLNAMARTALMAAEKMSDPPLMVPDDGFLGP- 315 Query: 224 SVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSP 283 ++ G AG R + + + + +M+ +E + R + D + P Sbjct: 316 --VHSGPGGLSYYRAGSPDR-IEPLPVNVDLAATETMMQQ-RRESIRRIFLGDQLTPEGP 371 Query: 284 EILQNMTATATSLIEQSGVGQV--ELIVRTLAQGLEILFRGLLRLIIQ 329 +TAT +LI QS +V ++ R A+ L L R + R++++ Sbjct: 372 ----AVTAT-EALIRQSEKMRVLGPVLGRLQAEFLSPLIRRVFRIMLR 414 >gi|300811526|ref|ZP_07092017.1| leucine--tRNA ligase [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] gi|300497493|gb|EFK32524.1| leucine--tRNA ligase [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] Length = 804 Score = 39.3 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 65/178 (36%), Gaps = 27/178 (15%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSRGNVVNPDDVVDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSG-FSP- 283 R A +D +++ + + ++ LD + + I+D + G Sbjct: 597 RTYEMFMGPLNASIDWDDNGPSGVKKFLDRVWRTFVNDLDLDPIPSEKITDKNDGKLDKI 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 E L TA + ++ + +V+ I R A+G L + +++ Sbjct: 657 YNETVKTVTEHFEELHFNTAISQMMVFMNACQKVDKIPREYAEGFVKLMAPVAPHMME 714 >gi|13186164|emb|CAC33475.1| hypothetical protein [Legionella pneumophila] Length = 519 Score = 38.5 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 69/233 (29%), Gaps = 26/233 (11%) Query: 25 EDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM 84 +D + I + V +AV P + + D+ + R Y R + I Sbjct: 160 QDVLISTGIIAINEGNRKRPVRYEAVPPAQVMFQGDAEGQVDAIF--RDWYQVRIENIKS 217 Query: 85 GYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVI 144 + + + L + K + ++ +E IDY+ E R Sbjct: 218 MWPKAEVAKL-------------------NKKPEDKVDIWEC-AWIDYEA---PEKERYQ 254 Query: 145 MAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNL 204 T ++L + + P+ R R G + S L L Sbjct: 255 YVVMTSSKDVLLEQSNSSWPWVVYRMRRLTGEIRGRGPSLSAYPTAATINQALEDELVAA 314 Query: 205 YWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS 257 +Q P + S + ++ P+ G + V + I+ + Sbjct: 315 AFQANPMYMAASDSAFNQQTF-TPRPGSIVPVQMVQGEWPIKPFEQSGNIQFN 366 >gi|313124236|ref|YP_004034495.1| leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|312280799|gb|ADQ61518.1| Leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ND02] gi|325685746|gb|EGD27823.1| leucine--tRNA ligase [Lactobacillus delbrueckii subsp. lactis DSM 20072] Length = 804 Score = 38.5 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 66/178 (37%), Gaps = 27/178 (15%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSRGNVVNPDDVIDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSG-FSP- 283 R A +D +++ + + ++ LD + + I+D + G Sbjct: 597 RTYEMFMGPLNASIDWDDNGPSGVKKFLDRVWRTFVNDLDLDPIPSEKITDKNDGKLDKI 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 E L+ TA + ++ + +V+ I R A+G L + +++ Sbjct: 657 YNETVKTVTEHFEELRFNTAISQMMVFMNACQKVDKIPREYAEGFVKLMAPVAPHMME 714 >gi|325126199|gb|ADY85529.1| Leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus 2038] Length = 804 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 66/178 (37%), Gaps = 27/178 (15%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSRGNVVNPDDVVDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSG-FSP- 283 R A +D +++ + + ++ LD + + I+D + G Sbjct: 597 RTYEMFMGPLNASIDWDDNGPSGVKKFLDRVWRTFVNDLDLDPIPSEKITDKNDGKLDKI 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 E L+ TA + ++ + +V+ I R A+G L + +++ Sbjct: 657 YNETVKTVTEHFEELRFNTAISQMMVFMNACQKVDKIPREYAEGFVKLMAPVAPHMME 714 >gi|104774389|ref|YP_619369.1| leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] gi|123251849|sp|Q1G971|SYL_LACDA RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|103423470|emb|CAI98369.1| Leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842] Length = 804 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 66/178 (37%), Gaps = 27/178 (15%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSRGNVVNPDDVVDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSG-FSP- 283 R A +D +++ + + ++ LD + + I+D + G Sbjct: 597 RTYEMFMGPLNASIDWDDNGPSGVKKFLDRVWRTFVNDLDLDPIPSEKITDKNDGKLDKI 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 E L+ TA + ++ + +V+ I R A+G L + +++ Sbjct: 657 YNETVKTVTEHFEELRFNTAISQMMVFMNACQKVDKIPREYAEGFVKLMAPVAPHMME 714 >gi|116514487|ref|YP_813393.1| leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365] gi|122274781|sp|Q049B7|SYL_LACDB RecName: Full=Leucyl-tRNA synthetase; AltName: Full=Leucine--tRNA ligase; Short=LeuRS gi|116093802|gb|ABJ58955.1| leucyl-tRNA synthetase [Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365] Length = 804 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 66/178 (37%), Gaps = 27/178 (15%) Query: 176 CFIGESLAAS-IIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 F + L ++ ++ L Q L +N + G++++P+ V++ + Sbjct: 540 RFWHKVLYDLGVVPTKEPFQKLYNQ---GLILKNHEKMSKSRGNVVNPDDVVDEYGADSL 596 Query: 235 RV--------AAGMDIRSVLGIHSVPMIEKSF-SMLHYLDQELVDRTGISDISSG-FSP- 283 R A +D +++ + + ++ LD + + I+D + G Sbjct: 597 RTYEMFMGPLNASIDWDDNGPSGVKKFLDRVWRTFVNDLDLDPIPSEKITDKNDGKLDKI 656 Query: 284 ------------EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 E L+ TA + ++ + +V+ I R A+G L + +++ Sbjct: 657 YNETVKTVTEHFEELRFNTAISQMMVFMNACQKVDKIPREYAEGFVKLMAPVAPHMME 714 >gi|147919964|ref|YP_686282.1| hypothetical protein RCIX1747 [uncultured methanogenic archaeon RC-I] gi|110621678|emb|CAJ36956.1| conserved hypothetical protein [uncultured methanogenic archaeon RC-I] Length = 326 Score = 38.5 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 49/158 (31%), Gaps = 17/158 (10%) Query: 92 NNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAG---- 147 + + IS++ E + ++ + Y+ D D + ++RV+ AG Sbjct: 83 DRIGFISTRIKEIELMTTYDPATNTG--EVAVNICYIPKDRFEDALGAIKRVVNAGYAIS 140 Query: 148 ------GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESL-----AASIIEIQKIKTVL 196 G++ + + + F + ++ + + I+ IQ + Sbjct: 141 PRIKIVDEGEEILENDVPLGSVAFASVCSITLDGVLLKNGIPVNLKYGGILWIQNGEVQR 200 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 +D P ++ + + G + Sbjct: 201 FTDVIDYSGTSINPMSVFTSRKMTSVNKAIETGTGGVL 238 >gi|262043663|ref|ZP_06016772.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039001|gb|EEW40163.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 554 Score = 37.7 bits (86), Expect = 2.3, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 58/171 (33%), Gaps = 10/171 (5%) Query: 164 PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPE 223 P+ R AP G S A ++ K+ + R ++ +P ++ E I+ P Sbjct: 260 PYAISRYYTAPGEVYGRSPAMVVLPDIKLLNEINRAIIEGAQMAVRPPMLLPEDGILQPF 319 Query: 224 SVL--NPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGF 281 ++ FG R + L +++ + ++ Q + D I+ Sbjct: 320 KMMPGALNFGGMNRDGKPL----ALPLNTATDFSVAMTLAEQKRQTINDGFFITLFQILV 375 Query: 282 SPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQD 332 + A + + G R A+ L L L + I +Q+ Sbjct: 376 DNPQMTATEAMLRAQ--EKGQLLAPTAGRIQAEFLGTL--ILREIDIAYQN 422 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.309 0.134 0.337 Lambda K H 0.267 0.0414 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,674,760,390 Number of Sequences: 14124377 Number of extensions: 213239232 Number of successful extensions: 550808 Number of sequences better than 10.0: 118 Number of HSP's better than 10.0 without gapping: 65 Number of HSP's successfully gapped in prelim test: 93 Number of HSP's that attempted gapping in prelim test: 550503 Number of HSP's gapped (non-prelim): 169 length of query: 350 length of database: 4,842,793,630 effective HSP length: 140 effective length of query: 210 effective length of database: 2,865,380,850 effective search space: 601729978500 effective search space used: 601729978500 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 82 (36.2 bits)