BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1142_1 (218 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2] gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus] Length = 809 Score = 240 bits (613), Expect = 8e-62, Method: Composition-based stats. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS Sbjct: 592 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 651 Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS Sbjct: 652 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 711 Query: 121 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA Sbjct: 712 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771 Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 809 >gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 1175 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 48/229 (20%), Positives = 94/229 (41%), Gaps = 19/229 (8%) Query: 1 VQEHARGSVGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGV 58 + E + + ++++ W+T G G++ + + + QF S M + G+ Sbjct: 931 LDEQGMAVIEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGL 989 Query: 59 SSQVYRAKALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPT----EYIKALINGITH- 111 + A L++ + + L +++G +PQ D +DP ++++L+ G Sbjct: 990 KGKAAYAIPLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLP 1049 Query: 112 -YERF--SPFNSSGWD----VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 + ++SG D V GP S L + K F + Sbjct: 1050 VLGDILVAGTDTSGRDANSFVSGPLGSDFTSLLGLTVGNLTQYNEGKDTNFGNEAF--KF 1107 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 V +P QNLWY + A N V + + D + PG R +A +RQ+ +++ Sbjct: 1108 VKGKIPAQNLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 1156 >gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 841 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 49/229 (21%), Positives = 90/229 (39%), Gaps = 19/229 (8%) Query: 1 VQEHARGSVGSTIQDKRWI-TGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGV 58 + E V + +++K I G G++ + R + QF + M + G+ Sbjct: 597 LDEQGLAVVEAGLREKTLINVGARGTITGEIVRGLAQFKSFSAA-FLMRHGSRAFAQEGI 655 Query: 59 SSQVYRAKALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPT----EYIKALINGIT-- 110 + A L + + + L L++G +PQ D +DP +I++ + G Sbjct: 656 KGKAGYAVPLFVTLTLLGGLVVQLKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLS 715 Query: 111 HYERF--SPFNSSGWD----VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 + ++SG D V GP + L + K F + Sbjct: 716 FLGDILVAGTDTSGRDANSFVAGPLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KF 773 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 V +P QNLWY + A N V + + D + PG R +A +RQ+ +++ Sbjct: 774 VKGKIPAQNLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 822 >gi|294843482|ref|ZP_06788165.1| putative phage related protein [Acinetobacter sp. 6014059] Length = 841 Score = 204 bits (519), Expect = 6e-51, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 93/229 (40%), Gaps = 19/229 (8%) Query: 1 VQEHARGSVGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGV 58 + E + + ++++ W+T G G++ + + + QF S M + G+ Sbjct: 597 LDEQGMAVIEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGL 655 Query: 59 SSQVYRAKALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPT----EYIKALINGITH- 111 + A L++ + + L +++G +PQ D +DP ++++L+ G Sbjct: 656 KGKAAYAIPLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLP 715 Query: 112 -YERF--SPFNSSGWD----VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 + ++SG D V GP S L + K F + Sbjct: 716 VLGDILVAGTDTSGRDANSFVSGPLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KF 773 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 V +P QNLWY + A N + + D + PG R +A +RQ+ +++ Sbjct: 774 VKGKIPAQNLWYTKAAINRMFFDEVQDTIAPGYREKALRKAERQQDRER 822 >gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 841 Score = 201 bits (510), Expect = 8e-50, Method: Composition-based stats. Identities = 47/229 (20%), Positives = 88/229 (38%), Gaps = 19/229 (8%) Query: 1 VQEHARGSVGSTIQDKRWI-TGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGV 58 + E + + +++K I G G++ + R + QF + M + G+ Sbjct: 597 LDEQGMAVIEAGLREKTLINVGARGTITGEIFRGIVQFKSFSAA-FLMRHGSRTMAQEGL 655 Query: 59 SSQVYRAKALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPT----EYIKALINGIT-- 110 + A L + + L L++G +PQ D +DP ++++ + G Sbjct: 656 KGKAAYAIPLFVMTTLLGGLVVQLKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLS 715 Query: 111 HYERF--SPFNSSGWD----VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 + ++SG D V GP S L + K F + Sbjct: 716 FLGDILVAGTDTSGRDAHSFVAGPLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAF--QF 773 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 V +P QNLWY + A N V + I D + PG R +A + ++ +++ Sbjct: 774 VKRKIPAQNLWYTKAAINRMVFDEIQDFIAPGYREKALRKAEEKQDRER 822 >gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 825 Score = 195 bits (495), Expect = 4e-48, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 72/212 (33%), Gaps = 24/212 (11%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS M + + Sbjct: 611 VGSGLQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIATF 662 Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119 + + + LI+G+ P+ D +I A + G + + + Sbjct: 663 LASTTMLGALSMQITDLINGRNPKEMTGDHMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 722 Query: 120 SSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176 S +LGP + + + EG +Q G + G +P NLWY Sbjct: 723 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLWY 778 Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + A +H + N + + +PG + E +++ Sbjct: 779 LKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 810 >gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 824 Score = 194 bits (493), Expect = 6e-48, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 72/212 (33%), Gaps = 24/212 (11%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS M + + Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVV-MRHWHRAMGMPSAGGRAAYIATF 661 Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119 + + + LI+G+ P+ D +I A + G + + + Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 120 SSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176 S +LGP + + + EG +Q G + G +P NLWY Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLWY 777 Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + A +H + N + + +PG + E +++ Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 824 Score = 194 bits (492), Expect = 8e-48, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 72/212 (33%), Gaps = 24/212 (11%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS M + + Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVV-MRHWHRAMGMPSAGGRAAYIATF 661 Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119 + + + LI+G+ P+ D +I A + G + + + Sbjct: 662 LASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 120 SSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176 S +LGP + + + EG +Q G + G +P NLWY Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLWY 777 Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + A +H + N + + +PG + E +++ Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605] gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605] Length = 824 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 70/213 (32%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWHRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPF 118 + + + LI+G+ P+ D +I A + G + + + Sbjct: 661 FLASTTMLGALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVVGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans'] gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 824 Score = 181 bits (458), Expect = 7e-44, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 74/215 (34%), Gaps = 24/215 (11%) Query: 6 RGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRA 65 R G+ +Q W L R + F PI+ M ++ + Sbjct: 607 RMVTGAAMQRGDW-------RGELVRSVFLFKSFPIAVM-MRHWSRALNMPSAGGRAAYL 658 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFS 116 A + + + + +I+G+ P+ D ++ A + G + Sbjct: 659 AAFLASTTVLGAMSQQISEVIAGRNPRDITGDKALQFWVNAFLKGGGAGLYGDFLLSDHT 718 Query: 117 PFNSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQN 173 + S +LGP + + EG +Q G + +P QN Sbjct: 719 RYGSGALASMLGPVAGVVDDAIKLLQGIPLNAVEGKPEQTGGDLV----KFAKGMIPGQN 774 Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 LWY + F+H V N + ++ +PG R E +++ Sbjct: 775 LWYTKAVFDHMVFNQLQEIFSPGYLRRMEKRSRKE 809 >gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1] gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252] Length = 824 Score = 174 bits (442), Expect = 5e-42, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 71/213 (33%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ P+ + ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14] Length = 824 Score = 174 bits (442), Expect = 5e-42, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 71/213 (33%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ P+ + ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2] Length = 824 Score = 174 bits (442), Expect = 5e-42, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 71/213 (33%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ P+ + ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str. E2348/69] gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 824 Score = 174 bits (441), Expect = 6e-42, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 71/213 (33%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 VTGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L + SG+ P+ + ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDMASGRNPRDMVGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + G+ EG +Q G + G P N+W Sbjct: 721 GSGALASMLGPVAGLVDDVIKIGQGIPLNAVEGKSEQTGGDLVKLG----KGLTPGANIW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 582 Score = 173 bits (439), Expect = 1e-41, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 70/213 (32%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 367 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 418 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ P+ + ++ AL+ G + + Sbjct: 419 FIASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 478 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S + GP + + + EG +Q G + G +P NLW Sbjct: 479 GSGALASMFGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLW 534 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 535 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 567 >gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] Length = 824 Score = 172 bits (436), Expect = 3e-41, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 70/213 (32%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGIPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFS--DPTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ P+ ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDLASGRNPREMTGGDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v] Length = 824 Score = 171 bits (432), Expect = 7e-41, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 70/213 (32%), Gaps = 24/213 (11%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 GS IQ W L R + F PIS M + + Sbjct: 609 ITGSGIQRGTW-------KGELTRSVFLFKSFPISVV-MRHWSRAMGMPSAGGRAAYIAT 660 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFSPF 118 + + + L L SG+ + + ++ AL+ G + + Sbjct: 661 FIASTTILGALSQQLNDLASGRNHREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRY 720 Query: 119 NSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 S +LGP + + + EG +Q G + G +P NLW Sbjct: 721 GSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLG----KGLMPGANLW 776 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 Y + A +H + N + + +PG + E +++ Sbjct: 777 YLKAALDHMIFNQMQEYFSPGYLRKMEQRSKKE 809 >gi|332160979|ref|YP_004297556.1| hypothetical protein YE105_C1357 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665209|gb|ADZ41853.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862135|emb|CBX72299.1| hypothetical protein YEW_AK02360 [Yersinia enterocolitica W22703] Length = 841 Score = 170 bits (431), Expect = 1e-40, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 79/224 (35%), Gaps = 20/224 (8%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSV-----NNLARLMGQFLVMPISWSRMHLIEIPSSLV 56 E V +++ +T G+ + R QF PI+ M + Sbjct: 603 DEAQMAVVEPGARER--VTLHRGTTRGTWSGEIWRSATQFKSFPIAMV-MRHAHRALAQD 659 Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGI--THYER 114 G + A A++ + L + SG++P+ D + P + A + G Y Sbjct: 660 GA-GKGTYAAAIIAASTLLGGMAIQLNEIASGRDPR-DMTKPEFWGGAFLKGGALGLYGD 717 Query: 115 FSPFN------SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTF 168 F N S + GP + + + A + K A + Sbjct: 718 FLLTNQTQGGNSFIASIGGPLAGDIESVVKMTQGAAFKAIDGKDPH--TAANVVRFIKGH 775 Query: 169 VPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 P NLWYA+ A +H + + I + +PG +R Q++ ++ Sbjct: 776 TPGANLWYAKAALDHMIFHDIQEQFSPGYLSRMRQRAQKEYDQQ 819 >gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] Length = 823 Score = 168 bits (426), Expect = 4e-40, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 75/215 (34%), Gaps = 24/215 (11%) Query: 6 RGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRA 65 R +G+ +Q W + R F PIS + + + +V Sbjct: 605 RMMIGAGLQRGDW-------KGEIVRSFFLFKSFPISVV-VRHWKRALGIQSAGGRVAYL 656 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSD--PTEYIKALINGITHY-------ERFS 116 A + G I + + + SG+ P+ + ++ AL+ G + Sbjct: 657 AAFIAGTTVLGAISQQINDISSGRNPRDMADENWHKFWLNALLKGGGLGLYGDFLLSDHT 716 Query: 117 PFNSSG-WDVLGPWSSQAGKLAIAGKE--AVWDEGTRKQRGKAQAQFGKELVNTFVPFQN 173 + S +LGP + + EG +Q G + V +P QN Sbjct: 717 KYGSDAFASLLGPVAGVVDDAIKLAQGIPLNAVEGKPEQTGGD----TVKFVKGLIPGQN 772 Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 LWY + +H V N + + +PG R E +++ Sbjct: 773 LWYTKAVLDHMVFNQLQEYFSPGYLRRMEKRSKKE 807 >gi|262371858|ref|ZP_06065137.1| predicted protein [Acinetobacter junii SH205] gi|262311883|gb|EEY92968.1| predicted protein [Acinetobacter junii SH205] Length = 841 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 85/217 (39%), Gaps = 14/217 (6%) Query: 2 QEHARGSVGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVS 59 E + + + ++++ I G+ G++ L R + QF P+++ + + + Sbjct: 619 NEESVAIIEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYM-FRMGHRAFAQGDIK 677 Query: 60 SQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERFS- 116 S+V +L+ + L +GK P+ F+ + K+L+ G Sbjct: 678 SRVTFLASLLAYQTLAGALIVQTQNLANGKNPEPVFT-IDFFGKSLLKGGGLSFLGDIMS 736 Query: 117 ----PFNSSGWD-VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 P S D + GP Q+ KL + + K+ + + + +P Sbjct: 737 ALSDPTGRSASDFISGPLLGQSMKLGMLLTGMGNNIIEGKES--TRMMEVANTLKSNIPL 794 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 QNLWY++ + + + + ++++P R + + Sbjct: 795 QNLWYSKLVVDRMLYSKMQNMIDPDYLPRTQQRLENL 831 >gi|226953662|ref|ZP_03824126.1| phage related protein [Acinetobacter sp. ATCC 27244] gi|226835534|gb|EEH67917.1| phage related protein [Acinetobacter sp. ATCC 27244] Length = 842 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 85/217 (39%), Gaps = 14/217 (6%) Query: 2 QEHARGSVGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVS 59 E + + + ++++ I G+ G++ L R + QF P+++ + + + Sbjct: 619 NEESVAIIEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYM-FRIGHRAFAQGDIK 677 Query: 60 SQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERFS- 116 S+V +L+ + L +GK P+ F+ + K+L+ G Sbjct: 678 SRVTFLASLLAYQTLAGALIVQTQNLANGKNPEPVFT-IDFFGKSLLKGGGLSFLGDIMS 736 Query: 117 ----PFNSSGWD-VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 P S D + GP Q+ KL + + K+ + + + +P Sbjct: 737 ALSDPTGRSASDFISGPLLGQSMKLGMLLTGMGNNIIEGKES--TRMMEVANTLKSNIPL 794 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 QNLWY++ + + + + ++++P R + + Sbjct: 795 QNLWYSKLVVDRMLYSKMQNMIDPDYLPRTQQRLENL 831 >gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] Length = 838 Score = 166 bits (420), Expect = 2e-39, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 81/224 (36%), Gaps = 19/224 (8%) Query: 1 VQEHARGSVGSTIQDKRWITG--KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVG 57 ++E V +++ + + G+ L R + F MPI+ M E S Sbjct: 592 LEEQNMAVVEPGSRERAALYSNLQRGTWKGELTRSVFLFKTMPIA-MLMRHWERGMSGPD 650 Query: 58 VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF-----SDPTEYIKALINGI--T 110 S+ AL++ ++ + L+ G++P + +++A + G Sbjct: 651 ARSKAGYIGALMVSTTVMGMLALQIDELLKGRDPVNMNPFEGKAGARNWVRAFLKGGSLG 710 Query: 111 HYERFS---PFNSSGWDV---LGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 Y F G + LGP + + + G K + Sbjct: 711 IYGDFLFSEQNQHGGGPIASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELL--KF 768 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 P NLWY + A NH + N + ++++PG AR + QR+ Sbjct: 769 AKGMTPGANLWYLKAATNHLIFNQLQEMVSPGYLARVKSRAQRE 812 >gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 918 Score = 161 bits (408), Expect = 5e-38, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 74/197 (37%), Gaps = 13/197 (6%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T L + F P + R L+ L V + + + G + Sbjct: 701 TYARDDAGELMKSFMLFKTTPFAGFR-QLVNRTRDLDTVP-AIKFLASYIGGTTLAGMFA 758 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSS-GWDVLGPWSS 131 + L++G +P LD + PT +++AL+ G + ++ + + SS G + GP S Sbjct: 759 IQMNSLLNGNDP-LDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATMGGPVLS 817 Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191 A +L ++ + PF NLWYA+ NH + + + Sbjct: 818 FAEQLTKLLITNPQKALQGEETSFGADAL--KTARMITPFANLWYAKAITNHLILQQLQE 875 Query: 192 VLNPGGRARAEVYRQRQ 208 + NPG R QR+ Sbjct: 876 MANPGYNDRVRDRAQRE 892 >gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15] gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15] Length = 918 Score = 160 bits (404), Expect = 2e-37, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 78/211 (36%), Gaps = 14/211 (6%) Query: 7 GSVGSTIQDKR-WITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRA 65 G + S + T L + F P + R L+ + L V + Sbjct: 687 GEMTSAVTTATGLDTYARDDAGQLIKSFMLFKTTPFAGFR-QLVNRANDLDTVP-AIKFL 744 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPF 118 + + G + + L++G +P LD + PT +++AL+ G + ++ + + Sbjct: 745 ASYIAGTTLAGMFANQMNSLLTGNDP-LDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQY 803 Query: 119 NSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 SS + GP S A +L ++ + PF NLWYA Sbjct: 804 GSSIAATIGGPVLSFAEQLTKLLITNPQKALQGEETSFGADAL--KTARMITPFANLWYA 861 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + NH + + ++ NPG R QR+ Sbjct: 862 KAITNHLILQQLQEMANPGYNDRVRDRAQRE 892 >gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1] gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus] Length = 864 Score = 154 bits (388), Expect = 9e-36, Method: Composition-based stats. Identities = 51/230 (22%), Positives = 96/230 (41%), Gaps = 32/230 (13%) Query: 1 VQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLV 56 VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ +S Sbjct: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLSNSAK 687 Query: 57 ---GVSSQVYRA-KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--IT 110 G S + + + ++ L+ G++P L Y L NG + Sbjct: 688 MPKGASMALNHVWIQYSATMALAGIGVASIKALLRGEDPSLP---EVIYDGTLANGALLP 744 Query: 111 HYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165 + +R + S G +LGP S L + E + + + + Sbjct: 745 YMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVN------ATKAI 798 Query: 166 NTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRK 215 +PF N+WY + +F+H + N I + LNPG Y RQ+ KK++K Sbjct: 799 RKTLPFMNMWYLKNSFDHLILNQILEELNPG-------YLDRQQSKKKKK 841 >gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] Length = 921 Score = 153 bits (386), Expect = 2e-35, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 66/197 (33%), Gaps = 13/197 (6%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T L + F P + R ++ +L V + A + G + Sbjct: 704 TYARDQGGELYKSFMLFKTTPFAGFR-QMVTRAQNLDRVP-ALKFLAAYIGGTTLTGMFA 761 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSS-GWDVLGPWSS 131 L L+SG +P +D + P ++ A + G ++ + + SS + GP Sbjct: 762 NQLNALLSGNDP-IDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATLGGPSLG 820 Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191 A L ++ + PF NLWY + NH + + + Sbjct: 821 LAESLMKLLITNPQKAMQGEETSFGADAI--KTARMITPFANLWYTKAVTNHLILQQLQE 878 Query: 192 VLNPGGRARAEVYRQRQ 208 + NPG R Q Q Sbjct: 879 MANPGYNDRVRDRAQNQ 895 >gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] Length = 924 Score = 152 bits (383), Expect = 4e-35, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 73/195 (37%), Gaps = 13/195 (6%) Query: 22 KDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKT 81 + +L + F P++ R + L + V A V G + Sbjct: 709 ARDTSGDLLKSFMLFKTTPMAGMR-QFVTRLQDLETMP-AVKFFAAYVAGTTLAGMFANQ 766 Query: 82 LVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG-PWSSQA 133 + L+SG +P LD + P +++AL+ G + ++ + + SS +LG P A Sbjct: 767 MNALLSGNDP-LDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYGSSIAGILGGPVLGFA 825 Query: 134 GKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVL 193 +L+ ++ + PF NLWY + NH + + ++ Sbjct: 826 EQLSKTVLTNSQKAMAGEETTFTADAL--KTARMITPFANLWYTKAITNHLILQQLQEMA 883 Query: 194 NPGGRARAEVYRQRQ 208 NPG AR R+ Sbjct: 884 NPGYNARVRDRAMRE 898 >gi|48697207|ref|YP_024937.1| hypothetical protein BcepC6B_gp17 [Burkholderia phage BcepC6B] gi|47779013|gb|AAT38376.1| gp17 [Burkholderia phage BcepC6B] Length = 864 Score = 152 bits (383), Expect = 4e-35, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 78/232 (33%), Gaps = 27/232 (11%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIE---------- 50 +E + ++ K + G+V L + QF P++ H Sbjct: 608 EEGEFAVLNPDLRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGDMRRSGDF 667 Query: 51 IPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPT----EYIKALI 106 +++ + A ALV+ I L++GK+P+ F D + +A Sbjct: 668 RVDGAPALANPMAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFS 727 Query: 107 NGIT--HYERF-------SPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 156 G + + S G + GP S + A V D K Sbjct: 728 VGGGAGFAGDMLVAAFQSADYGSLLGSAIGGPLLSTLFQPLRAVSSNVQDAAQGKDTHIG 787 Query: 157 QAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 ++ + P NLW+ + +N + +++ + L+PG R + Q Sbjct: 788 ADLL--KIAQSNTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQRNMNRSRTQ 837 >gi|221213942|ref|ZP_03586915.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166119|gb|EED98592.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 864 Score = 151 bits (381), Expect = 7e-35, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 79/232 (34%), Gaps = 27/232 (11%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIE---------- 50 +E + ++ K + G+ L + QF PI+ H Sbjct: 608 EEGEFAVLNPDLRTKVIASATPGTAMGELKKTFMQFKSFPIAMISRHWGRIGDMRRSGDF 667 Query: 51 IPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPT----EYIKALI 106 +++ + A ALV+ I + L++GK+P+ F D + +A Sbjct: 668 RVDGAPALANPMAYAAALVVSTTLIGAISTQVKNLLAGKDPEPMFDDVKHAAGFWTRAFS 727 Query: 107 NGIT--HYERF-------SPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 156 G + + S G V GP S ++ A D K + Sbjct: 728 VGGGAGFAGDMLTASFESTDYGSLLGSVVGGPLPSTIYQVVRAFSSNAQDAAQGKDTHVS 787 Query: 157 QAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 ++ + P NLW+ + +N + +++ + L+PG R + Q Sbjct: 788 ADLL--KVAQSNTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQRNINRSRNQ 837 >gi|221201510|ref|ZP_03574549.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207934|ref|ZP_03580940.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221172119|gb|EEE04560.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221178778|gb|EEE11186.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 869 Score = 149 bits (375), Expect = 3e-34, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 79/238 (33%), Gaps = 34/238 (14%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIE---------- 50 +E + ++ K + G+V L + QF P++ H Sbjct: 608 EEGEFAVLNPDLRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGNMRRSGDY 667 Query: 51 ------IPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPT----E 100 + +++ + A ALV+ I L++GK+P+ F D Sbjct: 668 LVEGAPRAFGIP-LANPMAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGF 726 Query: 101 YIKALINGIT--HYERF-------SPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTR 150 + +A G + + S G V GP S + A V D Sbjct: 727 WTRAFSVGGGAGFAGDMLVAAFESADYGSLLGSAVGGPLLSTLFQPLRAISSNVQDAAQG 786 Query: 151 KQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 K ++ + P NLW+ + +N + +++ + L+PG R + Q Sbjct: 787 KDTHVGADLL--KIAQSNTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQRNMNRSRTQ 842 >gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 530 Score = 148 bits (373), Expect = 5e-34, Method: Composition-based stats. Identities = 46/233 (19%), Positives = 87/233 (37%), Gaps = 36/233 (15%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSR------MHLIEIPSSL 55 +G + + + R +GQF P+S M I L Sbjct: 299 DARVKGIMTQGLLAGT-------GMGEAIRFVGQFKAFPMSIMNKVLGREMAYIRKGKKL 351 Query: 56 VGVSSQVYRA---------KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALI 106 G+S++ RA ALVI + T+ L+ GKEP+ D + + + Sbjct: 352 GGLSTEAGRAEIGRGIRGMAALVITSGFMGYMAMTMKDLLKGKEPR-DPTKFKTIMAGFL 410 Query: 107 NGITH-------YERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQ 159 G ++ S ++GP + L +A + A+ E G + Sbjct: 411 QGGGLGIYGDVLFKEQRDAGSVIAGLVGPAPTTVVDLGLALQYALLGE------GGKSGK 464 Query: 160 FGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 +++ +PF NL+Y + AF++ + I + +NPG + E ++ ++ Sbjct: 465 AAYRAISSNIPFLNLFYIKIAFDYLIGFQIMETVNPGVLKKVERRMKKDYNQE 517 >gi|254251753|ref|ZP_04945071.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] gi|124894362|gb|EAY68242.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] Length = 865 Score = 138 bits (346), Expect = 7e-31, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 73/232 (31%), Gaps = 27/232 (11%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIE---------- 50 E + ++ K G++ L + QF PI+ H Sbjct: 609 DEGEFAVLNPDLRTKVIAAATPGTLQGELQKTFLQFKSFPIAMISRHWGRIGEMRRSGDF 668 Query: 51 IPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPT----EYIKALI 106 ++S + ALV+ + L L+ GK+P+ D + +A Sbjct: 669 RVEGAPTLASPMAYGAALVVSTTLLGALAVQLQNLLLGKDPEPMGDDVKHGGAFWFRAFT 728 Query: 107 NGIT--HYERFS--------PFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 156 G P + G GP S A + K + Sbjct: 729 KGGGAGFAGDMLSAMLTGKNPAEAVGSVFGGPLVSTAIQAVTPFSNNAMAAAEGKDTHLS 788 Query: 157 QAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + + +P NLWY + +N + ++I + L+PG +R ++Q Sbjct: 789 ADLL--KFAQSNMPIVNLWYWKTVWNRLIWDNIAENLSPGVTSRNVAKSRQQ 838 >gi|48696687|ref|YP_024981.1| hypothetical protein VP5_gp18 [Vibrio phage VP5] gi|40806150|gb|AAR92068.1| hypothetical protein [Vibrio phage VP5] Length = 782 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 71/201 (35%), Gaps = 19/201 (9%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS---QVYRAKALVIGILGEELIRKTLV 83 L R + F PI+ M+ + G S ++ A +V + Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQAK 630 Query: 84 PLISGKEPQLDFSDPTEYIKALINGITH--YERFSPFNSSGWD------VLGPWSSQAGK 135 +++GK+P+ SDP +I+ + G + +SG+ V GP + Sbjct: 631 DILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYGDW 689 Query: 136 LAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNP 195 +A+ + + +PF NLWY + A + + + I + +P Sbjct: 690 VAMTAADMAKGDAES------AMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDP 743 Query: 196 GGRARAEVYRQRQKYKKQRKR 216 + ++ + Q++ Sbjct: 744 EYDKKQLNKMRKMQRTSQQEY 764 >gi|48696644|ref|YP_024423.1| hypothetical protein VP2p19 [Vibrio phage VP2] gi|40950042|gb|AAR97633.1| hypothetical protein [Vibrio phage VP2] Length = 782 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 71/201 (35%), Gaps = 19/201 (9%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS---QVYRAKALVIGILGEELIRKTLV 83 L R + F PI+ M+ + G S ++ A +V + Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQAK 630 Query: 84 PLISGKEPQLDFSDPTEYIKALINGITH--YERFSPFNSSGWD------VLGPWSSQAGK 135 +++GK+P+ SDP +I+ + G + +SG+ V GP + Sbjct: 631 DILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYGDW 689 Query: 136 LAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNP 195 +A+ + + +PF NLWY + A + + + I + +P Sbjct: 690 VAMTAADMAKGDAES------AMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDP 743 Query: 196 GGRARAEVYRQRQKYKKQRKR 216 + ++ + Q++ Sbjct: 744 EYDKKQLNKMRKMQRTSQQEY 764 >gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] Length = 995 Score = 134 bits (336), Expect = 1e-29, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 75/217 (34%), Gaps = 20/217 (9%) Query: 7 GSVGSTIQDKRWIT--GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVY 63 + +++ + + G++ R +GQF P++ + Sbjct: 765 AVINPGARERAMLRRGTQAGTLEGEALRFVGQFKAFPVAVISKVWGRDLYGGERGWGRAA 824 Query: 64 RAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERF--SPFN 119 ++ + L L G+ P+ D +DP + A + G Y F ++ Sbjct: 825 GIVHTLVATTVMGYVAGMLKDLSKGRAPR-DPTDPRAWGAAFLQGGGAGIYGDFLLGQYS 883 Query: 120 SSGWDVL----GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 G L GP S AG+L +W + G + + PF NL+ Sbjct: 884 RFGNRFLESAAGPTLSSAGELL-----NIWA---GAREGNDEKAATLRWTLSNTPFVNLF 935 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 Y R A ++ + + +NPG R E + ++ Sbjct: 936 YTRMALDYLFLYQVQEAMNPGFLRRFEQRVAKDNNQR 972 >gi|291336683|gb|ADD96225.1| hypothetical protein Rsph17025_0444 [uncultured organism MedDCM-OCT-S08-C1350] Length = 850 Score = 133 bits (335), Expect = 1e-29, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 79/227 (34%), Gaps = 15/227 (6%) Query: 2 QEHARGSVGSTIQDKRWITG--KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGV 58 E S+ + + ++G + G+V + M + PI+ M + VG+ Sbjct: 616 NETNFAVPTSSAKGRITLSGSAQPGTVKGEIVNSMLMYKNFPITL-GMTHLSRGFQQVGL 674 Query: 59 SSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDP-TEYIKALINGITH------ 111 + +++G I + + +GK P ++ A+I G Sbjct: 675 KGKAKYLVPMIVGGAVMGSIAYEIKQIAAGKTPTKPEDMGVRYWLNAIIYGGGLGIFGDF 734 Query: 112 -YERFSPFN-SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFV 169 + + + S + GP +S G + ++ + + + Sbjct: 735 LFSDQNRYGGSFSKTLAGPVASFIGDSINLTFGNAAQLISGEKTNAGKELAA--FIQRYT 792 Query: 170 PFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 P +LWYAR A + +SI+ ++NP + + K + + Sbjct: 793 PGSSLWYARVALERILFDSIERLINPDFDSDNRRNINKLKSRTGQDY 839 >gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 854 Score = 132 bits (331), Expect = 4e-29, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 72/216 (33%), Gaps = 15/216 (6%) Query: 3 EHARGSVGSTIQDKRW--ITGKDGSVN-NLARLMGQFLVMPISWSRMHLIEIPSSLVGVS 59 E + ++ + + + G+V L+R QF P++ M + Sbjct: 619 ETNAAVLEVGARESTFMGLGRERGTVGNELSRFFWQFKQFPLAMI-MRQWTRGMAQGTPQ 677 Query: 60 SQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT--HYERFS- 116 + L + + L GK+ D + Y+K+++ G + Sbjct: 678 EKFVYFAKLFAYTTVMGALVSQIQNLTQGKD-LDDPTTLDFYMKSIVKGGSASFLADAIS 736 Query: 117 ----PFNSSGWDVLGPW-SSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 P S D + P + A T + +V +PF Sbjct: 737 ATSDPTERSVKDFIIPAAFKDITSIGTMVSGAGSAFITERDSSYG--AEAVNVVKNNIPF 794 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQR 207 QNLWY+R F+ V + ++ + G R R + ++ Sbjct: 795 QNLWYSRLVFDRLVIAEMQELFDEGYRERKQRRQEN 830 >gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 831 Score = 124 bits (312), Expect = 7e-27, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 70/184 (38%), Gaps = 17/184 (9%) Query: 36 FLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF 95 F ++ + S + ++ + ++ G+L + L+ +++G++P+ D Sbjct: 627 FKSFGLAMFE-RHWKRVSQIESTGGKLAYSASVFTGLLMAGAMTNQLMDIMNGRDPR-DM 684 Query: 96 SDPTEYIKALINGIT--HYERFSPFNSSG---------WDVLGPWSSQAGKLAIAGKEAV 144 D +++A++ G + G +LGP A + + Sbjct: 685 KDGKFWLQAMLRGGGVGIFGDILNTGLGGDNRGGQSNLTGLLGPVYGTAADVGLTLGSVF 744 Query: 145 WDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVY 204 ++ G + G PF WY + AF H V + + ++L+PG +R + Sbjct: 745 KEKTEPADVGANLLRIGY----QNTPFIRSWYTKAAFEHAVMHDMQEMLSPGYLSRMKKR 800 Query: 205 RQRQ 208 ++ Sbjct: 801 AKKD 804 >gi|146276496|ref|YP_001166655.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] gi|145554737|gb|ABP69350.1| hypothetical protein Rsph17025_0444 [Rhodobacter sphaeroides ATCC 17025] Length = 830 Score = 122 bits (305), Expect = 5e-26, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 74/222 (33%), Gaps = 16/222 (7%) Query: 1 VQEHARGSVGSTIQDKRWITG--KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVG 57 ++E ++I+ + + G GSV L R + +S ++ +SL Sbjct: 583 LEELEFAIPTASIEGRALLQGTAAPGSVAGELMRSSMSYKSFSLSLM-LNQYRRFASLPT 641 Query: 58 VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH--YERF 115 + A + +L + L L G +P+ + ++ AL G + F Sbjct: 642 PWDKAKYAAKVSTLLLVTGAMAIQLKELAKGNDPR-PMDENKFWLAALFQGGGLGIFGDF 700 Query: 116 SPFNSS------GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFV 169 +S + GP AG L + + + LV Sbjct: 701 FSAETSRVGGGLAETIAGPVVGAAGDLLKPVASNITRAVQGEDTLVGRDVAA--LVRRNT 758 Query: 170 PF-QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210 PF + WYAR A++ V + + L+P ++ Sbjct: 759 PFLSSAWYARTAYSRLVADELQAFLDPEAEVLFRRRMKKMAK 800 >gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1] gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 855 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 77/221 (34%), Gaps = 39/221 (17%) Query: 22 KDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLV------GVSSQVYR---------- 64 + G+V +L R + QF P ++ + L G S + R Sbjct: 627 QPGTVPGDLLRFVTQFKSFPAAYMQKTLGRELYGRGYTPTALGNSFRGGRDLVQALRNGN 686 Query: 65 -----AKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------Y 112 L++ + + G+EP+ DP ++ A++ G + Sbjct: 687 GERLALAQLMLWTTAFGYLSMASKDVTKGREPR-PADDPKTWLAAMVQGGGLGIFGDYLF 745 Query: 113 ERFSPFNSSG-WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 + F +S GP A + +W R + G A L PF Sbjct: 746 GEANRFGNSALESAAGPTIGTAADVI-----NLWA---RAKEGDDTASSALRLAQNNTPF 797 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 NL+Y R A +H S+ + +NPG R E ++Q ++ Sbjct: 798 MNLFYTRIALDHLFLYSVQEAMNPGSLRRTEERIRQQNGQE 838 >gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine microorganism HF4000_48F7] Length = 828 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 79/213 (37%), Gaps = 23/213 (10%) Query: 16 KRWITGKDGSVNN-----LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVI 70 R+ TG++G + +L QF ++ + S Y +V Sbjct: 609 GRFFTGEEGIKSGTPMAMANKLFWQFRSFGLT-MLFRQWPRAYEMGLPS--FYHLVPMV- 664 Query: 71 GILGEELIRKTLVPLISGKEPQLDFSDP-TEYIKALIN-------GITHYERFSPFNSSG 122 + + ++ G+E + DP + +++ G + + +++S Sbjct: 665 ---LMGYVAMAMKDILKGRELKDVVEDPGKIAVASVLQSGFGGIAGDFLFNDYRQYSTSY 721 Query: 123 WDVL-GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAF 181 D+L GP S LA G D T A G V +P+ N W +R F Sbjct: 722 VDLLAGPSGSSLNDLAEFGATTF-DVATGGDP-VDAAAAGWRAVKGNIPYANWWASRTLF 779 Query: 182 NHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQR 214 ++ + + ++LNPG R E +++ + R Sbjct: 780 DYLINYQVQEILNPGSLRRMERRFKQKNNQDYR 812 >gi|303328566|ref|ZP_07359001.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861332|gb|EFL84271.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 855 Score = 101 bits (250), Expect = 9e-20, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 71/233 (30%), Gaps = 39/233 (16%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSR---------------- 45 + R + + + R + QF PI++ + Sbjct: 616 DDATRAIMRQGTRPGT-------GAGEVWRAIMQFKSFPIAYMQRVLGGRRWVRGDLQRG 668 Query: 46 MHLIEI---PSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYI 102 M + ++ + V+ + TL L G+EP+ T Sbjct: 669 MRYGPRNLPGAVEDALTRDMGGLMGFVLSSVAFGYASMTLKDLAKGREPRSLAHRETWLA 728 Query: 103 KALINGI-THYERFS-----PFN-SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGK 155 A+ +G + F S +GP G A G + V + Sbjct: 729 AAMQSGGAGIFGDILFGKVNRFGNSFAETAVGPLGGLIGDAATLGGQLVRGDM------A 782 Query: 156 AQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + L PF NLWY R A + + + ++++PG R E +++ Sbjct: 783 DAGEDTLRLAMGNAPFINLWYTRAALDWMLLYHVREMMSPGTLRRTERKMKKE 835 >gi|190893672|ref|YP_001980214.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] gi|190698951|gb|ACE93036.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] Length = 460 Score = 94.9 bits (234), Expect = 7e-18, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 46/109 (42%), Gaps = 10/109 (9%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQ 61 RG++ +Q I R QF P+++ H++ + G++++ Sbjct: 352 DARIRGAMTGGLQRGTII-------GEAVRSATQFKSFPMTYMMTHMMRALTQ--GMANR 402 Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGIT 110 YR L + + + LI+G++PQ + +DP + ++ I G Sbjct: 403 TYRTTQLALTMTIAGAEMSQMQSLIAGRDPQ-NMADPRFWEQSFIRGGG 450 >gi|262043648|ref|ZP_06016757.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038986|gb|EEW40148.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 974 Score = 91.8 bits (226), Expect = 5e-17, Method: Composition-based stats. Identities = 24/238 (10%), Positives = 66/238 (27%), Gaps = 46/238 (19%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPS-------- 53 + +Q + + R QF S+ + + Sbjct: 738 DARTMSIMKQGMQRGT-------AYGEMLRFAWQFKSFTASFMQNAIGRELYGRGYDFGS 790 Query: 54 --------------SLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPT 99 ++ + ++ L + + ++ G+ P+ + + Sbjct: 791 LSQNNTFRNNALIRAMRNGNGELMGIAQLFLWATAFGYLSMQTKLMLRGQTPR-PADNVS 849 Query: 100 EYIKALINGITH-------YERFSPFN-SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRK 151 + A+ G + ++ F + + GP++S A +L + Sbjct: 850 TWTAAMAQGGGLGILGDFLFGEYNRFGNTPATSLAGPFASDAAQLVNLFGLTKQGDAKAA 909 Query: 152 QRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 P+ NL R + + N + + ++PG R + + ++ Sbjct: 910 DY--------FNFAINHTPYMNLHVVRPVMDFLILNQMREWMSPGSLQRYQQRVKEEQ 959 >gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 810 Score = 88.7 bits (218), Expect = 5e-16, Method: Composition-based stats. Identities = 64/220 (29%), Positives = 108/220 (49%), Gaps = 13/220 (5%) Query: 2 QEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPS----SLVG 57 Q++ARGSVGS+++D ++ + + G L+ QFL PIS + HL +P G Sbjct: 592 QDNARGSVGSSLRDTKYTSSRGGIPG--LSLVTQFLTTPISMAEKHLWAVPKTLVGGANG 649 Query: 58 VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERF-S 116 +S+ YRAK L GI+ E ++ T ++G+E DF+DP +THY+RF + Sbjct: 650 MSAWSYRAKFLAFGIVLEGIVANTARKALTGQE-LDDFTDPKVLALMTARTLTHYDRFFN 708 Query: 117 PFNSSGWDVLG--PWSSQAGKLAIAGKEAVWD-EGTRKQRGKAQAQFGKELVNTFVPFQN 173 ++ D+L P +S L AG E + G +++ + V +P +N Sbjct: 709 EYHHDFKDLLHSVPVASTVIGLGDAGLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLKN 768 Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 L+Y + AF V +++ + N G + R R+ K + Sbjct: 769 LFYVKAAFQKMVVDNLCEYFNEGYKDRLA--MNRELRKSR 806 >gi|157372110|ref|YP_001480099.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] gi|157323874|gb|ABV42971.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] Length = 850 Score = 84.9 bits (208), Expect = 8e-15, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 69/216 (31%), Gaps = 38/216 (17%) Query: 22 KDGS-VNNLARLMGQFLVM-------------------PISWSRMHLIEIPSSLVGVSSQ 61 + G+ + R GQF P + + +++ + + Sbjct: 630 QPGTPLGEAIRFGGQFKSFTGSFMQNTIGREIYGRGYTPAELGQSRFTSLANAMRNGNGE 689 Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKAL-------INGITHYER 114 L I + + L+ G+ P+ +D ++ A I G + Sbjct: 690 KMGLAQLFIWMTALGYVSMQTKLLLKGQTPRP--ADAKTFLAAAAQGGGLGIMGDFLFGE 747 Query: 115 FSPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQN 173 ++ F + GP ++ R + G A+A + PF N Sbjct: 748 YNRFGGGLASSLAGPTVGDLDQIRNLFL--------RARDGDAKAADLLKFGIDHTPFMN 799 Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 L R A N+ + N + L+PG R ++++ Sbjct: 800 LHVVRPAMNYLILNRAQEWLSPGSLERYRQRVEKEQ 835 >gi|262043550|ref|ZP_06016663.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039084|gb|EEW40242.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 143 Score = 72.9 bits (177), Expect = 3e-11, Method: Composition-based stats. Identities = 30/132 (22%), Positives = 48/132 (36%), Gaps = 6/132 (4%) Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH--YERFSPFNSSGWDVLGPWSSQAGKLA 137 L+ G+ P+ +D ++ A G F F GP +S G A Sbjct: 1 MQSKLLLKGQTPRP--ADAKTFLAAASQGGGLGILGDFM-FGEVNRMGAGPVTSLMGPAA 57 Query: 138 IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGG 197 + + RG A PF N+++ R A N + N I D L+PG Sbjct: 58 SNADSIIT-LLQQTTRGDADLGDWYRTALDNTPFLNVFWLRTAMNGLILNRIQDALDPGS 116 Query: 198 RARAEVYRQRQK 209 R + +R++ Sbjct: 117 LERYQRRVEREQ 128 >gi|291336673|gb|ADD96216.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 101 Score = 66.0 bits (159), Expect = 4e-09, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 40/106 (37%), Gaps = 13/106 (12%) Query: 105 LINGITH-------YERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQ 157 ++ G + S+ +GP ++A ++ A A+ E G Sbjct: 1 MLQGGGLGIYTDFLFGNIQNSTSALATAVGPIPTEAARVLSALNYAIKGE------GGKA 54 Query: 158 AQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEV 203 + + +PF NL+Y + AF++ + + + L+PG Sbjct: 55 GKQAYYSIKENIPFLNLFYIKTAFDYMIGYQMMETLSPGSLKEWRK 100 >gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 56 Score = 61.4 bits (147), Expect = 9e-08, Method: Composition-based stats. Identities = 30/55 (54%), Positives = 43/55 (78%) Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 KE++NT VPFQNLWY + F++FVR +DD +NPG RARAE YR++ +++RK+ Sbjct: 2 KEVLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRKK 56 >gi|218514216|ref|ZP_03511056.1| hypothetical protein Retl8_11184 [Rhizobium etli 8C-3] Length = 73 Score = 59.1 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 12/52 (23%), Positives = 27/52 (51%) Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 + + P +LWY + A + + ++I +++P RA + Y +R K + + Sbjct: 6 LKAWTPGSSLWYTKIATDRLIFDNIQAMIDPNYRASFDRYERRMKREFGQAF 57 >gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 137 Score = 58.3 bits (139), Expect = 8e-07, Method: Composition-based stats. Identities = 33/137 (24%), Positives = 59/137 (43%), Gaps = 10/137 (7%) Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITHYERF-SPFNSSGWDVLG--PWSSQAGKL 136 L+ K +DF+DP +THY+RF + ++ D+L P +S L Sbjct: 4 TQQERLLQVKN-SIDFTDPKTLALLTARTLTHYDRFFNEYHHDFKDLLHAVPVASTIIGL 62 Query: 137 AIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 A DE R++ A+ +P +NL+YA+ AF + +++ + N G Sbjct: 63 GDARNIFGEDEEKREKANANFAKELAN----NIPLKNLFYAKAAFQKMIVDNLCEYFNEG 118 Query: 197 GRARAEVYRQRQKYKKQ 213 + R + R+ K + Sbjct: 119 YKERLD--MNRELRKSR 133 >gi|315121758|ref|YP_004062247.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495160|gb|ADR51759.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 107 Score = 49.8 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 45/99 (45%), Positives = 57/99 (57%), Gaps = 6/99 (6%) Query: 42 SWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEY 101 S L +L+G SS L++ EELI+ LVPLISG EP+ D + P +Y Sbjct: 12 SLFPHFLFVRSKALLGRSS----ILILLVEYANEELIKNVLVPLISGNEPRFDITSPRDY 67 Query: 102 IKALINGITHYERFSPF--NSSGWDVLGPWSSQAGKLAI 138 KA++N ITHYERFSP S WD+LGP QAG+L Sbjct: 68 AKAIVNAITHYERFSPLGGGQSKWDILGPALGQAGRLGG 106 >gi|212710806|ref|ZP_03318934.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] gi|212686503|gb|EEB46031.1| hypothetical protein PROVALCAL_01874 [Providencia alcalifaciens DSM 30120] Length = 1122 Score = 49.0 bits (115), Expect = 5e-04, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 40/180 (22%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 L +++ QF + H + S + + Y L I + L +++ Sbjct: 951 ELGKIVMQFKTF---FFGTHNRALVSGIQSGDASFYYGALLQIS---LGSLVYVLKSMMA 1004 Query: 88 GKE----PQLDFSDPTEYIKAL------------INGITH-----YER--FSPFNS--SG 122 G+E P + ++ + ++G ++ + S + S Sbjct: 1005 GREINAEPANLVKEGLDWSGMMGWLGEPNNLLENLSGGSYGMSAMFGGPPASRYQSRNGI 1064 Query: 123 WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 +LGP G + + E ++ V +PFQNL+Y N Sbjct: 1065 GALLGPTFDLGGDIQNITAGVMNGEFDDREV---------RSVRKLLPFQNLFYLSPLLN 1115 >gi|291336674|gb|ADD96217.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 333 Score = 48.7 bits (114), Expect = 6e-04, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 28/95 (29%), Gaps = 5/95 (5%) Query: 1 VQEHARGSVGSTIQDKRWIT--GKDGS-VNNLARLMGQFLVMPISWSRMHLIE--IPSSL 55 + + + + W+ + G+ R M QF P ++ + + Sbjct: 234 LDRSTYAVLEPDARTRGWMKMGQQAGTHPGEALRFMTQFKAFPFAFYQKMIGRETAAWKD 293 Query: 56 VGVSSQVYRAKALVIGILGEELIRKTLVPLISGKE 90 + LV G + T ++ GK Sbjct: 294 GNKMNAALSMAQLVGGSALFGYMAMTAKDILKGKN 328 >gi|227355848|ref|ZP_03840241.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] gi|227164167|gb|EEI49064.1| conserved hypothetical protein [Proteus mirabilis ATCC 29906] Length = 1127 Score = 47.9 bits (112), Expect = 0.001, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 58/188 (30%), Gaps = 41/188 (21%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 L +++ QF + H + S + + Y L + + L ++ Sbjct: 956 ELGKIVMQFKTF---FFATHNRALVSGIQSGDASFYYGALLQVA---LGSLVYVLKAKMA 1009 Query: 88 GKE----PQLDFSDPTEYIKAL------------INGITH-----YER--FSPFNS--SG 122 G++ P + ++ + ++G T+ + S + S Sbjct: 1010 GRDINTEPANLVKEGLDWSGMMGWLGEPNNVLENLSGGTYGMSAMFGGPPASRYQSRNGI 1069 Query: 123 WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 +LGP G + + E ++ V +PFQNL+Y N Sbjct: 1070 GALLGPTFDLGGDIKNITSGVLNGEFDDREV---------RSVRKLLPFQNLFYLSPLLN 1120 Query: 183 HFVRNSID 190 V + Sbjct: 1121 Q-VEEQMK 1127 >gi|301021601|ref|ZP_07185598.1| hypothetical protein HMPREF9551_01224 [Escherichia coli MS 196-1] gi|299881535|gb|EFI89746.1| hypothetical protein HMPREF9551_01224 [Escherichia coli MS 196-1] Length = 614 Score = 44.0 bits (102), Expect = 0.013, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 48/178 (26%), Gaps = 40/178 (22%) Query: 30 ARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGK 89 R + QF + L+ L ++Q Y A I + L GK Sbjct: 445 GRTIFQFKSFTTASYNRALLG---GLQEGTAQFYYGTAFQIA---LGSLVYALKEASKGK 498 Query: 90 E----PQL-------------------DFSDPTEYIKALINGITHYERFSPFNSSG--WD 124 + P+ + ++ + + S + S G Sbjct: 499 DVDWSPEKLVLEGIDRSGILGPLMEYNNMAEKATGGAVGLGALFGTGTQSRYASRGFVSS 558 Query: 125 VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 + GP S A + + + + V T +P NL++ N Sbjct: 559 LFGPSFSLADSIIDVTSGVLNGDVGDRIVHN---------VRTTIPGNNLFWIAPLIN 607 >gi|119386478|ref|YP_917533.1| hypothetical protein Pden_3771 [Paracoccus denitrificans PD1222] gi|119377073|gb|ABL71837.1| hypothetical protein Pden_3771 [Paracoccus denitrificans PD1222] Length = 1099 Score = 44.0 bits (102), Expect = 0.014, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 51/196 (26%), Gaps = 39/196 (19%) Query: 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISG 88 + R+ QF ++ S ++ V+G+ L L SG Sbjct: 912 VGRMALQFRSFALA-SNQRVLLRGLQEDQTRFWGG-----VVGMSAIGAFIYMLKQLESG 965 Query: 89 KEPQLDFSDPTEYIKALINGITHY-------------ERFSPFNSSGWDVLG-------- 127 +E +P ++ ++ + F +N++ G Sbjct: 966 RE---ISDNPGTWVAEGLDRSGIFSLAFEVNNALEKAGGFGIYNAAAAAFPGKSQKAPAS 1022 Query: 128 ---------PWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYAR 178 +L + + + PF +L Y R Sbjct: 1023 RFASRTGYASMFGPTYELGEGAYGLMSMGLRAARGDLDMTAGDVGTLRRMTPFASLPYWR 1082 Query: 179 GAFNHFVRNSIDDVLN 194 + + N + + L+ Sbjct: 1083 WLIDGQIVNPLKESLS 1098 >gi|259418630|ref|ZP_05742547.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259344852|gb|EEW56706.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 1302 Score = 42.1 bits (97), Expect = 0.048, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 42/173 (24%), Gaps = 27/173 (15%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 + + GQF +S L+ G+ A L + + + Sbjct: 1126 EMGKFFGQFKSFALSAHHRILLS------GIQRADADVLAQATTALVFGALTANVKAYLG 1179 Query: 88 GKEPQLDFS----------------DPTEYIKALINGITHYERFSPFNSSGWDVLGPWSS 131 G EP+ + +P AL G T L Sbjct: 1180 GYEPKEGAAMWEDALDRSGLAGWLMEPYNLAAALSGGKTSITGEPVSRYQARSALEGALG 1239 Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHF 184 + + G EA+ + +P NLWY F Sbjct: 1240 PSVDMMKGGVEAINAF-----SNGKANYRDVRKLMRPIPGNNLWYLLPLFQKV 1287 >gi|294490696|gb|ADE89452.1| conserved hypothetical protein [Escherichia coli IHE3034] Length = 1129 Score = 41.3 bits (95), Expect = 0.091, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 53/188 (28%), Gaps = 41/188 (21%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 +++ QF + S + + AL G + + + +S Sbjct: 958 EGWKMITQFKTF----IFAQHNRVLVSGIQQGDAAFYLGAL--GTIALGSMVYMMKQKLS 1011 Query: 88 GKEPQLD-----------------FSDPTEYIKALING----ITHYER--FSPFNS--SG 122 G++ S+P ++ + G + S F S + Sbjct: 1012 GRDIDYSWNNLVKEGIDRGGMLGWLSEPLNTVENISGGRFGLGAMFGAPPVSRFQSRNAI 1071 Query: 123 WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 +LGP G A + E +Q V +PFQNLW N Sbjct: 1072 GALLGPTFDLGGDAATVANGVLNGEFDSQQTH---------AVRKMLPFQNLWAISPLLN 1122 Query: 183 HFVRNSID 190 V + Sbjct: 1123 K-VEEQMK 1129 >gi|301046396|ref|ZP_07193556.1| conserved domain protein [Escherichia coli MS 185-1] gi|300301622|gb|EFJ58007.1| conserved domain protein [Escherichia coli MS 185-1] Length = 1129 Score = 41.3 bits (95), Expect = 0.092, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 53/188 (28%), Gaps = 41/188 (21%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 +++ QF + S + + AL G + + + +S Sbjct: 958 EGWKMITQFKTF----IFAQHNRVLVSGIQQGDAAFYLGAL--GTIALGSMVYMMKQKLS 1011 Query: 88 GKEPQLD-----------------FSDPTEYIKALING----ITHYER--FSPFNS--SG 122 G++ S+P ++ + G + S F S + Sbjct: 1012 GRDIDYSWNNLVKEGIDRGGMLGWLSEPLNTVENISGGRFGLGAMFGAPPVSRFQSRNAI 1071 Query: 123 WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 +LGP G A + E +Q V +PFQNLW N Sbjct: 1072 GALLGPTFDLGGDAATVANGVLNGEFDSQQTH---------AVRKMLPFQNLWAISPLLN 1122 Query: 183 HFVRNSID 190 V + Sbjct: 1123 K-VEEQMK 1129 >gi|295096859|emb|CBK85949.1| hypothetical protein ENC_24210 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 963 Score = 40.6 bits (93), Expect = 0.14, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 53/195 (27%), Gaps = 49/195 (25%) Query: 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISG 88 L + + QF + + L + Q Y A IG + L +G Sbjct: 793 LGKTIFQFKSFATASYNRATLG---GLQEGTGQFYYGTAFQIG---LGALTYALKQSANG 846 Query: 89 KE--------------------PQLDFSDPTEYIKALINGITHY---ERFSPFNSSG--W 123 KE P +++++ E + G+ S + S G Sbjct: 847 KEVDWSPNKLVLEGVDRSGILGPLMEYNNMAEKASGGMVGLGALLGTGTQSRYASRGFIG 906 Query: 124 DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNH 183 LGP + + V T +P NL++ N Sbjct: 907 SALGPTFGLLDTITDVTAGVL---------NGDAGDRVLHNVRTLLPGNNLFWIAPLINQ 957 Query: 184 FVRNSIDDVLNPGGR 198 ++PG R Sbjct: 958 ---------VDPGMR 963 >gi|262043399|ref|ZP_06016524.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039225|gb|EEW40371.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 964 Score = 39.8 bits (91), Expect = 0.27, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 50/195 (25%), Gaps = 49/195 (25%) Query: 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISG 88 L + + QF + + L ++Q Y A IG + L +G Sbjct: 794 LGKTVFQFKSFATASYNRATLG---GLQEGTAQFYYGTAFQIG---LGSLTYALKQAANG 847 Query: 89 KE----PQLD-----------------FSDPTEYIKALINGITHYER--FSPFNSSG--W 123 +E PQ + + +I S + S G Sbjct: 848 REVDLTPQKMVLEGIDRSGILGPLMEYNNMAEKASGGMIGLGPLLGTGTQSRYASRGFIG 907 Query: 124 DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNH 183 LGP + + V T +P NL++ N Sbjct: 908 SALGPTFGLLDTVTDVTAGVL---------NGDAGDRVLHSVRTLLPGNNLFWVAPLINQ 958 Query: 184 FVRNSIDDVLNPGGR 198 ++PG R Sbjct: 959 ---------VDPGMR 964 >gi|13186153|emb|CAC33464.1| hypothetical protein [Legionella pneumophila] Length = 504 Score = 39.8 bits (91), Expect = 0.28, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 55/184 (29%), Gaps = 27/184 (14%) Query: 8 SVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA 67 S+ + DK + + G+ ++ LM QF + + + ++ + Sbjct: 324 SLSPNLGDKPLLLQQRGAFGHMTNLMFQFKSFLFAATNRIFYSGIQNRNDINLYLG---- 379 Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSDP------------TEYIKALINGITHY--E 113 + ++G ++ + + G + + + + + G + Sbjct: 380 -AVSMMGLGMLGYVVSSHLRGNKEIDLSTKNLLREGVDRSGILAIFGEGINIGQKLFQLG 438 Query: 114 RFSPFNSSGW--DVLGPWSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNTFVP 170 S + S VLGP +L K + K E V +P Sbjct: 439 EVSRYKSRDAFGSVLGPTGGSVSQLVSLFNKLNPLSTAKGEWTTKDA-----EAVMRLMP 493 Query: 171 FQNL 174 F L Sbjct: 494 FAKL 497 >gi|67925131|ref|ZP_00518505.1| Cell division transporter substrate-binding protein FtsY [Crocosphaera watsonii WH 8501] gi|67853025|gb|EAM48410.1| Cell division transporter substrate-binding protein FtsY [Crocosphaera watsonii WH 8501] Length = 457 Score = 37.9 bits (86), Expect = 0.91, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 46/153 (30%), Gaps = 6/153 (3%) Query: 49 IEIPSSLVGVSSQVYRAKALVIGI--LGEELIRKTLVPLIS-GKEPQLDFSDPTEYIKAL 105 ++ L + + +E + L + G+ P + ++K L Sbjct: 97 MKKSDRLEALKETAMETATVEDTTPLTFDEEFVWSAKVLAAQGRAPDDVSQEEINWLKRL 156 Query: 106 INGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165 G+ + + ++G + + + + + +L+ Sbjct: 157 RQGLG--KSRLGLVNQLKSIVGQGPLNEDAVMEIEALLLQADTGLEATDYIISTLQNKLL 214 Query: 166 NTFVP-FQNLWYARGAFNHFVRNSIDDVLNPGG 197 +P Q L Y + + + V+NPG Sbjct: 215 EEALPPEQALEYLKSILREILDRPLQKVVNPGF 247 >gi|313219833|emb|CBY30750.1| unnamed protein product [Oikopleura dioica] Length = 266 Score = 37.9 bits (86), Expect = 0.93, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 50/179 (27%), Gaps = 18/179 (10%) Query: 48 LIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107 +++ + + RA + + L+ + + ++ ++ Sbjct: 38 IVDQMLATPVSKEEADRAIGISGNT-------EQEDDLLRRSDTMISDRIHKSWVTKVVK 90 Query: 108 GITHYERFSPFNSSG-WDVLGPWSSQAGKL-----AIAGKEAVWDEGTRKQRGKAQAQFG 161 + ++ +NS+ P + K+ G A Sbjct: 91 QTSFLQK--KYNSADCRRFGNPNQDDVSDVWDEFGGDKIKDQYNLFGKLIWPASAADGLR 148 Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSID---DVLNPGGRARAEVYRQRQKYKKQRKRN 217 P NLW + ++ +I D NP ++ +YK RK + Sbjct: 149 GRQTVEEAPLTNLWLQFARWRLWINENISTCYDESNPKATRAGAQRVKKWRYKAHRKFD 207 >gi|172038841|ref|YP_001805342.1| cell division protein [Cyanothece sp. ATCC 51142] gi|171700295|gb|ACB53276.1| cell division protein [Cyanothece sp. ATCC 51142] Length = 464 Score = 37.5 bits (85), Expect = 1.5, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 47/153 (30%), Gaps = 6/153 (3%) Query: 49 IEIPSSLVGVSSQVYRAKALVIGI--LGEELIRKTLVPLIS-GKEPQLDFSDPTEYIKAL 105 ++ L + A+ +E + L + G+ P + ++K L Sbjct: 104 MQKSDRLEALKETAMETAAVEDTTPITFDEEFVWSAKVLAAQGRAPDDISEEEINWLKRL 163 Query: 106 INGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165 G+ + + ++G + + + + + +L+ Sbjct: 164 RQGLG--KSRRGLVNQLKSIVGQGPLNEDAVMEIEALLLQADTGIEATDYIISTLQNKLL 221 Query: 166 NTFVP-FQNLWYARGAFNHFVRNSIDDVLNPGG 197 +P Q L Y + + + V+NPG Sbjct: 222 EEALPPEQALDYLKSILRDILDRPLQKVVNPGF 254 >gi|313229825|emb|CBY07530.1| unnamed protein product [Oikopleura dioica] Length = 266 Score = 36.7 bits (83), Expect = 2.0, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 49/179 (27%), Gaps = 18/179 (10%) Query: 48 LIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107 +++ + + RA G + L+ + + ++ ++ Sbjct: 38 IVDQMLATPVSKEEADRAI-------GISGNAEQEDDLLRRSDTMISDRIHKSWVTKVVK 90 Query: 108 GITHYERFSPFNSSG-WDVLGPWSSQAGKL-----AIAGKEAVWDEGTRKQRGKAQAQFG 161 + ++ +NS+ P + K+ G A Sbjct: 91 QTSFLQK--KYNSADCRRFGNPNQDDVSDVWDEFGGDKIKDQYNLFGKLIWPASAADGLR 148 Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSID---DVLNPGGRARAEVYRQRQKYKKQRKRN 217 P NLW + ++ +I D NP ++ + K RK + Sbjct: 149 GRQTVEEAPLTNLWLQFARWRLWINENISTCYDESNPKATRAGAQRVKKWRNKAHRKFD 207 >gi|85117696|ref|XP_965305.1| hypothetical protein NCU03205 [Neurospora crassa OR74A] gi|28927112|gb|EAA36069.1| predicted protein [Neurospora crassa OR74A] Length = 188 Score = 36.3 bits (82), Expect = 2.6, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 47/120 (39%), Gaps = 9/120 (7%) Query: 61 QVYRAKALVIGILGEELIRKTL-------VPLISGKEPQLDFSDPTEYIKALINGITHYE 113 + Y A+ L L L+ TL + S +P +++ +I + I Sbjct: 52 ETYFARYLGFSQLTIGLLIVTLSGALPLTTNVTSPNDPNSPYANAVVFISMFYHTIGAAY 111 Query: 114 RFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQN 173 +S + SSG V G G ++ A+W ++GK A G + + PF+N Sbjct: 112 GYSRYTSSGGHV-GYLLGFTGSAIMSA-YALWLFMFAGEQGKVNATTGADKRTSNFPFKN 169 >gi|126657247|ref|ZP_01728413.1| cell division protein; FtsY [Cyanothece sp. CCY0110] gi|126621518|gb|EAZ92229.1| cell division protein; FtsY [Cyanothece sp. CCY0110] Length = 463 Score = 36.3 bits (82), Expect = 3.0, Method: Composition-based stats. Identities = 17/153 (11%), Positives = 46/153 (30%), Gaps = 6/153 (3%) Query: 49 IEIPSSLVGVSSQVYRAKALVIGI--LGEELIRKTLVPLIS-GKEPQLDFSDPTEYIKAL 105 ++ L + + ++ + L + G+ P + ++K L Sbjct: 103 MQKSDRLEALKETAMEKATVEDTTPITFDDEFVWSAKVLAAQGRAPDDVSQEEINWLKRL 162 Query: 106 INGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165 G+ + + ++G + + + + + +L+ Sbjct: 163 RQGLG--KSRRGLVNQLKSIVGQGPLNEDAVMEIEALLLQADTGIEATDYIISTLQNKLL 220 Query: 166 NTFVP-FQNLWYARGAFNHFVRNSIDDVLNPGG 197 +P Q L Y + + + V+NPG Sbjct: 221 EEALPPEQALEYLKSILREILDRPLQKVVNPGF 253 >gi|83312738|ref|YP_423002.1| hypothetical protein amb3639 [Magnetospirillum magneticum AMB-1] gi|82947579|dbj|BAE52443.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 614 Score = 35.2 bits (79), Expect = 7.4, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 48/166 (28%), Gaps = 20/166 (12%) Query: 28 NLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLIS 87 L +++GQF I+ ++ + + + + S Sbjct: 437 ELGKMIGQFKTFSIASTQRVALAALQQRDAAALNGSLLSLGLGALSYVAY---------S 487 Query: 88 GKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDE 147 G + P + K ++ S N+ G V G + + + EA+ Sbjct: 488 GASGRDLSDHPAVWAKEAVDRSGLLFWLSDVNNIGAKVFGYGEGPSRYASRSATEALLGP 547 Query: 148 GT-----------RKQRGKAQAQFGKELVNTFVPFQNLWYARGAFN 182 G + VPFQNL+Y R F+ Sbjct: 548 GLGAGLDTSIQVLGDASRGEWRSSDTRALRRLVPFQNLFYLRRLFD 593 >gi|39942430|ref|XP_360752.1| hypothetical protein [Magnaporthe oryzae 70-15] gi|145015699|gb|EDK00189.1| predicted protein [Magnaporthe oryzae 70-15] Length = 582 Score = 34.8 bits (78), Expect = 8.3, Method: Composition-based stats. Identities = 7/65 (10%), Positives = 17/65 (26%), Gaps = 2/65 (3%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60 + E + W++ ++ + R + P R+ G Sbjct: 61 LDEQGNSVIHPGF--SSWLSALPDPISPIERYQRELKSYPTGIVRLAKRRRDEFEGGNGG 118 Query: 61 QVYRA 65 + A Sbjct: 119 KSAHA 123 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.309 0.122 0.303 Lambda K H 0.267 0.0374 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,101,793,318 Number of Sequences: 13984884 Number of extensions: 23335402 Number of successful extensions: 106465 Number of sequences better than 10.0: 98 Number of HSP's better than 10.0 without gapping: 51 Number of HSP's successfully gapped in prelim test: 47 Number of HSP's that attempted gapping in prelim test: 106215 Number of HSP's gapped (non-prelim): 110 length of query: 218 length of database: 4,792,584,752 effective HSP length: 133 effective length of query: 85 effective length of database: 2,932,595,180 effective search space: 249270590300 effective search space used: 249270590300 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 78 (34.8 bits)