BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1142_1 (218 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2] gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus] Length = 809 Score = 455 bits (1171), Expect = e-126, Method: Compositional matrix adjust. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS Sbjct: 592 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 651 Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS Sbjct: 652 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 711 Query: 121 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA Sbjct: 712 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771 Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 809 >gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 810 Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 67/217 (30%), Positives = 115/217 (52%), Gaps = 11/217 (5%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLV---- 56 Q++ARGSVGS+++D ++ + + G + L+ L+ QFL PIS + HL +P +LV Sbjct: 591 TQDNARGSVGSSLRDTKYTSSR-GGIPGLS-LVTQFLTTPISMAEKHLWAVPKTLVGGAN 648 Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYER-F 115 G+S+ YRAK L GI+ E ++ T ++G+E DF+DP +THY+R F Sbjct: 649 GMSAWSYRAKFLAFGIVLEGIVANTARKALTGQELD-DFTDPKVLALMTARTLTHYDRFF 707 Query: 116 SPFNSSGWDVLG--PWSSQAGKLAIAGKEAVWD-EGTRKQRGKAQAQFGKELVNTFVPFQ 172 + ++ D+L P +S L AG E + G +++ + V +P + Sbjct: 708 NEYHHDFKDLLHSVPVASTVIGLGDAGLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLK 767 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 NL+Y + AF V +++ + N G + R + R+ +K Sbjct: 768 NLFYVKAAFQKMVVDNLCEYFNEGYKDRLAMNRELRK 804 >gi|315121758|ref|YP_004062247.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495160|gb|ADR51759.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 107 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 2/71 (2%) Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS--SGWDV 125 L++ EELI+ LVPLISG EP+ D + P +Y KA++N ITHYERFSP S WD+ Sbjct: 34 LLVEYANEELIKNVLVPLISGNEPRFDITSPRDYAKAIVNAITHYERFSPLGGGQSKWDI 93 Query: 126 LGPWSSQAGKL 136 LGP QAG+L Sbjct: 94 LGPALGQAGRL 104 >gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 56 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 30/55 (54%), Positives = 43/55 (78%) Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 KE++NT VPFQNLWY + F++FVR +DD +NPG RARAE YR++ +++RK+ Sbjct: 2 KEVLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRKK 56 >gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 137 Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 3/118 (2%) Query: 93 LDFSDPTEYIKALINGITHYERF-SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRK 151 +DF+DP +THY+RF + ++ D+L + + + ++ E K Sbjct: 16 IDFTDPKTLALLTARTLTHYDRFFNEYHHDFKDLLHAVPVASTIIGLGDARNIFGEDEEK 75 Query: 152 QRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 R KA A F KEL N +P +NL+YA+ AF + +++ + N G + R ++ R+ +K Sbjct: 76 -REKANANFAKELANN-IPLKNLFYAKAAFQKMIVDNLCEYFNEGYKERLDMNRELRK 131 >gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 918 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 40/139 (28%), Positives = 67/139 (48%), Gaps = 25/139 (17%) Query: 85 LISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSS-GWDVLGP---WSSQA 133 L++G +P LD + PT +++AL+ G + ++ + + SS G + GP ++ Q Sbjct: 764 LLNGNDP-LDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATMGGPVLSFAEQL 822 Query: 134 GKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSI 189 KL I ++A+ E T FG + + T PF NLWYA+ NH + + Sbjct: 823 TKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHLILQQL 873 Query: 190 DDVLNPGGRARAEVYRQRQ 208 ++ NPG R QR+ Sbjct: 874 QEMANPGYNDRVRDRAQRE 892 >gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 530 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 53/210 (25%), Positives = 94/210 (44%), Gaps = 40/210 (19%) Query: 31 RLMGQFLVMPISWS------RMHLIEIPSSLVGVSSQVYRAK---------ALVI--GIL 73 R +GQF P+S M I L G+S++ RA+ ALVI G + Sbjct: 321 RFVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMAALVITSGFM 380 Query: 74 GEELIRKTLVPLISGKEPQLDFSDPTEYIKAL----------INGITHYERFSPFNSSGW 123 G + T+ L+ GKEP+ DPT++ + I G ++ S Sbjct: 381 G--YMAMTMKDLLKGKEPR----DPTKFKTIMAGFLQGGGLGIYGDVLFKEQRDAGSVIA 434 Query: 124 DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNH 183 ++GP + L +A + A+ EG + + +A +++ +PF NL+Y + AF++ Sbjct: 435 GLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRA------ISSNIPFLNLFYIKIAFDY 488 Query: 184 FVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 + I + +NPG + E R ++ Y ++ Sbjct: 489 LIGFQIMETVNPGVLKKVE-RRMKKDYNQE 517 >gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15] gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15] Length = 918 Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 32/148 (21%) Query: 85 LISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----PWSSQA 133 L++G +P LD + PT +++AL+ G + ++ + + SS +G ++ Q Sbjct: 764 LLTGNDP-LDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSSIAATIGGPVLSFAEQL 822 Query: 134 GKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSI 189 KL I ++A+ E T FG + + T PF NLWYA+ NH + + Sbjct: 823 TKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHLILQQL 873 Query: 190 DDVLNPGGRARAEVYRQRQKYKKQRKRN 217 ++ NPG Y R + + QR+ N Sbjct: 874 QEMANPG-------YNDRVRDRAQREFN 894 >gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1] gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus] Length = 864 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 58/235 (24%), Positives = 101/235 (42%), Gaps = 42/235 (17%) Query: 1 VQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLV 56 VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ +S Sbjct: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLSNSAK 687 Query: 57 ---GVSSQV------YRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107 G S + Y A + GI G I+ L+ G++P L Y L N Sbjct: 688 MPKGASMALNHVWIQYSATMALAGI-GVASIK----ALLRGEDPSLP---EVIYDGTLAN 739 Query: 108 G--ITHYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQF 160 G + + +R + S G +LGP S L + E + + +A Sbjct: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKA-- 797 Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRK 215 + +PF N+WY + +F+H + N I + LNPG Y RQ+ KK++K Sbjct: 798 ----IRKTLPFMNMWYLKNSFDHLILNQILEELNPG-------YLDRQQSKKKKK 841 >gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] Length = 924 Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/148 (23%), Positives = 63/148 (42%), Gaps = 23/148 (15%) Query: 67 ALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFN 119 A V G + + L+SG +P LD + P +++AL+ G + ++ + + Sbjct: 752 AYVAGTTLAGMFANQMNALLSGNDP-LDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYG 810 Query: 120 SSGWDVLGP----WSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQ 172 SS +LG ++ Q K + ++K + F + + T PF Sbjct: 811 SSIAGILGGPVLGFAEQLSKTVLTN--------SQKAMAGEETTFTADALKTARMITPFA 862 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRAR 200 NLWY + NH + + ++ NPG AR Sbjct: 863 NLWYTKAITNHLILQQLQEMANPGYNAR 890 >gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 841 Score = 44.7 bits (104), Expect = 0.010, Method: Compositional matrix adjust. Identities = 41/149 (27%), Positives = 66/149 (44%), Gaps = 21/149 (14%) Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127 L L++G +PQ D +DP + ++++ + G + ++SG D V G Sbjct: 679 LKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLSFLGDILVAGTDTSGRDAHSFVAG 738 Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185 P S L G ++EG G QF V +P QNLWY + A N V Sbjct: 739 PLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAFQF----VKRKIPAQNLWYTKAAINRMV 794 Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQR 214 + I D + PG R +A + + +K ++R Sbjct: 795 FDEIQDFIAPGYREKA-LRKAEEKQDRER 822 >gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 1175 Score = 43.9 bits (102), Expect = 0.015, Method: Compositional matrix adjust. Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 20/144 (13%) Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127 L +++G +PQ D +DP + ++++L+ G + + ++SG D V G Sbjct: 1013 LREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSG 1072 Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185 P S L G ++EG G +F V +P QNLWY + A N V Sbjct: 1073 PLGSDFTSLLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMV 1128 Query: 186 RNSIDDVLNPGGRARAEVYRQRQK 209 + + D + PG R +A +RQ+ Sbjct: 1129 FDEMQDTIAPGYREKALRKAERQQ 1152 >gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans'] gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 824 Score = 42.7 bits (99), Expect = 0.039, Method: Compositional matrix adjust. Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 5/66 (7%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206 EG +Q G +F K ++ P QNLWY + F+H V N + ++ +PG R E R Sbjct: 752 EGKPEQTGGDLVKFAKGMI----PGQNLWYTKAVFDHMVFNQLQEIFSPGYLRRME-KRS 806 Query: 207 RQKYKK 212 R+++ + Sbjct: 807 RKEFNQ 812 >gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 841 Score = 42.4 bits (98), Expect = 0.040, Method: Compositional matrix adjust. Identities = 39/148 (26%), Positives = 66/148 (44%), Gaps = 20/148 (13%) Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127 L L++G +PQ D +DP + +I++ + G + ++SG D V G Sbjct: 679 LKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLSFLGDILVAGTDTSGRDANSFVAG 738 Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185 P + L G ++EG G +F V +P QNLWY + A N V Sbjct: 739 PLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMV 794 Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 + + D + PG R +A +RQ+ +++ Sbjct: 795 FDEMQDTIAPGYREKALRKAERQQDRER 822 >gi|294843482|ref|ZP_06788165.1| putative phage related protein [Acinetobacter sp. 6014059] Length = 841 Score = 42.4 bits (98), Expect = 0.048, Method: Compositional matrix adjust. Identities = 38/148 (25%), Positives = 67/148 (45%), Gaps = 20/148 (13%) Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127 L +++G +PQ D +DP + ++++L+ G + + ++SG D V G Sbjct: 679 LREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSG 738 Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185 P S L G ++EG G +F V +P QNLWY + A N Sbjct: 739 PLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMF 794 Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 + + D + PG R +A +RQ+ +++ Sbjct: 795 FDEVQDTIAPGYREKALRKAERQQDRER 822 >gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] Length = 838 Score = 41.6 bits (96), Expect = 0.072, Method: Compositional matrix adjust. Identities = 56/204 (27%), Positives = 88/204 (43%), Gaps = 38/204 (18%) Query: 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLV----- 83 L R + F MPI+ H G+S R+KA IG L ++ T++ Sbjct: 623 LTRSVFLFKTMPIAMLMRHWER------GMSGPDARSKAGYIGAL---MVSTTVMGMLAL 673 Query: 84 ---PLISGKEPQLDFSDPTE-------YIKALING----ITHYERFSPFNSSGWDVLGPW 129 L+ G++P +P E +++A + G I FS N G GP Sbjct: 674 QIDELLKGRDPV--NMNPFEGKAGARNWVRAFLKGGSLGIYGDFLFSEQNQHGG---GPI 728 Query: 130 SSQAGKLAIAGKEAV-WDEGTRKQRGKAQ-AQFGKELVN---TFVPFQNLWYARGAFNHF 184 +S G + A +EA +G Q G+ + G EL+ P NLWY + A NH Sbjct: 729 ASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELLKFAKGMTPGANLWYLKAATNHL 788 Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208 + N + ++++PG AR + QR+ Sbjct: 789 IFNQLQEMVSPGYLARVKSRAQRE 812 >gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] Length = 921 Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust. Identities = 32/134 (23%), Positives = 55/134 (41%), Gaps = 9/134 (6%) Query: 82 LVPLISGKEPQLDFSDPTEYIKALING----ITHYERFSPFNSSGWDVLGPWSSQAGKLA 137 L L+SG +P +D + P ++ A + G I F G + + LA Sbjct: 764 LNALLSGNDP-IDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATLGGPSLGLA 822 Query: 138 IAGKEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSIDDVLN 194 + + + + +G+ + FG + + T PF NLWY + NH + + ++ N Sbjct: 823 ESLMKLLITNPQKAMQGE-ETSFGADAIKTARMITPFANLWYTKAVTNHLILQQLQEMAN 881 Query: 195 PGGRARAEVYRQRQ 208 PG R Q Q Sbjct: 882 PGYNDRVRDRAQNQ 895 >gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1] gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 855 Score = 40.0 bits (92), Expect = 0.24, Method: Compositional matrix adjust. Identities = 35/122 (28%), Positives = 49/122 (40%), Gaps = 9/122 (7%) Query: 85 LISGKEPQLDFSDPTEYIKALING----ITHYERFSPFNSSGWDVLGPWSSQAGKLAIAG 140 + G+EP+ DP ++ A++ G I F N G L S AG I Sbjct: 712 VTKGREPR-PADDPKTWLAAMVQGGGLGIFGDYLFGEANRFGNSAL---ESAAGP-TIGT 766 Query: 141 KEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRAR 200 V + R + G A L PF NL+Y R A +H S+ + +NPG R Sbjct: 767 AADVINLWARAKEGDDTASSALRLAQNNTPFMNLFYTRIALDHLFLYSVQEAMNPGSLRR 826 Query: 201 AE 202 E Sbjct: 827 TE 828 >gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 854 Score = 39.7 bits (91), Expect = 0.27, Method: Compositional matrix adjust. Identities = 19/63 (30%), Positives = 34/63 (53%), Gaps = 10/63 (15%) Query: 158 AQFGKELVNTF---VPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQR 214 + +G E VN +PFQNLWY+R F+ V + ++ + G YR+R++ +++ Sbjct: 778 SSYGAEAVNVVKNNIPFQNLWYSRLVFDRLVIAEMQELFDEG-------YRERKQRRQEN 830 Query: 215 KRN 217 N Sbjct: 831 NHN 833 >gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] Length = 995 Score = 39.3 bits (90), Expect = 0.35, Method: Compositional matrix adjust. Identities = 48/184 (26%), Positives = 72/184 (39%), Gaps = 37/184 (20%) Query: 31 RLMGQFLVMPIS-----WSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEEL---IRKTL 82 R +GQF P++ W R L G RA +V ++ + + L Sbjct: 792 RFVGQFKAFPVAVISKVWGR--------DLYGGERGWGRAAGIVHTLVATTVMGYVAGML 843 Query: 83 VPLISGKEPQLDFSDPTEYIKALING----------ITHYERFSPFNSSGWDVLGPWSSQ 132 L G+ P+ D +DP + A + G + Y RF N GP S Sbjct: 844 KDLSKGRAPR-DPTDPRAWGAAFLQGGGAGIYGDFLLGQYSRFG--NRFLESAAGPTLSS 900 Query: 133 AGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDV 192 AG+L +W G R+ + A L NT PF NL+Y R A ++ + + Sbjct: 901 AGELL-----NIW-AGAREGNDEKAATLRWTLSNT--PFVNLFYTRMALDYLFLYQVQEA 952 Query: 193 LNPG 196 +NPG Sbjct: 953 MNPG 956 >gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] Length = 823 Score = 38.1 bits (87), Expect = 0.89, Method: Compositional matrix adjust. Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 4/56 (7%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAE 202 EG +Q G +F K L+ P QNLWY + +H V N + + +PG R E Sbjct: 750 EGKPEQTGGDTVKFVKGLI----PGQNLWYTKAVLDHMVFNQLQEYFSPGYLRRME 801 >gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 831 Score = 37.7 bits (86), Expect = 1.1, Method: Compositional matrix adjust. Identities = 45/193 (23%), Positives = 80/193 (41%), Gaps = 43/193 (22%) Query: 43 WSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYI 102 W R+ IE + S+ V+ G+L + L+ +++G++P+ D D ++ Sbjct: 639 WKRVSQIESTGGKLAYSASVF------TGLLMAGAMTNQLMDIMNGRDPR-DMKDGKFWL 691 Query: 103 KALI-------------NGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGT 149 +A++ G+ R N +G +LGP A + + +V+ E T Sbjct: 692 QAMLRGGGVGIFGDILNTGLGGDNRGGQSNLTG--LLGPVYGTAADVGLT-LGSVFKEKT 748 Query: 150 RKQRGKAQAQFGKELV-----NTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVY 204 A G L+ NT PF WY + AF H V + + ++L+PG Y Sbjct: 749 EP------ADVGANLLRIGYQNT--PFIRSWYTKAAFEHAVMHDMQEMLSPG-------Y 793 Query: 205 RQRQKYKKQRKRN 217 R K + ++ N Sbjct: 794 LSRMKKRAKKDFN 806 >gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 582 Score = 36.6 bits (83), Expect = 2.3, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 510 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 555 >gi|322703038|gb|EFY94654.1| hypothetical protein MAA_09875 [Metarhizium anisopliae ARSEF 23] Length = 303 Score = 36.6 bits (83), Expect = 2.6, Method: Compositional matrix adjust. Identities = 18/76 (23%), Positives = 34/76 (44%) Query: 116 SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175 SPF+ + P K ++ G+ VW+ QR K + G E ++ V + + Sbjct: 16 SPFDDMDTESQKPEPQSPRKPSVGGESVVWEPFGIPQRNKLRLAVGPERISIVVDYWAIE 75 Query: 176 YARGAFNHFVRNSIDD 191 + +H +R ++DD Sbjct: 76 HISPVLHHMIRRALDD 91 >gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 824 Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] Length = 824 Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605] gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605] Length = 824 Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 825 Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 753 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 798 >gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 824 Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v] Length = 824 Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14] Length = 824 Score = 36.2 bits (82), Expect = 3.2, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1] gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252] Length = 824 Score = 36.2 bits (82), Expect = 3.3, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2] Length = 824 Score = 36.2 bits (82), Expect = 3.3, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L+ P NLWY + A +H + N + + +PG Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797 >gi|118590567|ref|ZP_01547969.1| hypothetical protein SIAM614_03291 [Stappia aggregata IAM 12614] gi|118437030|gb|EAV43669.1| hypothetical protein SIAM614_03291 [Stappia aggregata IAM 12614] Length = 317 Score = 35.4 bits (80), Expect = 5.9, Method: Compositional matrix adjust. Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 2/66 (3%) Query: 110 THYERFSPFNSSGW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTF 168 +H + P W D+ G S+ G AI G WD+ + G ++G+ L+N Sbjct: 95 SHKWQHEPIPPQAWADLFGELSAPLGTHAILGNHDWWDDADAQLTGGGPTKYGQALLNAG 154 Query: 169 VP-FQN 173 +P +QN Sbjct: 155 IPLYQN 160 >gi|307942811|ref|ZP_07658156.1| metallophosphoesterase [Roseibium sp. TrichSKD4] gi|307773607|gb|EFO32823.1| metallophosphoesterase [Roseibium sp. TrichSKD4] Length = 318 Score = 35.0 bits (79), Expect = 6.7, Method: Compositional matrix adjust. Identities = 26/101 (25%), Positives = 44/101 (43%), Gaps = 5/101 (4%) Query: 110 THYERFSPFNSSGW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTF 168 +H ++ P W D+ G + G A+ G WD+ + G ++G+ L+N Sbjct: 95 SHKWQYEPIEPQAWADIFGDLRAPLGVHAVLGNHDWWDDKDAQLTGYGPTKYGQALINAG 154 Query: 169 VP-FQNLWYARGAFNH-FVRNSIDD--VLNPGGRARAEVYR 205 +P +QN H F +DD L P RA+ + +R Sbjct: 155 IPLYQNRATRLSKDGHSFWLAGLDDQLALYPSRRAKRKSWR 195 >gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine microorganism HF4000_48F7] Length = 828 Score = 35.0 bits (79), Expect = 7.1, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 5/114 (4%) Query: 106 INGITHYERFSPFNSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164 I G + + +++S D+L GP S LA G +D T A A G Sbjct: 705 IAGDFLFNDYRQYSTSYVDLLAGPSGSSLNDLAEFGA-TTFDVATGGDPVDAAAA-GWRA 762 Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218 V +P+ N W +R F++ + + ++LNPG R E R+ ++ Q R G Sbjct: 763 VKGNIPYANWWASRTLFDYLINYQVQEILNPGSLRRME--RRFKQKNNQDYRAG 814 >gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str. E2348/69] gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 824 Score = 34.7 bits (78), Expect = 8.3, Method: Compositional matrix adjust. Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 4/50 (8%) Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196 EG +Q G + GK L P N+WY + A +H + N + + +PG Sbjct: 752 EGKSEQTGGDLVKLGKGLT----PGANIWYLKAALDHMIFNQMQEYFSPG 797 Searching..................................................done Results from round 2 >gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2] gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus] Length = 809 Score = 315 bits (807), Expect = 3e-84, Method: Composition-based stats. Identities = 218/218 (100%), Positives = 218/218 (100%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS Sbjct: 592 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 651 Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS Sbjct: 652 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 711 Query: 121 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA Sbjct: 712 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771 Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 809 >gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 810 Score = 227 bits (577), Expect = 1e-57, Method: Composition-based stats. Identities = 67/220 (30%), Positives = 116/220 (52%), Gaps = 11/220 (5%) Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVG--- 57 Q++ARGSVGS+++D ++ + + G + L+ L+ QFL PIS + HL +P +LVG Sbjct: 591 TQDNARGSVGSSLRDTKYTSSR-GGIPGLS-LVTQFLTTPISMAEKHLWAVPKTLVGGAN 648 Query: 58 -VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERF- 115 +S+ YRAK L GI+ E ++ T ++G+E DF+DP +THY+RF Sbjct: 649 GMSAWSYRAKFLAFGIVLEGIVANTARKALTGQELD-DFTDPKVLALMTARTLTHYDRFF 707 Query: 116 SPFNSSGWDVLG--PWSSQAGKLAIAGKEAVWD-EGTRKQRGKAQAQFGKELVNTFVPFQ 172 + ++ D+L P +S L AG E + G +++ + V +P + Sbjct: 708 NEYHHDFKDLLHSVPVASTVIGLGDAGLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLK 767 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 NL+Y + AF V +++ + N G + R + R+ +K + Sbjct: 768 NLFYVKAAFQKMVVDNLCEYFNEGYKDRLAMNRELRKSRS 807 >gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter asiaticus str. psy62] gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1] gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus] Length = 864 Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats. Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 35/228 (15%) Query: 1 VQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLV 56 VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ +S Sbjct: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLSNSAK 687 Query: 57 ---GVSSQV------YRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107 G S + Y A + GI G I+ L+ G++P L Y L N Sbjct: 688 MPKGASMALNHVWIQYSATMALAGI-GVASIK----ALLRGEDPSLP---EVIYDGTLAN 739 Query: 108 G--ITHYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQF 160 G + + +R + S G +LGP S L + E + + +A Sbjct: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKA-- 797 Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 + +PF N+WY + +F+H + N I + LNPG R + ++++ Sbjct: 798 ----IRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841 >gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233] Length = 530 Score = 165 bits (417), Expect = 4e-39, Method: Composition-based stats. Identities = 51/214 (23%), Positives = 92/214 (42%), Gaps = 36/214 (16%) Query: 25 SVNNLARLMGQFLVMPISWS------RMHLIEIPSSLVGVSSQVYRAK---------ALV 69 + R +GQF P+S M I L G+S++ RA+ ALV Sbjct: 315 GMGEAIRFVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMAALV 374 Query: 70 IGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKAL----------INGITHYERFSPFN 119 I + T+ L+ GKEP+ DPT++ + I G ++ Sbjct: 375 ITSGFMGYMAMTMKDLLKGKEPR----DPTKFKTIMAGFLQGGGLGIYGDVLFKEQRDAG 430 Query: 120 SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG 179 S ++GP + L +A + A+ EG + + +A +++ +PF NL+Y + Sbjct: 431 SVIAGLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRA------ISSNIPFLNLFYIKI 484 Query: 180 AFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 AF++ + I + +NPG + E R ++ Y ++ Sbjct: 485 AFDYLIGFQIMETVNPGVLKKVE-RRMKKDYNQE 517 >gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 137 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. Identities = 32/122 (26%), Positives = 61/122 (50%), Gaps = 3/122 (2%) Query: 92 QLDFSDPTEYIKALINGITHYERF-SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTR 150 +DF+DP +THY+RF + ++ D+L + + + ++ E Sbjct: 15 SIDFTDPKTLALLTARTLTHYDRFFNEYHHDFKDLLHAVPVASTIIGLGDARNIFGEDEE 74 Query: 151 KQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210 K R KA A F KE + +P +NL+YA+ AF + +++ + N G + R ++ R+ +K Sbjct: 75 K-REKANANFAKE-LANNIPLKNLFYAKAAFQKMIVDNLCEYFNEGYKERLDMNRELRKS 132 Query: 211 KK 212 + Sbjct: 133 RS 134 >gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15] gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15] Length = 918 Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T L + F P + R L+ + L V + + A + G + Sbjct: 701 TYARDDAGQLIKSFMLFKTTPFAGFR-QLVNRANDLDTVPAIKFLASY-IAGTTLAGMFA 758 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----P 128 + L++G +P LD + PT +++AL+ G + ++ + + SS +G Sbjct: 759 NQMNSLLTGNDP-LDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSSIAATIGGPVLS 817 Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184 ++ Q KL I ++A+ E T FG + + T PF NLWYA+ NH Sbjct: 818 FAEQLTKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHL 868 Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208 + + ++ NPG R QR+ Sbjct: 869 ILQQLQEMANPGYNDRVRDRAQRE 892 >gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 918 Score = 128 bits (321), Expect = 5e-28, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 83/204 (40%), Gaps = 27/204 (13%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T L + F P + R L+ L V + + A + G + Sbjct: 701 TYARDDAGELMKSFMLFKTTPFAGFR-QLVNRTRDLDTVPAIKFLASY-IGGTTLAGMFA 758 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----P 128 + L++G +P LD + PT +++AL+ G + ++ + + SS +G Sbjct: 759 IQMNSLLNGNDP-LDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATMGGPVLS 817 Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184 ++ Q KL I ++A+ E T FG + + T PF NLWYA+ NH Sbjct: 818 FAEQLTKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHL 868 Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208 + + ++ NPG R QR+ Sbjct: 869 ILQQLQEMANPGYNDRVRDRAQRE 892 >gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3] Length = 924 Score = 120 bits (301), Expect = 1e-25, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 83/201 (41%), Gaps = 27/201 (13%) Query: 23 DGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTL 82 + +L + F P++ R + + L + + + A A V G + + Sbjct: 710 RDTSGDLLKSFMLFKTTPMAGMRQFVTRL-QDLETMPAVKFFA-AYVAGTTLAGMFANQM 767 Query: 83 VPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLGP----WSS 131 L+SG +P LD + P +++AL+ G + ++ + + SS +LG ++ Sbjct: 768 NALLSGNDP-LDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYGSSIAGILGGPVLGFAE 826 Query: 132 QAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRN 187 Q K + ++A+ E T F + + T PF NLWY + NH + Sbjct: 827 QLSKTVLTNSQKAMAGEET---------TFTADALKTARMITPFANLWYTKAITNHLILQ 877 Query: 188 SIDDVLNPGGRARAEVYRQRQ 208 + ++ NPG AR R+ Sbjct: 878 QLQEMANPGYNARVRDRAMRE 898 >gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB] Length = 921 Score = 114 bits (285), Expect = 9e-24, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 77/204 (37%), Gaps = 27/204 (13%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T L + F P + R ++ +L V + + A A + G + Sbjct: 704 TYARDQGGELYKSFMLFKTTPFAGFR-QMVTRAQNLDRVPALKFLA-AYIGGTTLTGMFA 761 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLGP---- 128 L L+SG +P +D + P ++ A + G ++ + + SS LG Sbjct: 762 NQLNALLSGNDP-IDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATLGGPSLG 820 Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184 + KL I ++A+ E T FG + + T PF NLWY + NH Sbjct: 821 LAESLMKLLITNPQKAMQGEET---------SFGADAIKTARMITPFANLWYTKAVTNHL 871 Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208 + + ++ NPG R Q Q Sbjct: 872 ILQQLQEMANPGYNDRVRDRAQNQ 895 >gi|315121758|ref|YP_004062247.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495160|gb|ADR51759.1| hypothetical protein CKC_00040 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 107 Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 45/98 (45%), Positives = 58/98 (59%), Gaps = 6/98 (6%) Query: 42 SWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEY 101 S L +L+G SS + L++ EELI+ LVPLISG EP+ D + P +Y Sbjct: 12 SLFPHFLFVRSKALLGRSSIL----ILLVEYANEELIKNVLVPLISGNEPRFDITSPRDY 67 Query: 102 IKALINGITHYERFSPFNS--SGWDVLGPWSSQAGKLA 137 KA++N ITHYERFSP S WD+LGP QAG+L Sbjct: 68 AKAIVNAITHYERFSPLGGGQSKWDILGPALGQAGRLG 105 >gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS] Length = 838 Score = 86.1 bits (211), Expect = 3e-15, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 80/218 (36%), Gaps = 27/218 (12%) Query: 6 RGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRA 65 R ++ S +Q W L R + F MPI+ H E S S+ Sbjct: 607 RAALYSNLQRGTW-------KGELTRSVFLFKTMPIAMLMRH-WERGMSGPDARSKAGYI 658 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQ-----LDFSDPTEYIKALINGITH-------YE 113 AL++ ++ + L+ G++P + +++A + G + + Sbjct: 659 GALMVSTTVMGMLALQIDELLKGRDPVNMNPFEGKAGARNWVRAFLKGGSLGIYGDFLFS 718 Query: 114 RFSPFNSS-GWDVLGPWSSQAGK-LAIAGKEAVW-DEGTRKQRGKAQAQFGKELVNTFVP 170 + LGP + + V +G G +F K + P Sbjct: 719 EQNQHGGGPIASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELLKFAKGM----TP 774 Query: 171 FQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208 NLWY + A NH + N + ++++PG AR + QR+ Sbjct: 775 GANLWYLKAATNHLIFNQLQEMVSPGYLARVKSRAQRE 812 >gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407] Length = 825 Score = 84.5 bits (207), Expect = 8e-15, Method: Composition-based stats. Identities = 48/224 (21%), Positives = 84/224 (37%), Gaps = 37/224 (16%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS H S +G+ S RA + Sbjct: 611 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYI 659 Query: 69 ---VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFS 116 + + + LI+G+ P+ D +I A + G + + Sbjct: 660 ATFLASTTMLGALSMQITDLINGRNPKEMTGDHMVKFWINAFLKGGGAGLYGDFLFSDHT 719 Query: 117 PFNS-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQN 173 + S + +LGP + + + + EG +Q G + GK L +P N Sbjct: 720 RYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGAN 775 Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 LWY + A +H + N + + +PG + E + +++ N Sbjct: 776 LWYLKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 812 >gi|332160979|ref|YP_004297556.1| hypothetical protein YE105_C1357 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665209|gb|ADZ41853.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862135|emb|CBX72299.1| hypothetical protein YEW_AK02360 [Yersinia enterocolitica W22703] Length = 841 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 72/201 (35%), Gaps = 13/201 (6%) Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79 T + + R QF PI+ H + G Y A + L + Sbjct: 624 TTRGTWSGEIWRSATQFKSFPIAMVMRHAHR-ALAQDGAGKGTYAAAIIAASTLLGG-MA 681 Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH--YERF-----SPFNSSG-WDVLGPWSS 131 L + SG++P+ D + P + A + G Y F + +S + GP + Sbjct: 682 IQLNEIASGRDPR-DMTKPEFWGGAFLKGGALGLYGDFLLTNQTQGGNSFIASIGGPLAG 740 Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191 + + A + + A + P NLWYA+ A +H + + I + Sbjct: 741 DIESVVKMTQGAAFK--AIDGKDPHTAANVVRFIKGHTPGANLWYAKAALDHMIFHDIQE 798 Query: 192 VLNPGGRARAEVYRQRQKYKK 212 +PG +R Q++ ++ Sbjct: 799 QFSPGYLSRMRQRAQKEYDQQ 819 >gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 824 Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 31/221 (14%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS H Y A L Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFL 662 Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119 + + + LI+G+ P+ D +I A + G + + + Sbjct: 663 ASTTML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 120 S-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176 S + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWY 777 Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302] gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302] Length = 824 Score = 83.8 bits (205), Expect = 2e-14, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 31/221 (14%) Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68 VGS +Q W L R + F PIS H Y A L Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFL 662 Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119 + + + LI+G+ P+ D +I A + G + + + Sbjct: 663 ASTTML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721 Query: 120 S-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176 S + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWY 777 Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605] gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605] Length = 824 Score = 82.6 bits (202), Expect = 4e-14, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 79/217 (36%), Gaps = 28/217 (12%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H Y A L Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFLASTT 666 Query: 73 LGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFNS-SG 122 + + + LI+G+ P+ D +I A + G + + + S + Sbjct: 667 ML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYGSGAL 725 Query: 123 WDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180 +LGP + + + EG +Q G + GK L +P NLWY + A Sbjct: 726 ASMLGPVVGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYLKAA 781 Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 +H + N + + +PG + E + +++ N Sbjct: 782 LDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans'] gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 824 Score = 81.8 bits (200), Expect = 7e-14, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 67/195 (34%), Gaps = 19/195 (9%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLI 86 L R + F PI+ H + Y A L + + + + +I Sbjct: 621 GELVRSVFLFKSFPIAVMMRHWSRALNMPSAGGRAAYLAAFLASTTVL-GAMSQQISEVI 679 Query: 87 SGKEPQLDFSDPTEYI----------KALINGITHYERFSPFNS-SGWDVLGPWSSQAGK 135 +G+ P+ D + A + G + + S + +LGP + Sbjct: 680 AGRNPR-DITGDKALQFWVNAFLKGGGAGLYGDFLLSDHTRYGSGALASMLGPVAGVVDD 738 Query: 136 LAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVL 193 + + + + + + +P QNLWY + F+H V N + ++ Sbjct: 739 ----AIKLLQGIPLNAVEGKPEQTGGDLVKFAKGMIPGQNLWYTKAVFDHMVFNQLQEIF 794 Query: 194 NPGGRARAEVYRQRQ 208 +PG R E +++ Sbjct: 795 SPGYLRRMEKRSRKE 809 >gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131] Length = 823 Score = 81.1 bits (198), Expect = 1e-13, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 70/198 (35%), Gaps = 25/198 (12%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGE---ELIRKTLV 83 + R F PIS H +G+ S R L I G I + + Sbjct: 619 GEIVRSFFLFKSFPISVVVRHW----KRALGIQSAGGRVAYLAAFIAGTTVLGAISQQIN 674 Query: 84 PLISGKEPQLDFSDP----------TEYIKALINGITHYERFSPFNSS-GWDVLGPWSSQ 132 + SG+ P+ D +D + + G + + S +LGP + Sbjct: 675 DISSGRNPR-DMADENWHKFWLNALLKGGGLGLYGDFLLSDHTKYGSDAFASLLGPVAGV 733 Query: 133 AGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSID 190 + + EG +Q G +F V +P QNLWY + +H V N + Sbjct: 734 VDDAIKLAQGIPLNAVEGKPEQTGGDTVKF----VKGLIPGQNLWYTKAVLDHMVFNQLQ 789 Query: 191 DVLNPGGRARAEVYRQRQ 208 + +PG R E +++ Sbjct: 790 EYFSPGYLRRMEKRSKKE 807 >gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510] Length = 995 Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 46/199 (23%), Positives = 71/199 (35%), Gaps = 25/199 (12%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALV---IGILGEELIRKTLV 83 R +GQF P++ + L G RA +V + + L Sbjct: 788 GEALRFVGQFKAFPVAVISK-VW--GRDLYGGERGWGRAAGIVHTLVATTVMGYVAGMLK 844 Query: 84 PLISGKEPQLDFSDPTEYIKAL-------INGITHYERFSPFNSSG-WDVLGPWSSQAGK 135 L G+ P+ D +DP + A I G ++S F + GP S AG+ Sbjct: 845 DLSKGRAPR-DPTDPRAWGAAFLQGGGAGIYGDFLLGQYSRFGNRFLESAAGPTLSSAGE 903 Query: 136 LAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNP 195 L A EG ++ + + PF NL+Y R A ++ + + +NP Sbjct: 904 LL--NIWAGAREGNDEKAATLRWTL------SNTPFVNLFYTRMALDYLFLYQVQEAMNP 955 Query: 196 GGRARAEVYRQRQKYKKQR 214 G R E K QR Sbjct: 956 GFLRRFEQR--VAKDNNQR 972 >gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 56 Score = 78.4 bits (191), Expect = 7e-13, Method: Composition-based stats. Identities = 30/55 (54%), Positives = 43/55 (78%) Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 KE++NT VPFQNLWY + F++FVR +DD +NPG RARAE YR++ +++RK+ Sbjct: 2 KEVLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRKK 56 >gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1] gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1] Length = 855 Score = 78.4 bits (191), Expect = 7e-13, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 77/221 (34%), Gaps = 39/221 (17%) Query: 22 KDGSVNN-LARLMGQFLVMPISWSRMHL-----------IEIPSSLVG-----VSSQVYR 64 + G+V L R + QF P ++ + L + +S G + + Sbjct: 627 QPGTVPGDLLRFVTQFKSFPAAYMQKTLGRELYGRGYTPTALGNSFRGGRDLVQALRNGN 686 Query: 65 AKALVIGILGE-----ELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------Y 112 + L + L + + G+EP+ DP ++ A++ G + Sbjct: 687 GERLALAQLMLWTTAFGYLSMASKDVTKGREPR-PADDPKTWLAAMVQGGGLGIFGDYLF 745 Query: 113 ERFSPFN-SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 + F S+ GP A + A + T A L PF Sbjct: 746 GEANRFGNSALESAAGPTIGTAADVINLWARAKEGDDT--------ASSALRLAQNNTPF 797 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212 NL+Y R A +H S+ + +NPG R E ++Q ++ Sbjct: 798 MNLFYTRIALDHLFLYSVQEAMNPGSLRRTEERIRQQNGQE 838 >gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 831 Score = 77.6 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 72/189 (38%), Gaps = 18/189 (9%) Query: 36 FLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF 95 F ++ H + Y A G+L + L+ +++G++P+ D Sbjct: 627 FKSFGLAMFERHWKRVSQIESTGGKLAYSASVFT-GLLMAGAMTNQLMDIMNGRDPR-DM 684 Query: 96 SDPTEYIKALINGIT--HYERFSPFN---------SSGWDVLGPWSSQAGKLAIAGKEAV 144 D +++A++ G + S+ +LGP A + + Sbjct: 685 KDGKFWLQAMLRGGGVGIFGDILNTGLGGDNRGGQSNLTGLLGPVYGTAADVGLTLGSVF 744 Query: 145 WDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVY 204 ++ G + G + PF WY + AF H V + + ++L+PG +R + Sbjct: 745 KEKTEPADVGANLLRIGYQ----NTPFIRSWYTKAAFEHAVMHDMQEMLSPGYLSRMK-K 799 Query: 205 RQRQKYKKQ 213 R ++ + ++ Sbjct: 800 RAKKDFNQR 808 >gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 841 Score = 77.6 bits (189), Expect = 1e-12, Method: Composition-based stats. Identities = 45/221 (20%), Positives = 83/221 (37%), Gaps = 19/221 (8%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66 V + +++K I G G++ + R + QF ++ H + G+ + A Sbjct: 605 VEAGLREKTLINVGARGTITGEIVRGLAQFKSFSAAFLMRHGSRAFAQ-EGIKGKAGYAV 663 Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118 L + + + L L++G +PQ D +DP + I G++ Sbjct: 664 PLFVTLTLLGGLVVQLKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLSFLGDILVA 723 Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172 + V GP + L + K F + V +P Q Sbjct: 724 GTDTSGRDANSFVAGPLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 781 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 NLWY + A N V + + D + PG R +A +RQ+ +++ Sbjct: 782 NLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 822 >gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624] Length = 841 Score = 74.9 bits (182), Expect = 7e-12, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 81/221 (36%), Gaps = 19/221 (8%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66 + + +++K I G G++ + R + QF ++ H + Y Sbjct: 605 IEAGLREKTLINVGARGTITGEIFRGIVQFKSFSAAFLMRHGSRTMAQEGLKGKAAYAIP 664 Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118 V+ L L+ + L L++G +PQ D +DP + + G++ Sbjct: 665 LFVMTTLLGGLVVQ-LKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLSFLGDILVA 723 Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172 + V GP S L + K F + V +P Q Sbjct: 724 GTDTSGRDAHSFVAGPLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAF--QFVKRKIPAQ 781 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 NLWY + A N V + I D + PG R +A + ++ +++ Sbjct: 782 NLWYTKAAINRMVFDEIQDFIAPGYREKALRKAEEKQDRER 822 >gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 1175 Score = 74.5 bits (181), Expect = 9e-12, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 84/221 (38%), Gaps = 19/221 (8%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66 + + ++++ W+T G G++ + + + QF S M + G+ + A Sbjct: 939 IEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGLKGKAAYAI 997 Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118 L++ + + L +++G +PQ D +DP + + G+ Sbjct: 998 PLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVA 1057 Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172 + V GP S L + K F + V +P Q Sbjct: 1058 GTDTSGRDANSFVSGPLGSDFTSLLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 1115 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 NLWY + A N V + + D + PG R +A +RQ+ +++ Sbjct: 1116 NLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 1156 >gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112] Length = 582 Score = 74.5 bits (181), Expect = 9e-12, Method: Composition-based stats. Identities = 46/220 (20%), Positives = 83/220 (37%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 365 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 420 Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120 + + L L SG+ P+ + D ++ + G + + S Sbjct: 421 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 480 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + + GP + + + + EG +Q G + GK L +P NLWY Sbjct: 481 GALASMFGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 536 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 537 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 569 >gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2] Length = 824 Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662 Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120 + + L L SG+ P+ + D ++ + G + + S Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14] Length = 824 Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662 Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120 + + L L SG+ P+ + D ++ + G + + S Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10] Length = 824 Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGIPSAGGRAAYIATFI 662 Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120 + + L L SG+ P+ + D ++ + G + + S Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGGDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 778 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1] gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252] Length = 824 Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662 Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120 + + L L SG+ P+ + D ++ + G + + S Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|303328566|ref|ZP_07359001.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861332|gb|EFL84271.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 855 Score = 74.1 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 76/215 (35%), Gaps = 33/215 (15%) Query: 25 SVNNLARLMGQFLVMPISWSRMHL----IEIPSSLVGV-----------SSQVYRAKALV 69 + R + QF PI++ + L G+ + R + Sbjct: 632 GAGEVWRAIMQFKSFPIAYMQRVLGGRRWVRGDLQRGMRYGPRNLPGAVEDALTRDMGGL 691 Query: 70 IGILGE----ELIRKTLVPLISGKEPQLDFSDPTEYIKAL------INGITHYERFSPFN 119 +G + TL L G+EP+ T A+ I G + + + F Sbjct: 692 MGFVLSSVAFGYASMTLKDLAKGREPRSLAHRETWLAAAMQSGGAGIFGDILFGKVNRFG 751 Query: 120 SSGWDV-LGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYAR 178 +S + +GP G A G + V + + G PF NLWY R Sbjct: 752 NSFAETAVGPLGGLIGDAATLGGQLVRGDMADAGEDTLRLAMG------NAPFINLWYTR 805 Query: 179 GAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 A + + + ++++PG R E + ++++ ++ Sbjct: 806 AALDWMLLYHVREMMSPGTLRRTE-RKMKKEFGQE 839 >gi|294843482|ref|ZP_06788165.1| putative phage related protein [Acinetobacter sp. 6014059] Length = 841 Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 83/221 (37%), Gaps = 19/221 (8%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66 + + ++++ W+T G G++ + + + QF S M + G+ + A Sbjct: 605 IEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGLKGKAAYAI 663 Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118 L++ + + L +++G +PQ D +DP + + G+ Sbjct: 664 PLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVA 723 Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172 + V GP S L + K F + V +P Q Sbjct: 724 GTDTSGRDANSFVSGPLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 781 Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 NLWY + A N + + D + PG R +A +RQ+ +++ Sbjct: 782 NLWYTKAAINRMFFDEVQDTIAPGYREKALRKAERQQDRER 822 >gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str. E2348/69] gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] Length = 824 Score = 72.6 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 74/206 (35%), Gaps = 30/206 (14%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGE---ELIRKTLV 83 L R + F PIS H S +G+ S RA + I + + L Sbjct: 621 GELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFIASTTILGALSQQLN 676 Query: 84 PLISGKEPQ---------LDFSDPTEYIKALINGITHYERFSPFNS-SGWDVLGPWSSQA 133 + SG+ P+ + + G + + S + +LGP + Sbjct: 677 DMASGRNPRDMVGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGSGALASMLGPVAGLV 736 Query: 134 GKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191 + G+ + EG +Q G + GK L P N+WY + A +H + N + + Sbjct: 737 DDVIKIGQGIPLNAVEGKSEQTGGDLVKLGKGL----TPGANIWYLKAALDHMIFNQMQE 792 Query: 192 VLNPGGRARAEVYRQRQKYKKQRKRN 217 +PG + E + +++ N Sbjct: 793 YFSPGYLRKME-------QRSKKEFN 811 >gi|221213942|ref|ZP_03586915.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166119|gb|EED98592.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 864 Score = 72.6 bits (176), Expect = 4e-11, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 78/223 (34%), Gaps = 28/223 (12%) Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPS----------SLVGVSSQ 61 ++ K + G+ L + QF PI+ H I +++ Sbjct: 619 LRTKVIASATPGTAMGELKKTFMQFKSFPIAMISRHWGRIGDMRRSGDFRVDGAPALANP 678 Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING------------- 108 + A ALV+ I + L++GK+P+ F D Sbjct: 679 MAYAAALVVSTTLIGAISTQVKNLLAGKDPEPMFDDVKHAAGFWTRAFSVGGGAGFAGDM 738 Query: 109 ITHYERFSPFNSSGWDVLG-PWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT 167 +T + + S V+G P S ++ A + + + ++ + Sbjct: 739 LTASFESTDYGSLLGSVVGGPLPSTIYQVVRAFSSNAQ--DAAQGKDTHVSADLLKVAQS 796 Query: 168 FVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210 P NLW+ + +N + +++ + L+PG R + R R +Y Sbjct: 797 NTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NINRSRNQY 838 >gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine microorganism HF4000_48F7] Length = 828 Score = 72.6 bits (176), Expect = 4e-11, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 79/206 (38%), Gaps = 28/206 (13%) Query: 32 LMGQFLV--------MPISWSRMHLIEIPS-SLVGVSSQVYRAKAL-------VIGILGE 75 MG+F P++ + + S L + Q RA + ++ ++ Sbjct: 607 FMGRFFTGEEGIKSGTPMAMANKLFWQFRSFGLTMLFRQWPRAYEMGLPSFYHLVPMVLM 666 Query: 76 ELIRKTLVPLISGKEPQLDFSDPTEYIKAL--------INGITHYERFSPFNSSGWDVL- 126 + + ++ G+E + DP + A I G + + +++S D+L Sbjct: 667 GYVAMAMKDILKGRELKDVVEDPGKIAVASVLQSGFGGIAGDFLFNDYRQYSTSYVDLLA 726 Query: 127 GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVR 186 GP S LA G A + A G V +P+ N W +R F++ + Sbjct: 727 GPSGSSLNDLAEFG--ATTFDVATGGDPVDAAAAGWRAVKGNIPYANWWASRTLFDYLIN 784 Query: 187 NSIDDVLNPGGRARAEVYRQRQKYKK 212 + ++LNPG R E R +QK + Sbjct: 785 YQVQEILNPGSLRRME-RRFKQKNNQ 809 >gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 854 Score = 72.2 bits (175), Expect = 5e-11, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 75/201 (37%), Gaps = 23/201 (11%) Query: 22 KDGSVNN-LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRK 80 + G+V N L+R QF P++ + VY AK + L+ + Sbjct: 640 ERGTVGNELSRFFWQFKQFPLAMIMRQWTRGMAQGTPQEKFVYFAKLFAYTTVMGALVSQ 699 Query: 81 TLVPLISGKEPQLDFSDPTE---YIKALINGIT--HYERFSPFNSS-----GWDVLGPWS 130 + L GK+ DPT Y+K+++ G + S D + P + Sbjct: 700 -IQNLTQGKDLD----DPTTLDFYMKSIVKGGSASFLADAISATSDPTERSVKDFIIPAA 754 Query: 131 ---SQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRN 187 + ++G + + G +V +PFQNLWY+R F+ V Sbjct: 755 FKDITSIGTMVSGAGSAFITERDSSYGAEAVN----VVKNNIPFQNLWYSRLVFDRLVIA 810 Query: 188 SIDDVLNPGGRARAEVYRQRQ 208 + ++ + G R R + ++ Sbjct: 811 EMQELFDEGYRERKQRRQENN 831 >gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v] Length = 824 Score = 71.8 bits (174), Expect = 7e-11, Method: Composition-based stats. Identities = 43/220 (19%), Positives = 81/220 (36%), Gaps = 34/220 (15%) Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72 + ITG + G+ L R + F PIS H S +G+ S RA + I Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662 Query: 73 LGEELIRKTLVPL-----------ISGKEP-QLDFSDPTEYIKALINGITHYERFSPFNS 120 ++ L ++G++ + + + G + + S Sbjct: 663 ASTTILGALSQQLNDLASGRNHREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722 Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177 + +LGP + + + + EG +Q G + GK L +P NLWY Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 778 Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217 + A +H + N + + +PG + E + +++ N Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811 >gi|291336673|gb|ADD96216.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 101 Score = 69.1 bits (167), Expect = 4e-10, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 6/94 (6%) Query: 106 INGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165 I + S+ +GP ++A ++ A A+ EG + + + + Sbjct: 9 IYTDFLFGNIQNSTSALATAVGPIPTEAARVLSALNYAIKGEGGKAGKQAYYS------I 62 Query: 166 NTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199 +PF NL+Y + AF++ + + + L+PG Sbjct: 63 KENIPFLNLFYIKTAFDYMIGYQMMETLSPGSLK 96 >gi|48696644|ref|YP_024423.1| hypothetical protein VP2p19 [Vibrio phage VP2] gi|40950042|gb|AAR97633.1| hypothetical protein [Vibrio phage VP2] Length = 782 Score = 68.3 bits (165), Expect = 7e-10, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 75/196 (38%), Gaps = 12/196 (6%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVI-----GILGEELIRKT 81 L R + F PI+ M+ + G S R A I +LG +I+ Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQ-- 628 Query: 82 LVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGK--LAIA 139 +++GK+P+ SDP +I+ + G + ++ +S G LA Sbjct: 629 AKDILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYG 687 Query: 140 GKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199 A+ K ++ +PF NLWY + A + + + I + +P Sbjct: 688 DWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDPEY-D 746 Query: 200 RAEVYRQRQKYKKQRK 215 + ++ + R+ + ++ Sbjct: 747 KKQLNKMRKMQRTSQQ 762 >gi|48696687|ref|YP_024981.1| hypothetical protein VP5_gp18 [Vibrio phage VP5] gi|40806150|gb|AAR92068.1| hypothetical protein [Vibrio phage VP5] Length = 782 Score = 68.3 bits (165), Expect = 7e-10, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 75/196 (38%), Gaps = 12/196 (6%) Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVI-----GILGEELIRKT 81 L R + F PI+ M+ + G S R A I +LG +I+ Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQ-- 628 Query: 82 LVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGK--LAIA 139 +++GK+P+ SDP +I+ + G + ++ +S G LA Sbjct: 629 AKDILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYG 687 Query: 140 GKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199 A+ K ++ +PF NLWY + A + + + I + +P Sbjct: 688 DWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDPEY-D 746 Query: 200 RAEVYRQRQKYKKQRK 215 + ++ + R+ + ++ Sbjct: 747 KKQLNKMRKMQRTSQQ 762 >gi|48697207|ref|YP_024937.1| hypothetical protein BcepC6B_gp17 [Burkholderia phage BcepC6B] gi|47779013|gb|AAT38376.1| gp17 [Burkholderia phage BcepC6B] Length = 864 Score = 66.0 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 81/223 (36%), Gaps = 28/223 (12%) Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPS----------SLVGVSSQ 61 ++ K + G+V L + QF P++ H I +++ Sbjct: 619 LRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGDMRRSGDFRVDGAPALANP 678 Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTE----YIKALING--------- 108 + A ALV+ I L++GK+P+ F D + +A G Sbjct: 679 MAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAGFAGDM 738 Query: 109 ITHYERFSPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT 167 + + + + S G + GP S + A V + + ++ + Sbjct: 739 LVAAFQSADYGSLLGSAIGGPLLSTLFQPLRAVSSNVQ--DAAQGKDTHIGADLLKIAQS 796 Query: 168 FVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210 P NLW+ + +N + +++ + L+PG R + R R +Y Sbjct: 797 NTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NMNRSRTQY 838 >gi|221201510|ref|ZP_03574549.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207934|ref|ZP_03580940.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221172119|gb|EEE04560.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2] gi|221178778|gb|EEE11186.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 869 Score = 66.0 bits (159), Expect = 3e-09, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 82/231 (35%), Gaps = 33/231 (14%) Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVG-------------- 57 ++ K + G+V L + QF P++ H I + Sbjct: 619 LRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGNMRRSGDYLVEGAPRAFGI 678 Query: 58 -VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTE----YIKALING---- 108 +++ + A ALV+ I L++GK+P+ F D + +A G Sbjct: 679 PLANPMAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAG 738 Query: 109 -----ITHYERFSPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGK 162 + + + S G V GP S + A V + + Sbjct: 739 FAGDMLVAAFESADYGSLLGSAVGGPLLSTLFQPLRAISSNVQ--DAAQGKDTHVGADLL 796 Query: 163 ELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213 ++ + P NLW+ + +N + +++ + L+PG R + R R +Y + Sbjct: 797 KIAQSNTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NMNRSRTQYHNE 846 >gi|262371858|ref|ZP_06065137.1| predicted protein [Acinetobacter junii SH205] gi|262311883|gb|EEY92968.1| predicted protein [Acinetobacter junii SH205] Length = 841 Score = 61.0 bits (146), Expect = 1e-07, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 87/211 (41%), Gaps = 20/211 (9%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPIS-WSRMHLIEIPSSLVGVSSQVYRA 65 + + ++++ I G+ G++ L R + QF P++ RM + S + A Sbjct: 626 IEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRMGHRAFAQGDIK-SRVTFLA 684 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--ITHYERF-----SPF 118 L L LI +T L +GK P+ F+ + K+L+ G ++ P Sbjct: 685 SLLAYQTLAGALIVQT-QNLANGKNPEPVFTID-FFGKSLLKGGGLSFLGDIMSALSDPT 742 Query: 119 NSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTR--KQRGKAQAQFGKELVNTFVPFQNLW 175 S D + GP Q+ KL + + G + + + + + +P QNLW Sbjct: 743 GRSASDFISGPLLGQSMKLGM----LLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLW 798 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206 Y++ + + + + ++++P R + + Sbjct: 799 YSKLVVDRMLYSKMQNMIDPDYLPRTQQRLE 829 >gi|254251753|ref|ZP_04945071.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] gi|124894362|gb|EAY68242.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158] Length = 865 Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 78/229 (34%), Gaps = 37/229 (16%) Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVG-----------VSS 60 ++ K G++ L + QF PI+ H I S Sbjct: 620 LRTKVIAAATPGTLQGELQKTFLQFKSFPIAMISRHWGRIGEMRRSGDFRVEGAPTLASP 679 Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120 Y A ALV+ + L L+ GK+P+ D ++ A + F+ Sbjct: 680 MAYGA-ALVVSTTLLGALAVQLQNLLLGKDPE-PMGDDVKHGGAF-----WFRAFTKGGG 732 Query: 121 SG-------WDVLGPWSSQAGK------LAIAGKEAVWDEGTR-----KQRGKAQAQFGK 162 +G + G ++A L +AV + + + Sbjct: 733 AGFAGDMLSAMLTGKNPAEAVGSVFGGPLVSTAIQAVTPFSNNAMAAAEGKDTHLSADLL 792 Query: 163 ELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211 + + +P NLWY + +N + ++I + L+PG +R ++Q + Sbjct: 793 KFAQSNMPIVNLWYWKTVWNRLIWDNIAENLSPGVTSRNVAKSRQQYHN 841 >gi|226953662|ref|ZP_03824126.1| phage related protein [Acinetobacter sp. ATCC 27244] gi|226835534|gb|EEH67917.1| phage related protein [Acinetobacter sp. ATCC 27244] Length = 842 Score = 59.5 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 87/211 (41%), Gaps = 20/211 (9%) Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPIS-WSRMHLIEIPSSLVGVSSQVYRA 65 + + ++++ I G+ G++ L R + QF P++ R+ + S + A Sbjct: 626 IEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRIGHRAFAQGDIK-SRVTFLA 684 Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--ITHYERF-----SPF 118 L L LI +T L +GK P+ F+ + K+L+ G ++ P Sbjct: 685 SLLAYQTLAGALIVQT-QNLANGKNPEPVFTID-FFGKSLLKGGGLSFLGDIMSALSDPT 742 Query: 119 NSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTR--KQRGKAQAQFGKELVNTFVPFQNLW 175 S D + GP Q+ KL + + G + + + + + +P QNLW Sbjct: 743 GRSASDFISGPLLGQSMKLGM----LLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLW 798 Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206 Y++ + + + + ++++P R + + Sbjct: 799 YSKLVVDRMLYSKMQNMIDPDYLPRTQQRLE 829 >gi|291336683|gb|ADD96225.1| hypothetical protein Rsph17025_0444 [uncultured organism MedDCM-OCT-S08-C1350] Length = 850 Score = 56.4 bits (134), Expect = 3e-06, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 76/210 (36%), Gaps = 29/210 (13%) Query: 20 TGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELI 78 + + G+V + M + PI+ HL VG+ + +++G I Sbjct: 636 SAQPGTVKGEIVNSMLMYKNFPITLGMTHLSR-GFQQVGLKGKAKYLVPMIVGGAVMGSI 694 Query: 79 RKTLVPLISGKEPQLDFSDPTE-----YIKALINGITH-------YERFSPFNSSG-WDV 125 + + +GK P + P + ++ A+I G + + + S + Sbjct: 695 AYEIKQIAAGKTP----TKPEDMGVRYWLNAIIYGGGLGIFGDFLFSDQNRYGGSFSKTL 750 Query: 126 LGPWSSQAGK---LAIAGK-EAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAF 181 GP +S G L + + E T + + + P +LWYAR A Sbjct: 751 AGPVASFIGDSINLTFGNAAQLISGEKTNAGKE------LAAFIQRYTPGSSLWYARVAL 804 Query: 182 NHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211 + +SI+ ++NP + + K + Sbjct: 805 ERILFDSIERLINPDFDSDNRRNINKLKSR 834 >gi|157372110|ref|YP_001480099.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] gi|157323874|gb|ABV42971.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568] Length = 850 Score = 52.9 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 35/212 (16%) Query: 26 VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA------------------ 67 + R GQF S+ + + +++ +++ Sbjct: 635 LGEAIRFGGQFKSFTGSFMQNTIGREIYGRGYTPAELGQSRFTSLANAMRNGNGEKMGLA 694 Query: 68 -LVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKAL-------INGITHYERFSPFN 119 L I + + L+ G+ P+ +D ++ A I G + ++ F Sbjct: 695 QLFIWMTALGYVSMQTKLLLKGQTPR--PADAKTFLAAAAQGGGLGIMGDFLFGEYNRFG 752 Query: 120 SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG 179 G +S + + + + R + G A+A + PF NL R Sbjct: 753 -------GGLASSLAGPTVGDLDQIRNLFLRARDGDAKAADLLKFGIDHTPFMNLHVVRP 805 Query: 180 AFNHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211 A N+ + N + L+PG R ++++ Sbjct: 806 AMNYLILNRAQEWLSPGSLERYRQRVEKEQGN 837 >gi|262043648|ref|ZP_06016757.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259038986|gb|EEW40148.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 974 Score = 52.6 bits (124), Expect = 4e-05, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 77/225 (34%), Gaps = 41/225 (18%) Query: 22 KDGSV-NNLARLMGQFLVMPISWSR----MHLIEIPSSLVGVSSQ-VYRAKAL------- 68 + G+ + R QF S+ + L +S +R AL Sbjct: 750 QRGTAYGEMLRFAWQFKSFTASFMQNAIGRELYGRGYDFGSLSQNNTFRNNALIRAMRNG 809 Query: 69 ---VIGIL-------GEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH------- 111 ++GI + ++ G+ P+ + + + A+ G Sbjct: 810 NGELMGIAQLFLWATAFGYLSMQTKLMLRGQTPR-PADNVSTWTAAMAQGGGLGILGDFL 868 Query: 112 YERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171 + ++ F ++ P +S AG A + V G KQ A + +N P+ Sbjct: 869 FGEYNRFGNT------PATSLAGPFASDAAQLVNLFGLTKQGDAKAADYFNFAINHT-PY 921 Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216 NL R + + N + + ++PG R Y+QR K ++ Sbjct: 922 MNLHVVRPVMDFLILNQMREWMSPGSLQR---YQQRVKEEQGNDF 963 >gi|262043550|ref|ZP_06016663.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039084|gb|EEW40242.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 143 Score = 49.1 bits (115), Expect = 4e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%) Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFN-SSGWDVLGPWSS 131 L+ G+ P+ +D ++ A G + + ++GP +S Sbjct: 1 MQSKLLLKGQTPR--PADAKTFLAAASQGGGLGILGDFMFGEVNRMGAGPVTSLMGPAAS 58 Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191 A + ++ + + PF N+++ R A N + N I D Sbjct: 59 NADSIITLLQQTTRGDADLGDWYRTALD--------NTPFLNVFWLRTAMNGLILNRIQD 110 Query: 192 VLNPGGRARAEVYRQRQKYK 211 L+PG R + +R++ Sbjct: 111 ALDPGSLERYQRRVEREQGN 130 >gi|190893672|ref|YP_001980214.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] gi|190698951|gb|ACE93036.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652] Length = 460 Score = 49.1 bits (115), Expect = 5e-04, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 48/106 (45%), Gaps = 12/106 (11%) Query: 5 ARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVY 63 RG++ +Q G++ R QF P+++ H++ + G++++ Y Sbjct: 355 IRGAMTGGLQ--------RGTIIGEAVRSATQFKSFPMTYMMTHMMRALTQ--GMANRTY 404 Query: 64 RAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGI 109 R L + + + LI+G++PQ + +DP + ++ I G Sbjct: 405 RTTQLALTMTIAGAEMSQMQSLIAGRDPQ-NMADPRFWEQSFIRGG 449 >gi|218514216|ref|ZP_03511056.1| hypothetical protein Retl8_11184 [Rhizobium etli 8C-3] Length = 73 Score = 44.8 bits (104), Expect = 0.008, Method: Composition-based stats. Identities = 12/49 (24%), Positives = 26/49 (53%) Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209 + + + P +LWY + A + + ++I +++P RA + Y +R K Sbjct: 2 LADHLKAWTPGSSLWYTKIATDRLIFDNIQAMIDPNYRASFDRYERRMK 50 >gi|242783432|ref|XP_002480186.1| GTP cyclohydrolase II, putative [Talaromyces stipitatus ATCC 10500] gi|218720333|gb|EED19752.1| GTP cyclohydrolase II, putative [Talaromyces stipitatus ATCC 10500] Length = 451 Score = 40.6 bits (93), Expect = 0.14, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 49/138 (35%), Gaps = 11/138 (7%) Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF------SDPTEYIKALINGIT 110 G S +Y A A+ G + P + EP DF SDP + + G Sbjct: 84 GGSYSIYNALAIAAG-----DLPTDFKPDFNNTEPTFDFPQQPAWSDPKKIVSLDPFGHD 138 Query: 111 HYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVP 170 ++F + GWD+ + + +A E EG + G ++ T V Sbjct: 139 IVKQFKSYLDVGWDLRPSMAITRANMRLAEIEKAVSEGQIEVDGSIVVDKNGDVRVTKVA 198 Query: 171 FQNLWYARGAFNHFVRNS 188 + +WY G F + Sbjct: 199 VEPVWYLPGVAERFGVDE 216 >gi|212527336|ref|XP_002143825.1| GTP cyclohydrolase II, putative [Penicillium marneffei ATCC 18224] gi|210073223|gb|EEA27310.1| GTP cyclohydrolase II, putative [Penicillium marneffei ATCC 18224] Length = 494 Score = 39.5 bits (90), Expect = 0.38, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 48/138 (34%), Gaps = 11/138 (7%) Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF------SDPTEYIKALINGIT 110 G S +Y A A+ G + P + EP DF SDP + + G Sbjct: 127 GGSYSIYNALAIAAG-----DLPTDFKPDFNNTEPTFDFPVQPAWSDPKKIVSLDPFGHD 181 Query: 111 HYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVP 170 + F + GWD+ + + ++ E EG + G ++ T V Sbjct: 182 IVKHFKSYLDVGWDLRPSMAITRANMRLSEIEKAVSEGQIEVDGSIVIGKNGDVRVTKVA 241 Query: 171 FQNLWYARGAFNHFVRNS 188 + +WY G F + Sbjct: 242 VEPVWYLPGVAERFGVDE 259 >gi|294661369|ref|YP_003573245.1| hypothetical protein Aasi_1895 [Candidatus Amoebophilus asiaticus 5a2] gi|227336520|gb|ACP21117.1| hypothetical protein Aasi_1895 [Candidatus Amoebophilus asiaticus 5a2] Length = 585 Score = 38.7 bits (88), Expect = 0.60, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 2/120 (1%) Query: 39 MPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDP 98 PIS S + + VY+ + + EEL K+ L G + + +P Sbjct: 224 YPISISSRNYATEGNKSEQGVWDVYKKELSIKNYTQEELRTKSFPYLFHGGKLDTTYLNP 283 Query: 99 TEYIKALINGITHYERFSPFNSSGWD--VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 156 T + ++ E F D ++ P KL +E + ++ K Sbjct: 284 TTFYNLMVRAGFQEEDFKEGKHGFQDKVLVKPIILTKTKLNECHEELRELINSTLKKAKY 343 >gi|310798539|gb|EFQ33432.1| hypothetical protein GLRG_08711 [Glomerella graminicola M1.001] Length = 1103 Score = 37.9 bits (86), Expect = 0.85, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 40/125 (32%), Gaps = 9/125 (7%) Query: 36 FLVMPISWSRMHLIEIPS--SLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQL 93 F + RM P+ S + +Y G +E + + L+ G P L Sbjct: 964 FKTQSMVLMRMFYFVEPADGSAAKIQGPIYSPDQAAAGTSNKEFLANFVANLLRGAFPNL 1023 Query: 94 DFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQR 153 + +++ L T Y++F L ++ E E +++R Sbjct: 1024 QPAQIQTFVEGLFTLNTQYDKFRLNLRDFLISLKEFAGD-------NAELFQVEKEQQER 1076 Query: 154 GKAQA 158 A Sbjct: 1077 DAKAA 1081 >gi|170048775|ref|XP_001870771.1| bromodomain-containing protein 8 [Culex quinquefasciatus] gi|167870763|gb|EDS34146.1| bromodomain-containing protein 8 [Culex quinquefasciatus] Length = 917 Score = 37.9 bits (86), Expect = 0.96, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 44/136 (32%), Gaps = 9/136 (6%) Query: 71 GILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWS 130 G+ ++ L++G P ++ + A + P S + P Sbjct: 243 GMQAVAGRSPSITNLLTGNSPGMNIQGKNLFPTAGSTSTQLQDDIKPIEGSSSYQIAP-- 300 Query: 131 SQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL---VNTFVPFQNLWYARGAFNHFVRN 187 KL ++ V D+ T G Q +++ + P ++L F + Sbjct: 301 -NLTKLLDTKQQVVDDKPTDSGEGAVQVDKAEDMEIDADNVDPAKDLM---AVFQELMPE 356 Query: 188 SIDDVLNPGGRARAEV 203 + ++LN E Sbjct: 357 ELVEILNENNGMILED 372 >gi|291336674|gb|ADD96217.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377] Length = 333 Score = 36.8 bits (83), Expect = 2.5, Method: Composition-based stats. Identities = 11/72 (15%), Positives = 23/72 (31%), Gaps = 10/72 (13%) Query: 27 NNLARLMGQFLVMPISWSRMHL------IEIPSSLVGVSSQVYRAKALVIGILGEELIRK 80 R M QF P ++ + + + + + + LV G + Sbjct: 263 GEALRFMTQFKAFPFAFYQKMIGRETAAWKDGNKM----NAALSMAQLVGGSALFGYMAM 318 Query: 81 TLVPLISGKEPQ 92 T ++ GK + Sbjct: 319 TAKDILKGKNLR 330 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.308 0.118 0.280 Lambda K H 0.267 0.0361 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,150,439,952 Number of Sequences: 13984884 Number of extensions: 110355439 Number of successful extensions: 265952 Number of sequences better than 10.0: 58 Number of HSP's better than 10.0 without gapping: 66 Number of HSP's successfully gapped in prelim test: 34 Number of HSP's that attempted gapping in prelim test: 265819 Number of HSP's gapped (non-prelim): 115 length of query: 218 length of database: 4,792,584,752 effective HSP length: 133 effective length of query: 85 effective length of database: 2,932,595,180 effective search space: 249270590300 effective search space used: 249270590300 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 78 (34.8 bits)