BLASTP 2.2.22 [Sep-27-2009]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= 537021.9.peg.1142_1
(218 letters)
Database: nr
13,984,884 sequences; 4,792,584,752 total letters
Searching..................................................done
Results from round 1
>gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2]
gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus]
Length = 809
Score = 455 bits (1171), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/218 (100%), Positives = 218/218 (100%)
Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60
VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS
Sbjct: 592 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 651
Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120
QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS
Sbjct: 652 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 711
Query: 121 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180
SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA
Sbjct: 712 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771
Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218
FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG
Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 809
>gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 810
Score = 90.9 bits (224), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 115/217 (52%), Gaps = 11/217 (5%)
Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLV---- 56
Q++ARGSVGS+++D ++ + + G + L+ L+ QFL PIS + HL +P +LV
Sbjct: 591 TQDNARGSVGSSLRDTKYTSSR-GGIPGLS-LVTQFLTTPISMAEKHLWAVPKTLVGGAN 648
Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYER-F 115
G+S+ YRAK L GI+ E ++ T ++G+E DF+DP +THY+R F
Sbjct: 649 GMSAWSYRAKFLAFGIVLEGIVANTARKALTGQELD-DFTDPKVLALMTARTLTHYDRFF 707
Query: 116 SPFNSSGWDVLG--PWSSQAGKLAIAGKEAVWD-EGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ ++ D+L P +S L AG E + G +++ + V +P +
Sbjct: 708 NEYHHDFKDLLHSVPVASTVIGLGDAGLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLK 767
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209
NL+Y + AF V +++ + N G + R + R+ +K
Sbjct: 768 NLFYVKAAFQKMVVDNLCEYFNEGYKDRLAMNRELRK 804
>gi|315121758|ref|YP_004062247.1| hypothetical protein CKC_00040 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495160|gb|ADR51759.1| hypothetical protein CKC_00040 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 107
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/71 (54%), Positives = 49/71 (69%), Gaps = 2/71 (2%)
Query: 68 LVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS--SGWDV 125
L++ EELI+ LVPLISG EP+ D + P +Y KA++N ITHYERFSP S WD+
Sbjct: 34 LLVEYANEELIKNVLVPLISGNEPRFDITSPRDYAKAIVNAITHYERFSPLGGGQSKWDI 93
Query: 126 LGPWSSQAGKL 136
LGP QAG+L
Sbjct: 94 LGPALGQAGRL 104
>gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 56
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 43/55 (78%)
Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216
KE++NT VPFQNLWY + F++FVR +DD +NPG RARAE YR++ +++RK+
Sbjct: 2 KEVLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRKK 56
>gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 137
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 3/118 (2%)
Query: 93 LDFSDPTEYIKALINGITHYERF-SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRK 151
+DF+DP +THY+RF + ++ D+L + + + ++ E K
Sbjct: 16 IDFTDPKTLALLTARTLTHYDRFFNEYHHDFKDLLHAVPVASTIIGLGDARNIFGEDEEK 75
Query: 152 QRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209
R KA A F KEL N +P +NL+YA+ AF + +++ + N G + R ++ R+ +K
Sbjct: 76 -REKANANFAKELANN-IPLKNLFYAKAAFQKMIVDNLCEYFNEGYKERLDMNRELRK 131
>gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1]
Length = 918
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 67/139 (48%), Gaps = 25/139 (17%)
Query: 85 LISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSS-GWDVLGP---WSSQA 133
L++G +P LD + PT +++AL+ G + ++ + + SS G + GP ++ Q
Sbjct: 764 LLNGNDP-LDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATMGGPVLSFAEQL 822
Query: 134 GKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSI 189
KL I ++A+ E T FG + + T PF NLWYA+ NH + +
Sbjct: 823 TKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHLILQQL 873
Query: 190 DDVLNPGGRARAEVYRQRQ 208
++ NPG R QR+
Sbjct: 874 QEMANPGYNDRVRDRAQRE 892
>gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233]
Length = 530
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 94/210 (44%), Gaps = 40/210 (19%)
Query: 31 RLMGQFLVMPISWS------RMHLIEIPSSLVGVSSQVYRAK---------ALVI--GIL 73
R +GQF P+S M I L G+S++ RA+ ALVI G +
Sbjct: 321 RFVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMAALVITSGFM 380
Query: 74 GEELIRKTLVPLISGKEPQLDFSDPTEYIKAL----------INGITHYERFSPFNSSGW 123
G + T+ L+ GKEP+ DPT++ + I G ++ S
Sbjct: 381 G--YMAMTMKDLLKGKEPR----DPTKFKTIMAGFLQGGGLGIYGDVLFKEQRDAGSVIA 434
Query: 124 DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNH 183
++GP + L +A + A+ EG + + +A +++ +PF NL+Y + AF++
Sbjct: 435 GLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRA------ISSNIPFLNLFYIKIAFDY 488
Query: 184 FVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
+ I + +NPG + E R ++ Y ++
Sbjct: 489 LIGFQIMETVNPGVLKKVE-RRMKKDYNQE 517
>gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15]
gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15]
Length = 918
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 32/148 (21%)
Query: 85 LISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----PWSSQA 133
L++G +P LD + PT +++AL+ G + ++ + + SS +G ++ Q
Sbjct: 764 LLTGNDP-LDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSSIAATIGGPVLSFAEQL 822
Query: 134 GKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSI 189
KL I ++A+ E T FG + + T PF NLWYA+ NH + +
Sbjct: 823 TKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHLILQQL 873
Query: 190 DDVLNPGGRARAEVYRQRQKYKKQRKRN 217
++ NPG Y R + + QR+ N
Sbjct: 874 QEMANPG-------YNDRVRDRAQREFN 894
>gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter
asiaticus str. psy62]
gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter
asiaticus str. psy62]
gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1]
gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus]
Length = 864
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 101/235 (42%), Gaps = 42/235 (17%)
Query: 1 VQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLV 56
VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ +S
Sbjct: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLSNSAK 687
Query: 57 ---GVSSQV------YRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107
G S + Y A + GI G I+ L+ G++P L Y L N
Sbjct: 688 MPKGASMALNHVWIQYSATMALAGI-GVASIK----ALLRGEDPSLP---EVIYDGTLAN 739
Query: 108 G--ITHYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQF 160
G + + +R + S G +LGP S L + E + + +A
Sbjct: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKA-- 797
Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRK 215
+ +PF N+WY + +F+H + N I + LNPG Y RQ+ KK++K
Sbjct: 798 ----IRKTLPFMNMWYLKNSFDHLILNQILEELNPG-------YLDRQQSKKKKK 841
>gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3]
gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3]
Length = 924
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 63/148 (42%), Gaps = 23/148 (15%)
Query: 67 ALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFN 119
A V G + + L+SG +P LD + P +++AL+ G + ++ + +
Sbjct: 752 AYVAGTTLAGMFANQMNALLSGNDP-LDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYG 810
Query: 120 SSGWDVLGP----WSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQ 172
SS +LG ++ Q K + ++K + F + + T PF
Sbjct: 811 SSIAGILGGPVLGFAEQLSKTVLTN--------SQKAMAGEETTFTADALKTARMITPFA 862
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRAR 200
NLWY + NH + + ++ NPG AR
Sbjct: 863 NLWYTKAITNHLILQQLQEMANPGYNAR 890
>gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624]
gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624]
Length = 841
Score = 44.7 bits (104), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 66/149 (44%), Gaps = 21/149 (14%)
Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127
L L++G +PQ D +DP + ++++ + G + ++SG D V G
Sbjct: 679 LKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLSFLGDILVAGTDTSGRDAHSFVAG 738
Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185
P S L G ++EG G QF V +P QNLWY + A N V
Sbjct: 739 PLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAFQF----VKRKIPAQNLWYTKAAINRMV 794
Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQR 214
+ I D + PG R +A + + +K ++R
Sbjct: 795 FDEIQDFIAPGYREKA-LRKAEEKQDRER 822
>gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024]
gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024]
Length = 1175
Score = 43.9 bits (102), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 20/144 (13%)
Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127
L +++G +PQ D +DP + ++++L+ G + + ++SG D V G
Sbjct: 1013 LREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSG 1072
Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185
P S L G ++EG G +F V +P QNLWY + A N V
Sbjct: 1073 PLGSDFTSLLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMV 1128
Query: 186 RNSIDDVLNPGGRARAEVYRQRQK 209
+ + D + PG R +A +RQ+
Sbjct: 1129 FDEMQDTIAPGYREKALRKAERQQ 1152
>gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans']
gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans']
Length = 824
Score = 42.7 bits (99), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 5/66 (7%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206
EG +Q G +F K ++ P QNLWY + F+H V N + ++ +PG R E R
Sbjct: 752 EGKPEQTGGDLVKFAKGMI----PGQNLWYTKAVFDHMVFNQLQEIFSPGYLRRME-KRS 806
Query: 207 RQKYKK 212
R+++ +
Sbjct: 807 RKEFNQ 812
>gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE]
gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein
[Acinetobacter baumannii AYE]
Length = 841
Score = 42.4 bits (98), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 66/148 (44%), Gaps = 20/148 (13%)
Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127
L L++G +PQ D +DP + +I++ + G + ++SG D V G
Sbjct: 679 LKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLSFLGDILVAGTDTSGRDANSFVAG 738
Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185
P + L G ++EG G +F V +P QNLWY + A N V
Sbjct: 739 PLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMV 794
Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
+ + D + PG R +A +RQ+ +++
Sbjct: 795 FDEMQDTIAPGYREKALRKAERQQDRER 822
>gi|294843482|ref|ZP_06788165.1| putative phage related protein [Acinetobacter sp. 6014059]
Length = 841
Score = 42.4 bits (98), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 67/148 (45%), Gaps = 20/148 (13%)
Query: 82 LVPLISGKEPQL--DFSDPTE----YIKALING----ITHYERFSPFNSSGWD----VLG 127
L +++G +PQ D +DP + ++++L+ G + + ++SG D V G
Sbjct: 679 LREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVAGTDTSGRDANSFVSG 738
Query: 128 PWSSQAGKLA--IAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFV 185
P S L G ++EG G +F V +P QNLWY + A N
Sbjct: 739 PLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAFKF----VKGKIPAQNLWYTKAAINRMF 794
Query: 186 RNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
+ + D + PG R +A +RQ+ +++
Sbjct: 795 FDEVQDTIAPGYREKALRKAERQQDRER 822
>gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS]
gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS]
Length = 838
Score = 41.6 bits (96), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 88/204 (43%), Gaps = 38/204 (18%)
Query: 29 LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLV----- 83
L R + F MPI+ H G+S R+KA IG L ++ T++
Sbjct: 623 LTRSVFLFKTMPIAMLMRHWER------GMSGPDARSKAGYIGAL---MVSTTVMGMLAL 673
Query: 84 ---PLISGKEPQLDFSDPTE-------YIKALING----ITHYERFSPFNSSGWDVLGPW 129
L+ G++P +P E +++A + G I FS N G GP
Sbjct: 674 QIDELLKGRDPV--NMNPFEGKAGARNWVRAFLKGGSLGIYGDFLFSEQNQHGG---GPI 728
Query: 130 SSQAGKLAIAGKEAV-WDEGTRKQRGKAQ-AQFGKELVN---TFVPFQNLWYARGAFNHF 184
+S G + A +EA +G Q G+ + G EL+ P NLWY + A NH
Sbjct: 729 ASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELLKFAKGMTPGANLWYLKAATNHL 788
Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208
+ N + ++++PG AR + QR+
Sbjct: 789 IFNQLQEMVSPGYLARVKSRAQRE 812
>gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB]
gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB]
Length = 921
Score = 40.0 bits (92), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/134 (23%), Positives = 55/134 (41%), Gaps = 9/134 (6%)
Query: 82 LVPLISGKEPQLDFSDPTEYIKALING----ITHYERFSPFNSSGWDVLGPWSSQAGKLA 137
L L+SG +P +D + P ++ A + G I F G + + LA
Sbjct: 764 LNALLSGNDP-IDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATLGGPSLGLA 822
Query: 138 IAGKEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRNSIDDVLN 194
+ + + + +G+ + FG + + T PF NLWY + NH + + ++ N
Sbjct: 823 ESLMKLLITNPQKAMQGE-ETSFGADAIKTARMITPFANLWYTKAVTNHLILQQLQEMAN 881
Query: 195 PGGRARAEVYRQRQ 208
PG R Q Q
Sbjct: 882 PGYNDRVRDRAQNQ 895
>gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1]
gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1]
Length = 855
Score = 40.0 bits (92), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 49/122 (40%), Gaps = 9/122 (7%)
Query: 85 LISGKEPQLDFSDPTEYIKALING----ITHYERFSPFNSSGWDVLGPWSSQAGKLAIAG 140
+ G+EP+ DP ++ A++ G I F N G L S AG I
Sbjct: 712 VTKGREPR-PADDPKTWLAAMVQGGGLGIFGDYLFGEANRFGNSAL---ESAAGP-TIGT 766
Query: 141 KEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRAR 200
V + R + G A L PF NL+Y R A +H S+ + +NPG R
Sbjct: 767 AADVINLWARAKEGDDTASSALRLAQNNTPFMNLFYTRIALDHLFLYSVQEAMNPGSLRR 826
Query: 201 AE 202
E
Sbjct: 827 TE 828
>gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
Length = 854
Score = 39.7 bits (91), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 34/63 (53%), Gaps = 10/63 (15%)
Query: 158 AQFGKELVNTF---VPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQR 214
+ +G E VN +PFQNLWY+R F+ V + ++ + G YR+R++ +++
Sbjct: 778 SSYGAEAVNVVKNNIPFQNLWYSRLVFDRLVIAEMQELFDEG-------YRERKQRRQEN 830
Query: 215 KRN 217
N
Sbjct: 831 NHN 833
>gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510]
gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510]
Length = 995
Score = 39.3 bits (90), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 72/184 (39%), Gaps = 37/184 (20%)
Query: 31 RLMGQFLVMPIS-----WSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEEL---IRKTL 82
R +GQF P++ W R L G RA +V ++ + + L
Sbjct: 792 RFVGQFKAFPVAVISKVWGR--------DLYGGERGWGRAAGIVHTLVATTVMGYVAGML 843
Query: 83 VPLISGKEPQLDFSDPTEYIKALING----------ITHYERFSPFNSSGWDVLGPWSSQ 132
L G+ P+ D +DP + A + G + Y RF N GP S
Sbjct: 844 KDLSKGRAPR-DPTDPRAWGAAFLQGGGAGIYGDFLLGQYSRFG--NRFLESAAGPTLSS 900
Query: 133 AGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDV 192
AG+L +W G R+ + A L NT PF NL+Y R A ++ + +
Sbjct: 901 AGELL-----NIW-AGAREGNDEKAATLRWTLSNT--PFVNLFYTRMALDYLFLYQVQEA 952
Query: 193 LNPG 196
+NPG
Sbjct: 953 MNPG 956
>gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131]
gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131]
Length = 823
Score = 38.1 bits (87), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAE 202
EG +Q G +F K L+ P QNLWY + +H V N + + +PG R E
Sbjct: 750 EGKPEQTGGDTVKFVKGLI----PGQNLWYTKAVLDHMVFNQLQEYFSPGYLRRME 801
>gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
Length = 831
Score = 37.7 bits (86), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 80/193 (41%), Gaps = 43/193 (22%)
Query: 43 WSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYI 102
W R+ IE + S+ V+ G+L + L+ +++G++P+ D D ++
Sbjct: 639 WKRVSQIESTGGKLAYSASVF------TGLLMAGAMTNQLMDIMNGRDPR-DMKDGKFWL 691
Query: 103 KALI-------------NGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGT 149
+A++ G+ R N +G +LGP A + + +V+ E T
Sbjct: 692 QAMLRGGGVGIFGDILNTGLGGDNRGGQSNLTG--LLGPVYGTAADVGLT-LGSVFKEKT 748
Query: 150 RKQRGKAQAQFGKELV-----NTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVY 204
A G L+ NT PF WY + AF H V + + ++L+PG Y
Sbjct: 749 EP------ADVGANLLRIGYQNT--PFIRSWYTKAAFEHAVMHDMQEMLSPG-------Y 793
Query: 205 RQRQKYKKQRKRN 217
R K + ++ N
Sbjct: 794 LSRMKKRAKKDFN 806
>gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112]
Length = 582
Score = 36.6 bits (83), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 510 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 555
>gi|322703038|gb|EFY94654.1| hypothetical protein MAA_09875 [Metarhizium anisopliae ARSEF 23]
Length = 303
Score = 36.6 bits (83), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 34/76 (44%)
Query: 116 SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLW 175
SPF+ + P K ++ G+ VW+ QR K + G E ++ V + +
Sbjct: 16 SPFDDMDTESQKPEPQSPRKPSVGGESVVWEPFGIPQRNKLRLAVGPERISIVVDYWAIE 75
Query: 176 YARGAFNHFVRNSIDD 191
+ +H +R ++DD
Sbjct: 76 HISPVLHHMIRRALDD 91
>gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length = 824
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10]
gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10]
Length = 824
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605]
gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605]
Length = 824
Score = 36.2 bits (82), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407]
Length = 825
Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 753 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 798
>gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302]
gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302]
Length = 824
Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v]
Length = 824
Score = 36.2 bits (82), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKNEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14]
Length = 824
Score = 36.2 bits (82), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1]
gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1]
gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252]
Length = 824
Score = 36.2 bits (82), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2]
Length = 824
Score = 36.2 bits (82), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L+ P NLWY + A +H + N + + +PG
Sbjct: 752 EGKSEQTGGDLVKLGKGLM----PGANLWYLKAALDHMIFNQMQEYFSPG 797
>gi|118590567|ref|ZP_01547969.1| hypothetical protein SIAM614_03291 [Stappia aggregata IAM 12614]
gi|118437030|gb|EAV43669.1| hypothetical protein SIAM614_03291 [Stappia aggregata IAM 12614]
Length = 317
Score = 35.4 bits (80), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/66 (28%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
Query: 110 THYERFSPFNSSGW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTF 168
+H + P W D+ G S+ G AI G WD+ + G ++G+ L+N
Sbjct: 95 SHKWQHEPIPPQAWADLFGELSAPLGTHAILGNHDWWDDADAQLTGGGPTKYGQALLNAG 154
Query: 169 VP-FQN 173
+P +QN
Sbjct: 155 IPLYQN 160
>gi|307942811|ref|ZP_07658156.1| metallophosphoesterase [Roseibium sp. TrichSKD4]
gi|307773607|gb|EFO32823.1| metallophosphoesterase [Roseibium sp. TrichSKD4]
Length = 318
Score = 35.0 bits (79), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 44/101 (43%), Gaps = 5/101 (4%)
Query: 110 THYERFSPFNSSGW-DVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTF 168
+H ++ P W D+ G + G A+ G WD+ + G ++G+ L+N
Sbjct: 95 SHKWQYEPIEPQAWADIFGDLRAPLGVHAVLGNHDWWDDKDAQLTGYGPTKYGQALINAG 154
Query: 169 VP-FQNLWYARGAFNH-FVRNSIDD--VLNPGGRARAEVYR 205
+P +QN H F +DD L P RA+ + +R
Sbjct: 155 IPLYQNRATRLSKDGHSFWLAGLDDQLALYPSRRAKRKSWR 195
>gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine
microorganism HF4000_48F7]
Length = 828
Score = 35.0 bits (79), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 5/114 (4%)
Query: 106 INGITHYERFSPFNSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL 164
I G + + +++S D+L GP S LA G +D T A A G
Sbjct: 705 IAGDFLFNDYRQYSTSYVDLLAGPSGSSLNDLAEFGA-TTFDVATGGDPVDAAAA-GWRA 762
Query: 165 VNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218
V +P+ N W +R F++ + + ++LNPG R E R+ ++ Q R G
Sbjct: 763 VKGNIPYANWWASRTLFDYLINYQVQEILNPGSLRRME--RRFKQKNNQDYRAG 814
>gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str.
E2348/69]
gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
Length = 824
Score = 34.7 bits (78), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 4/50 (8%)
Query: 147 EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPG 196
EG +Q G + GK L P N+WY + A +H + N + + +PG
Sbjct: 752 EGKSEQTGGDLVKLGKGLT----PGANIWYLKAALDHMIFNQMQEYFSPG 797
Searching..................................................done
Results from round 2
>gi|317120709|gb|ADV02531.1| hypothetical protein SC2_gp030 [Liberibacter phage SC2]
gi|317120770|gb|ADV02591.1| hypothetical protein SC2_gp030 [Candidatus Liberibacter asiaticus]
Length = 809
Score = 315 bits (807), Expect = 3e-84, Method: Composition-based stats.
Identities = 218/218 (100%), Positives = 218/218 (100%)
Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 60
VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS
Sbjct: 592 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSS 651
Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120
QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS
Sbjct: 652 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 711
Query: 121 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180
SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA
Sbjct: 712 SGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 771
Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 218
FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG
Sbjct: 772 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRNG 809
>gi|315121926|ref|YP_004062415.1| hypothetical protein CKC_00880 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|315122888|ref|YP_004063377.1| hypothetical protein CKC_05720 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495328|gb|ADR51927.1| hypothetical protein CKC_00880 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496290|gb|ADR52889.1| hypothetical protein CKC_05720 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 810
Score = 227 bits (577), Expect = 1e-57, Method: Composition-based stats.
Identities = 67/220 (30%), Positives = 116/220 (52%), Gaps = 11/220 (5%)
Query: 1 VQEHARGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVG--- 57
Q++ARGSVGS+++D ++ + + G + L+ L+ QFL PIS + HL +P +LVG
Sbjct: 591 TQDNARGSVGSSLRDTKYTSSR-GGIPGLS-LVTQFLTTPISMAEKHLWAVPKTLVGGAN 648
Query: 58 -VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERF- 115
+S+ YRAK L GI+ E ++ T ++G+E DF+DP +THY+RF
Sbjct: 649 GMSAWSYRAKFLAFGIVLEGIVANTARKALTGQELD-DFTDPKVLALMTARTLTHYDRFF 707
Query: 116 SPFNSSGWDVLG--PWSSQAGKLAIAGKEAVWD-EGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ ++ D+L P +S L AG E + G +++ + V +P +
Sbjct: 708 NEYHHDFKDLLHSVPVASTVIGLGDAGLEVSRNIFGEDEEKKAKANAKLAKEVANNMPLK 767
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212
NL+Y + AF V +++ + N G + R + R+ +K +
Sbjct: 768 NLFYVKAAFQKMVVDNLCEYFNEGYKDRLAMNRELRKSRS 807
>gi|254781202|ref|YP_003065615.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter
asiaticus str. psy62]
gi|254040879|gb|ACT57675.1| hypothetical protein CLIBASIA_05545 [Candidatus Liberibacter
asiaticus str. psy62]
gi|317120668|gb|ADV02491.1| hypothetical protein SC1_gp030 [Liberibacter phage SC1]
gi|317120812|gb|ADV02633.1| hypothetical protein SC1_gp030 [Candidatus Liberibacter asiaticus]
Length = 864
Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats.
Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 35/228 (15%)
Query: 1 VQEHARGSVGSTIQDKR---WITGKDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLV 56
VQ RG++ +++ D++ +T K G+ R+ QF P ++++++ +S
Sbjct: 629 VQTSVRGAMHTSLFDRQRLGLLTYKRGTRAGEALRMFQQFTTTPTGMF-LNILDLSNSAK 687
Query: 57 ---GVSSQV------YRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALIN 107
G S + Y A + GI G I+ L+ G++P L Y L N
Sbjct: 688 MPKGASMALNHVWIQYSATMALAGI-GVASIK----ALLRGEDPSLP---EVIYDGTLAN 739
Query: 108 G--ITHYERFSPFNSSG-----WDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQF 160
G + + +R + S G +LGP S L + E + + +A
Sbjct: 740 GALLPYMDRLTKLVSKGDRAAIGGLLGPVPSMVTNLTSSAVELATKDNENSKVNATKA-- 797
Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208
+ +PF N+WY + +F+H + N I + LNPG R + ++++
Sbjct: 798 ----IRKTLPFMNMWYLKNSFDHLILNQILEELNPGYLDRQQSKKKKK 841
>gi|291334971|gb|ADD94604.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C233]
Length = 530
Score = 165 bits (417), Expect = 4e-39, Method: Composition-based stats.
Identities = 51/214 (23%), Positives = 92/214 (42%), Gaps = 36/214 (16%)
Query: 25 SVNNLARLMGQFLVMPISWS------RMHLIEIPSSLVGVSSQVYRAK---------ALV 69
+ R +GQF P+S M I L G+S++ RA+ ALV
Sbjct: 315 GMGEAIRFVGQFKAFPMSIMNKVLGREMAYIRKGKKLGGLSTEAGRAEIGRGIRGMAALV 374
Query: 70 IGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKAL----------INGITHYERFSPFN 119
I + T+ L+ GKEP+ DPT++ + I G ++
Sbjct: 375 ITSGFMGYMAMTMKDLLKGKEPR----DPTKFKTIMAGFLQGGGLGIYGDVLFKEQRDAG 430
Query: 120 SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG 179
S ++GP + L +A + A+ EG + + +A +++ +PF NL+Y +
Sbjct: 431 SVIAGLVGPAPTTVVDLGLALQYALLGEGGKSGKAAYRA------ISSNIPFLNLFYIKI 484
Query: 180 AFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
AF++ + I + +NPG + E R ++ Y ++
Sbjct: 485 AFDYLIGFQIMETVNPGVLKKVE-RRMKKDYNQE 517
>gi|315122771|ref|YP_004063260.1| hypothetical protein CKC_05130 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313496173|gb|ADR52772.1| hypothetical protein CKC_05130 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 137
Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats.
Identities = 32/122 (26%), Positives = 61/122 (50%), Gaps = 3/122 (2%)
Query: 92 QLDFSDPTEYIKALINGITHYERF-SPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTR 150
+DF+DP +THY+RF + ++ D+L + + + ++ E
Sbjct: 15 SIDFTDPKTLALLTARTLTHYDRFFNEYHHDFKDLLHAVPVASTIIGLGDARNIFGEDEE 74
Query: 151 KQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210
K R KA A F KE + +P +NL+YA+ AF + +++ + N G + R ++ R+ +K
Sbjct: 75 K-REKANANFAKE-LANNIPLKNLFYAKAAFQKMIVDNLCEYFNEGYKERLDMNRELRKS 132
Query: 211 KK 212
+
Sbjct: 133 RS 134
>gi|30387396|ref|NP_848225.1| hypothetical protein epsilon15p17 [Enterobacteria phage epsilon15]
gi|30266051|gb|AAO06080.1| 17 [Salmonella phage epsilon15]
Length = 918
Score = 131 bits (329), Expect = 6e-29, Method: Composition-based stats.
Identities = 48/204 (23%), Positives = 84/204 (41%), Gaps = 27/204 (13%)
Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79
T L + F P + R L+ + L V + + A + G +
Sbjct: 701 TYARDDAGQLIKSFMLFKTTPFAGFR-QLVNRANDLDTVPAIKFLASY-IAGTTLAGMFA 758
Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----P 128
+ L++G +P LD + PT +++AL+ G + ++ + + SS +G
Sbjct: 759 NQMNSLLTGNDP-LDMTKPTTWVQALLKGGSFGIYGDFLFQDHTQYGSSIAATIGGPVLS 817
Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184
++ Q KL I ++A+ E T FG + + T PF NLWYA+ NH
Sbjct: 818 FAEQLTKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHL 868
Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208
+ + ++ NPG R QR+
Sbjct: 869 ILQQLQEMANPGYNDRVRDRAQRE 892
>gi|301028422|ref|ZP_07191668.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|299878533|gb|EFI86744.1| conserved hypothetical protein [Escherichia coli MS 196-1]
Length = 918
Score = 128 bits (321), Expect = 5e-28, Method: Composition-based stats.
Identities = 48/204 (23%), Positives = 83/204 (40%), Gaps = 27/204 (13%)
Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79
T L + F P + R L+ L V + + A + G +
Sbjct: 701 TYARDDAGELMKSFMLFKTTPFAGFR-QLVNRTRDLDTVPAIKFLASY-IGGTTLAGMFA 758
Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLG----P 128
+ L++G +P LD + PT +++AL+ G + ++ + + SS +G
Sbjct: 759 IQMNSLLNGNDP-LDMTKPTTWVQALLKGGSFGIYGDFIFQDHTQYGSSIGATMGGPVLS 817
Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184
++ Q KL I ++A+ E T FG + + T PF NLWYA+ NH
Sbjct: 818 FAEQLTKLLITNPQKALQGEET---------SFGADALKTARMITPFANLWYAKAITNHL 868
Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208
+ + ++ NPG R QR+
Sbjct: 869 ILQQLQEMANPGYNDRVRDRAQRE 892
>gi|330007168|ref|ZP_08305910.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3]
gi|328535515|gb|EGF61975.1| hypothetical protein HMPREF9538_03599 [Klebsiella sp. MS 92-3]
Length = 924
Score = 120 bits (301), Expect = 1e-25, Method: Composition-based stats.
Identities = 44/201 (21%), Positives = 83/201 (41%), Gaps = 27/201 (13%)
Query: 23 DGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTL 82
+ +L + F P++ R + + L + + + A A V G + +
Sbjct: 710 RDTSGDLLKSFMLFKTTPMAGMRQFVTRL-QDLETMPAVKFFA-AYVAGTTLAGMFANQM 767
Query: 83 VPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLGP----WSS 131
L+SG +P LD + P +++AL+ G + ++ + + SS +LG ++
Sbjct: 768 NALLSGNDP-LDMTKPQTWLQALLKGGSFGIYGDFLFQDHTQYGSSIAGILGGPVLGFAE 826
Query: 132 QAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHFVRN 187
Q K + ++A+ E T F + + T PF NLWY + NH +
Sbjct: 827 QLSKTVLTNSQKAMAGEET---------TFTADALKTARMITPFANLWYTKAITNHLILQ 877
Query: 188 SIDDVLNPGGRARAEVYRQRQ 208
+ ++ NPG AR R+
Sbjct: 878 QLQEMANPGYNARVRDRAMRE 898
>gi|304398390|ref|ZP_07380264.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB]
gi|304354256|gb|EFM18629.1| hypothetical protein PanABDRAFT_3525 [Pantoea sp. aB]
Length = 921
Score = 114 bits (285), Expect = 9e-24, Method: Composition-based stats.
Identities = 46/204 (22%), Positives = 77/204 (37%), Gaps = 27/204 (13%)
Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79
T L + F P + R ++ +L V + + A A + G +
Sbjct: 704 TYARDQGGELYKSFMLFKTTPFAGFR-QMVTRAQNLDRVPALKFLA-AYIGGTTLTGMFA 761
Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFNSSGWDVLGP---- 128
L L+SG +P +D + P ++ A + G ++ + + SS LG
Sbjct: 762 NQLNALLSGNDP-IDMTKPGAWVGATLKGGGFGIYGDFLFQDHTQYGSSIAATLGGPSLG 820
Query: 129 WSSQAGKLAIAG-KEAVWDEGTRKQRGKAQAQFGKELVNT---FVPFQNLWYARGAFNHF 184
+ KL I ++A+ E T FG + + T PF NLWY + NH
Sbjct: 821 LAESLMKLLITNPQKAMQGEET---------SFGADAIKTARMITPFANLWYTKAVTNHL 871
Query: 185 VRNSIDDVLNPGGRARAEVYRQRQ 208
+ + ++ NPG R Q Q
Sbjct: 872 ILQQLQEMANPGYNDRVRDRAQNQ 895
>gi|315121758|ref|YP_004062247.1| hypothetical protein CKC_00040 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495160|gb|ADR51759.1| hypothetical protein CKC_00040 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 107
Score = 93.0 bits (229), Expect = 2e-17, Method: Composition-based stats.
Identities = 45/98 (45%), Positives = 58/98 (59%), Gaps = 6/98 (6%)
Query: 42 SWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEY 101
S L +L+G SS + L++ EELI+ LVPLISG EP+ D + P +Y
Sbjct: 12 SLFPHFLFVRSKALLGRSSIL----ILLVEYANEELIKNVLVPLISGNEPRFDITSPRDY 67
Query: 102 IKALINGITHYERFSPFNS--SGWDVLGPWSSQAGKLA 137
KA++N ITHYERFSP S WD+LGP QAG+L
Sbjct: 68 AKAIVNAITHYERFSPLGGGQSKWDILGPALGQAGRLG 105
>gi|319793417|ref|YP_004155057.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS]
gi|315595880|gb|ADU36946.1| hypothetical protein Varpa_2748 [Variovorax paradoxus EPS]
Length = 838
Score = 86.1 bits (211), Expect = 3e-15, Method: Composition-based stats.
Identities = 44/218 (20%), Positives = 80/218 (36%), Gaps = 27/218 (12%)
Query: 6 RGSVGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRA 65
R ++ S +Q W L R + F MPI+ H E S S+
Sbjct: 607 RAALYSNLQRGTW-------KGELTRSVFLFKTMPIAMLMRH-WERGMSGPDARSKAGYI 658
Query: 66 KALVIGILGEELIRKTLVPLISGKEPQ-----LDFSDPTEYIKALINGITH-------YE 113
AL++ ++ + L+ G++P + +++A + G + +
Sbjct: 659 GALMVSTTVMGMLALQIDELLKGRDPVNMNPFEGKAGARNWVRAFLKGGSLGIYGDFLFS 718
Query: 114 RFSPFNSS-GWDVLGPWSSQAGK-LAIAGKEAVW-DEGTRKQRGKAQAQFGKELVNTFVP 170
+ LGP + + V +G G +F K + P
Sbjct: 719 EQNQHGGGPIASALGPVVGAVEEAFGLTQGNLVQLGQGKDTHAGAELLKFAKGM----TP 774
Query: 171 FQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQ 208
NLWY + A NH + N + ++++PG AR + QR+
Sbjct: 775 GANLWYLKAATNHLIFNQLQEMVSPGYLARVKSRAQRE 812
>gi|309702799|emb|CBJ02130.1| hypothetical phage protein [Escherichia coli ETEC H10407]
Length = 825
Score = 84.5 bits (207), Expect = 8e-15, Method: Composition-based stats.
Identities = 48/224 (21%), Positives = 84/224 (37%), Gaps = 37/224 (16%)
Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68
VGS +Q W L R + F PIS H S +G+ S RA +
Sbjct: 611 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYI 659
Query: 69 ---VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFS 116
+ + + LI+G+ P+ D +I A + G + +
Sbjct: 660 ATFLASTTMLGALSMQITDLINGRNPKEMTGDHMVKFWINAFLKGGGAGLYGDFLFSDHT 719
Query: 117 PFNS-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQN 173
+ S + +LGP + + + + EG +Q G + GK L +P N
Sbjct: 720 RYGSGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGAN 775
Query: 174 LWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
LWY + A +H + N + + +PG + E + +++ N
Sbjct: 776 LWYLKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 812
>gi|332160979|ref|YP_004297556.1| hypothetical protein YE105_C1357 [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|325665209|gb|ADZ41853.1| Hypothetical phage protein [Yersinia enterocolitica subsp.
palearctica 105.5R(r)]
gi|330862135|emb|CBX72299.1| hypothetical protein YEW_AK02360 [Yersinia enterocolitica W22703]
Length = 841
Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 39/201 (19%), Positives = 72/201 (35%), Gaps = 13/201 (6%)
Query: 20 TGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIR 79
T + + R QF PI+ H + G Y A + L +
Sbjct: 624 TTRGTWSGEIWRSATQFKSFPIAMVMRHAHR-ALAQDGAGKGTYAAAIIAASTLLGG-MA 681
Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH--YERF-----SPFNSSG-WDVLGPWSS 131
L + SG++P+ D + P + A + G Y F + +S + GP +
Sbjct: 682 IQLNEIASGRDPR-DMTKPEFWGGAFLKGGALGLYGDFLLTNQTQGGNSFIASIGGPLAG 740
Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191
+ + A + + A + P NLWYA+ A +H + + I +
Sbjct: 741 DIESVVKMTQGAAFK--AIDGKDPHTAANVVRFIKGHTPGANLWYAKAALDHMIFHDIQE 798
Query: 192 VLNPGGRARAEVYRQRQKYKK 212
+PG +R Q++ ++
Sbjct: 799 QFSPGYLSRMRQRAQKEYDQQ 819
>gi|300898440|ref|ZP_07116781.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|300357907|gb|EFJ73777.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length = 824
Score = 83.8 bits (205), Expect = 1e-14, Method: Composition-based stats.
Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 31/221 (14%)
Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68
VGS +Q W L R + F PIS H Y A L
Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFL 662
Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119
+ + + LI+G+ P+ D +I A + G + + +
Sbjct: 663 ASTTML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721
Query: 120 S-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176
S + +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWY 777
Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|298381705|ref|ZP_06991304.1| conserved hypothetical protein [Escherichia coli FVEC1302]
gi|298279147|gb|EFI20661.1| conserved hypothetical protein [Escherichia coli FVEC1302]
Length = 824
Score = 83.8 bits (205), Expect = 2e-14, Method: Composition-based stats.
Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 31/221 (14%)
Query: 9 VGSTIQDKRWITGKDGSVNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKAL 68
VGS +Q W L R + F PIS H Y A L
Sbjct: 610 VGSGLQRGTW-------KGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFL 662
Query: 69 VIGILGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFN 119
+ + + LI+G+ P+ D +I A + G + + +
Sbjct: 663 ASTTML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYG 721
Query: 120 S-SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWY 176
S + +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 722 SGALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWY 777
Query: 177 ARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 778 LKAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|331648163|ref|ZP_08349253.1| hypothetical protein ECIG_04089 [Escherichia coli M605]
gi|331043023|gb|EGI15163.1| hypothetical protein ECIG_04089 [Escherichia coli M605]
Length = 824
Score = 82.6 bits (202), Expect = 4e-14, Method: Composition-based stats.
Identities = 45/217 (20%), Positives = 79/217 (36%), Gaps = 28/217 (12%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H Y A L
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHWHRAMGMPSAGGRAAYIATFLASTT 666
Query: 73 LGEELIRKTLVPLISGKEPQLDFSDP--TEYIKALINGIT-------HYERFSPFNS-SG 122
+ + + LI+G+ P+ D +I A + G + + + S +
Sbjct: 667 ML-GALSMQITDLINGRNPKEMTGDNMVKFWINAFLKGGGAGLYGDFLFSDHTRYGSGAL 725
Query: 123 WDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGA 180
+LGP + + + EG +Q G + GK L +P NLWY + A
Sbjct: 726 ASMLGPVVGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYLKAA 781
Query: 181 FNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+H + N + + +PG + E + +++ N
Sbjct: 782 LDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|85059173|ref|YP_454875.1| hypothetical protein SG1195 [Sodalis glossinidius str. 'morsitans']
gi|84779693|dbj|BAE74470.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans']
Length = 824
Score = 81.8 bits (200), Expect = 7e-14, Method: Composition-based stats.
Identities = 33/195 (16%), Positives = 67/195 (34%), Gaps = 19/195 (9%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLI 86
L R + F PI+ H + Y A L + + + + +I
Sbjct: 621 GELVRSVFLFKSFPIAVMMRHWSRALNMPSAGGRAAYLAAFLASTTVL-GAMSQQISEVI 679
Query: 87 SGKEPQLDFSDPTEYI----------KALINGITHYERFSPFNS-SGWDVLGPWSSQAGK 135
+G+ P+ D + A + G + + S + +LGP +
Sbjct: 680 AGRNPR-DITGDKALQFWVNAFLKGGGAGLYGDFLLSDHTRYGSGALASMLGPVAGVVDD 738
Query: 136 LAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVL 193
+ + + + + + +P QNLWY + F+H V N + ++
Sbjct: 739 ----AIKLLQGIPLNAVEGKPEQTGGDLVKFAKGMIPGQNLWYTKAVFDHMVFNQLQEIF 794
Query: 194 NPGGRARAEVYRQRQ 208
+PG R E +++
Sbjct: 795 SPGYLRRMEKRSRKE 809
>gi|268589387|ref|ZP_06123608.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131]
gi|291315414|gb|EFE55867.1| hypothetical protein PROVRETT_05519 [Providencia rettgeri DSM 1131]
Length = 823
Score = 81.1 bits (198), Expect = 1e-13, Method: Composition-based stats.
Identities = 42/198 (21%), Positives = 70/198 (35%), Gaps = 25/198 (12%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGE---ELIRKTLV 83
+ R F PIS H +G+ S R L I G I + +
Sbjct: 619 GEIVRSFFLFKSFPISVVVRHW----KRALGIQSAGGRVAYLAAFIAGTTVLGAISQQIN 674
Query: 84 PLISGKEPQLDFSDP----------TEYIKALINGITHYERFSPFNSS-GWDVLGPWSSQ 132
+ SG+ P+ D +D + + G + + S +LGP +
Sbjct: 675 DISSGRNPR-DMADENWHKFWLNALLKGGGLGLYGDFLLSDHTKYGSDAFASLLGPVAGV 733
Query: 133 AGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSID 190
+ + EG +Q G +F V +P QNLWY + +H V N +
Sbjct: 734 VDDAIKLAQGIPLNAVEGKPEQTGGDTVKF----VKGLIPGQNLWYTKAVLDHMVFNQLQ 789
Query: 191 DVLNPGGRARAEVYRQRQ 208
+ +PG R E +++
Sbjct: 790 EYFSPGYLRRMEKRSKKE 807
>gi|288959378|ref|YP_003449719.1| hypothetical protein AZL_025370 [Azospirillum sp. B510]
gi|288911686|dbj|BAI73175.1| hypothetical protein AZL_025370 [Azospirillum sp. B510]
Length = 995
Score = 79.9 bits (195), Expect = 2e-13, Method: Composition-based stats.
Identities = 46/199 (23%), Positives = 71/199 (35%), Gaps = 25/199 (12%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALV---IGILGEELIRKTLV 83
R +GQF P++ + L G RA +V + + L
Sbjct: 788 GEALRFVGQFKAFPVAVISK-VW--GRDLYGGERGWGRAAGIVHTLVATTVMGYVAGMLK 844
Query: 84 PLISGKEPQLDFSDPTEYIKAL-------INGITHYERFSPFNSSG-WDVLGPWSSQAGK 135
L G+ P+ D +DP + A I G ++S F + GP S AG+
Sbjct: 845 DLSKGRAPR-DPTDPRAWGAAFLQGGGAGIYGDFLLGQYSRFGNRFLESAAGPTLSSAGE 903
Query: 136 LAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNP 195
L A EG ++ + + PF NL+Y R A ++ + + +NP
Sbjct: 904 LL--NIWAGAREGNDEKAATLRWTL------SNTPFVNLFYTRMALDYLFLYQVQEAMNP 955
Query: 196 GGRARAEVYRQRQKYKKQR 214
G R E K QR
Sbjct: 956 GFLRRFEQR--VAKDNNQR 972
>gi|315122308|ref|YP_004062797.1| hypothetical protein CKC_02800 [Candidatus Liberibacter
solanacearum CLso-ZC1]
gi|313495710|gb|ADR52309.1| hypothetical protein CKC_02800 [Candidatus Liberibacter
solanacearum CLso-ZC1]
Length = 56
Score = 78.4 bits (191), Expect = 7e-13, Method: Composition-based stats.
Identities = 30/55 (54%), Positives = 43/55 (78%)
Query: 162 KELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216
KE++NT VPFQNLWY + F++FVR +DD +NPG RARAE YR++ +++RK+
Sbjct: 2 KEVLNTTVPFQNLWYTKSVFDYFVRGKLDDAINPGNRARAEAYRRKNIQREKRKK 56
>gi|167032768|ref|YP_001667999.1| hypothetical protein PputGB1_1760 [Pseudomonas putida GB-1]
gi|166859256|gb|ABY97663.1| conserved hypothetical protein [Pseudomonas putida GB-1]
Length = 855
Score = 78.4 bits (191), Expect = 7e-13, Method: Composition-based stats.
Identities = 43/221 (19%), Positives = 77/221 (34%), Gaps = 39/221 (17%)
Query: 22 KDGSVNN-LARLMGQFLVMPISWSRMHL-----------IEIPSSLVG-----VSSQVYR 64
+ G+V L R + QF P ++ + L + +S G + +
Sbjct: 627 QPGTVPGDLLRFVTQFKSFPAAYMQKTLGRELYGRGYTPTALGNSFRGGRDLVQALRNGN 686
Query: 65 AKALVIGILGE-----ELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH-------Y 112
+ L + L + + G+EP+ DP ++ A++ G +
Sbjct: 687 GERLALAQLMLWTTAFGYLSMASKDVTKGREPR-PADDPKTWLAAMVQGGGLGIFGDYLF 745
Query: 113 ERFSPFN-SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171
+ F S+ GP A + A + T A L PF
Sbjct: 746 GEANRFGNSALESAAGPTIGTAADVINLWARAKEGDDT--------ASSALRLAQNNTPF 797
Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKK 212
NL+Y R A +H S+ + +NPG R E ++Q ++
Sbjct: 798 MNLFYTRIALDHLFLYSVQEAMNPGSLRRTEERIRQQNGQE 838
>gi|298485996|ref|ZP_07004070.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
gi|298159473|gb|EFI00520.1| predicted phage protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
Length = 831
Score = 77.6 bits (189), Expect = 1e-12, Method: Composition-based stats.
Identities = 33/189 (17%), Positives = 72/189 (38%), Gaps = 18/189 (9%)
Query: 36 FLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF 95
F ++ H + Y A G+L + L+ +++G++P+ D
Sbjct: 627 FKSFGLAMFERHWKRVSQIESTGGKLAYSASVFT-GLLMAGAMTNQLMDIMNGRDPR-DM 684
Query: 96 SDPTEYIKALINGIT--HYERFSPFN---------SSGWDVLGPWSSQAGKLAIAGKEAV 144
D +++A++ G + S+ +LGP A + +
Sbjct: 685 KDGKFWLQAMLRGGGVGIFGDILNTGLGGDNRGGQSNLTGLLGPVYGTAADVGLTLGSVF 744
Query: 145 WDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVY 204
++ G + G + PF WY + AF H V + + ++L+PG +R +
Sbjct: 745 KEKTEPADVGANLLRIGYQ----NTPFIRSWYTKAAFEHAVMHDMQEMLSPGYLSRMK-K 799
Query: 205 RQRQKYKKQ 213
R ++ + ++
Sbjct: 800 RAKKDFNQR 808
>gi|169795397|ref|YP_001713190.1| putative phage related protein [Acinetobacter baumannii AYE]
gi|169148324|emb|CAM86189.1| conserved hypothetical protein; putative phage related protein
[Acinetobacter baumannii AYE]
Length = 841
Score = 77.6 bits (189), Expect = 1e-12, Method: Composition-based stats.
Identities = 45/221 (20%), Positives = 83/221 (37%), Gaps = 19/221 (8%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66
V + +++K I G G++ + R + QF ++ H + G+ + A
Sbjct: 605 VEAGLREKTLINVGARGTITGEIVRGLAQFKSFSAAFLMRHGSRAFAQ-EGIKGKAGYAV 663
Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118
L + + + L L++G +PQ D +DP + I G++
Sbjct: 664 PLFVTLTLLGGLVVQLKELLNGNDPQTIYDSNDPKKAGSFFIRSAVQGGGLSFLGDILVA 723
Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ V GP + L + K F + V +P Q
Sbjct: 724 GTDTSGRDANSFVAGPLGNDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 781
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
NLWY + A N V + + D + PG R +A +RQ+ +++
Sbjct: 782 NLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 822
>gi|260548934|ref|ZP_05823156.1| conserved hypothetical protein [Acinetobacter sp. RUH2624]
gi|260408102|gb|EEX01573.1| conserved hypothetical protein [Acinetobacter sp. RUH2624]
Length = 841
Score = 74.9 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 44/221 (19%), Positives = 81/221 (36%), Gaps = 19/221 (8%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66
+ + +++K I G G++ + R + QF ++ H + Y
Sbjct: 605 IEAGLREKTLINVGARGTITGEIFRGIVQFKSFSAAFLMRHGSRTMAQEGLKGKAAYAIP 664
Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118
V+ L L+ + L L++G +PQ D +DP + + G++
Sbjct: 665 LFVMTTLLGGLVVQ-LKELLNGNDPQTIYDSNDPKKASNFFVRSAVQGGGLSFLGDILVA 723
Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ V GP S L + K F + V +P Q
Sbjct: 724 GTDTSGRDAHSFVAGPLGSDFESLLSLTVGNLTQYNEGKDTNFGNEAF--QFVKRKIPAQ 781
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
NLWY + A N V + I D + PG R +A + ++ +++
Sbjct: 782 NLWYTKAAINRMVFDEIQDFIAPGYREKALRKAEEKQDRER 822
>gi|293609607|ref|ZP_06691909.1| conserved hypothetical protein [Acinetobacter sp. SH024]
gi|292828059|gb|EFF86422.1| conserved hypothetical protein [Acinetobacter sp. SH024]
Length = 1175
Score = 74.5 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 43/221 (19%), Positives = 84/221 (38%), Gaps = 19/221 (8%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66
+ + ++++ W+T G G++ + + + QF S M + G+ + A
Sbjct: 939 IEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGLKGKAAYAI 997
Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118
L++ + + L +++G +PQ D +DP + + G+
Sbjct: 998 PLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVA 1057
Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ V GP S L + K F + V +P Q
Sbjct: 1058 GTDTSGRDANSFVSGPLGSDFTSLLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 1115
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
NLWY + A N V + + D + PG R +A +RQ+ +++
Sbjct: 1116 NLWYTKAAINRMVFDEMQDTIAPGYREKALRKAERQQDRER 1156
>gi|320175029|gb|EFW50142.1| 17 [Shigella dysenteriae CDC 74-1112]
Length = 582
Score = 74.5 bits (181), Expect = 9e-12, Method: Composition-based stats.
Identities = 46/220 (20%), Positives = 83/220 (37%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 365 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 420
Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120
+ + L L SG+ P+ + D ++ + G + + S
Sbjct: 421 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 480
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ + GP + + + + EG +Q G + GK L +P NLWY
Sbjct: 481 GALASMFGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 536
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 537 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 569
>gi|324008547|gb|EGB77766.1| hypothetical protein HMPREF9532_01734 [Escherichia coli MS 57-2]
Length = 824
Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662
Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120
+ + L L SG+ P+ + D ++ + G + + S
Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|323156120|gb|EFZ42279.1| hypothetical protein ECEPECA14_1895 [Escherichia coli EPECa14]
Length = 824
Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662
Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120
+ + L L SG+ P+ + D ++ + G + + S
Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|89152441|ref|YP_512274.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10]
gi|74055464|gb|AAZ95913.1| hypothetical protein PhiV10p20 [Escherichia phage phiV10]
Length = 824
Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGIPSAGGRAAYIATFI 662
Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120
+ + L L SG+ P+ + D ++ + G + + S
Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGGDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 778
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|117624699|ref|YP_853612.1| hypothetical protein APECO1_4054 [Escherichia coli APEC O1]
gi|115513823|gb|ABJ01898.1| conserved hypothetical protein [Escherichia coli APEC O1]
gi|323948672|gb|EGB44577.1| hypothetical protein ERKG_04895 [Escherichia coli H252]
Length = 824
Score = 74.5 bits (181), Expect = 1e-11, Method: Composition-based stats.
Identities = 47/220 (21%), Positives = 84/220 (38%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662
Query: 73 LGE---ELIRKTLVPLISGKEPQ-LDFSDPTEY--------IKALINGITHYERFSPFNS 120
+ + L L SG+ P+ + D ++ + G + + S
Sbjct: 663 ASTTILGALSQQLNDLASGRNPREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKSEQTGGDLVKLGKGL----MPGANLWYL 778
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|303328566|ref|ZP_07359001.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
gi|302861332|gb|EFL84271.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
Length = 855
Score = 74.1 bits (180), Expect = 1e-11, Method: Composition-based stats.
Identities = 40/215 (18%), Positives = 76/215 (35%), Gaps = 33/215 (15%)
Query: 25 SVNNLARLMGQFLVMPISWSRMHL----IEIPSSLVGV-----------SSQVYRAKALV 69
+ R + QF PI++ + L G+ + R +
Sbjct: 632 GAGEVWRAIMQFKSFPIAYMQRVLGGRRWVRGDLQRGMRYGPRNLPGAVEDALTRDMGGL 691
Query: 70 IGILGE----ELIRKTLVPLISGKEPQLDFSDPTEYIKAL------INGITHYERFSPFN 119
+G + TL L G+EP+ T A+ I G + + + F
Sbjct: 692 MGFVLSSVAFGYASMTLKDLAKGREPRSLAHRETWLAAAMQSGGAGIFGDILFGKVNRFG 751
Query: 120 SSGWDV-LGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYAR 178
+S + +GP G A G + V + + G PF NLWY R
Sbjct: 752 NSFAETAVGPLGGLIGDAATLGGQLVRGDMADAGEDTLRLAMG------NAPFINLWYTR 805
Query: 179 GAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
A + + + ++++PG R E + ++++ ++
Sbjct: 806 AALDWMLLYHVREMMSPGTLRRTE-RKMKKEFGQE 839
>gi|294843482|ref|ZP_06788165.1| putative phage related protein [Acinetobacter sp. 6014059]
Length = 841
Score = 73.7 bits (179), Expect = 1e-11, Method: Composition-based stats.
Identities = 42/221 (19%), Positives = 83/221 (37%), Gaps = 19/221 (8%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAK 66
+ + ++++ W+T G G++ + + + QF S M + G+ + A
Sbjct: 605 IEAGLRERTWMTVGAKGTITGEVFKGLMQFKSFSAS-FLMRQGSRAMAQEGLKGKAAYAI 663
Query: 67 ALVIGILGEELIRKTLVPLISGKEPQ--LDFSDPTEYIKALIN------GITHYERFSPF 118
L++ + + L +++G +PQ D +DP + + G+
Sbjct: 664 PLMVSMTLLGGLVVQLREILNGNDPQTIYDSNDPKKATSFFMRSLVAGGGLPVLGDILVA 723
Query: 119 NSSGWD------VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQ 172
+ V GP S L + K F + V +P Q
Sbjct: 724 GTDTSGRDANSFVSGPLGSDFTALLGLTVGNLTQYNEGKDTNFGNEAF--KFVKGKIPAQ 781
Query: 173 NLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
NLWY + A N + + D + PG R +A +RQ+ +++
Sbjct: 782 NLWYTKAAINRMFFDEVQDTIAPGYREKALRKAERQQDRER 822
>gi|215487808|ref|YP_002330239.1| hypothetical protein E2348C_2741 [Escherichia coli O127:H6 str.
E2348/69]
gi|215265880|emb|CAS10289.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
Length = 824
Score = 72.6 bits (176), Expect = 3e-11, Method: Composition-based stats.
Identities = 41/206 (19%), Positives = 74/206 (35%), Gaps = 30/206 (14%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGE---ELIRKTLV 83
L R + F PIS H S +G+ S RA + I + + L
Sbjct: 621 GELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFIASTTILGALSQQLN 676
Query: 84 PLISGKEPQ---------LDFSDPTEYIKALINGITHYERFSPFNS-SGWDVLGPWSSQA 133
+ SG+ P+ + + G + + S + +LGP +
Sbjct: 677 DMASGRNPRDMVGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGSGALASMLGPVAGLV 736
Query: 134 GKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191
+ G+ + EG +Q G + GK L P N+WY + A +H + N + +
Sbjct: 737 DDVIKIGQGIPLNAVEGKSEQTGGDLVKLGKGL----TPGANIWYLKAALDHMIFNQMQE 792
Query: 192 VLNPGGRARAEVYRQRQKYKKQRKRN 217
+PG + E + +++ N
Sbjct: 793 YFSPGYLRKME-------QRSKKEFN 811
>gi|221213942|ref|ZP_03586915.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
gi|221166119|gb|EED98592.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
Length = 864
Score = 72.6 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 39/223 (17%), Positives = 78/223 (34%), Gaps = 28/223 (12%)
Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPS----------SLVGVSSQ 61
++ K + G+ L + QF PI+ H I +++
Sbjct: 619 LRTKVIASATPGTAMGELKKTFMQFKSFPIAMISRHWGRIGDMRRSGDFRVDGAPALANP 678
Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING------------- 108
+ A ALV+ I + L++GK+P+ F D
Sbjct: 679 MAYAAALVVSTTLIGAISTQVKNLLAGKDPEPMFDDVKHAAGFWTRAFSVGGGAGFAGDM 738
Query: 109 ITHYERFSPFNSSGWDVLG-PWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT 167
+T + + S V+G P S ++ A + + + ++ +
Sbjct: 739 LTASFESTDYGSLLGSVVGGPLPSTIYQVVRAFSSNAQ--DAAQGKDTHVSADLLKVAQS 796
Query: 168 FVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210
P NLW+ + +N + +++ + L+PG R + R R +Y
Sbjct: 797 NTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NINRSRNQY 838
>gi|167041093|gb|ABZ05854.1| hypothetical protein ALOHA_HF400048F7ctg1g21 [uncultured marine
microorganism HF4000_48F7]
Length = 828
Score = 72.6 bits (176), Expect = 4e-11, Method: Composition-based stats.
Identities = 43/206 (20%), Positives = 79/206 (38%), Gaps = 28/206 (13%)
Query: 32 LMGQFLV--------MPISWSRMHLIEIPS-SLVGVSSQVYRAKAL-------VIGILGE 75
MG+F P++ + + S L + Q RA + ++ ++
Sbjct: 607 FMGRFFTGEEGIKSGTPMAMANKLFWQFRSFGLTMLFRQWPRAYEMGLPSFYHLVPMVLM 666
Query: 76 ELIRKTLVPLISGKEPQLDFSDPTEYIKAL--------INGITHYERFSPFNSSGWDVL- 126
+ + ++ G+E + DP + A I G + + +++S D+L
Sbjct: 667 GYVAMAMKDILKGRELKDVVEDPGKIAVASVLQSGFGGIAGDFLFNDYRQYSTSYVDLLA 726
Query: 127 GPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVR 186
GP S LA G A + A G V +P+ N W +R F++ +
Sbjct: 727 GPSGSSLNDLAEFG--ATTFDVATGGDPVDAAAAGWRAVKGNIPYANWWASRTLFDYLIN 784
Query: 187 NSIDDVLNPGGRARAEVYRQRQKYKK 212
+ ++LNPG R E R +QK +
Sbjct: 785 YQVQEILNPGSLRRME-RRFKQKNNQ 809
>gi|294648411|ref|ZP_06725910.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
gi|292825716|gb|EFF84420.1| phage protein [Acinetobacter haemolyticus ATCC 19194]
Length = 854
Score = 72.2 bits (175), Expect = 5e-11, Method: Composition-based stats.
Identities = 41/201 (20%), Positives = 75/201 (37%), Gaps = 23/201 (11%)
Query: 22 KDGSVNN-LARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRK 80
+ G+V N L+R QF P++ + VY AK + L+ +
Sbjct: 640 ERGTVGNELSRFFWQFKQFPLAMIMRQWTRGMAQGTPQEKFVYFAKLFAYTTVMGALVSQ 699
Query: 81 TLVPLISGKEPQLDFSDPTE---YIKALINGIT--HYERFSPFNSS-----GWDVLGPWS 130
+ L GK+ DPT Y+K+++ G + S D + P +
Sbjct: 700 -IQNLTQGKDLD----DPTTLDFYMKSIVKGGSASFLADAISATSDPTERSVKDFIIPAA 754
Query: 131 ---SQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRN 187
+ ++G + + G +V +PFQNLWY+R F+ V
Sbjct: 755 FKDITSIGTMVSGAGSAFITERDSSYGAEAVN----VVKNNIPFQNLWYSRLVFDRLVIA 810
Query: 188 SIDDVLNPGGRARAEVYRQRQ 208
+ ++ + G R R + ++
Sbjct: 811 EMQELFDEGYRERKQRRQENN 831
>gi|327252171|gb|EGE63843.1| hypothetical protein ECSTEC7V_3018 [Escherichia coli STEC_7v]
Length = 824
Score = 71.8 bits (174), Expect = 7e-11, Method: Composition-based stats.
Identities = 43/220 (19%), Positives = 81/220 (36%), Gaps = 34/220 (15%)
Query: 17 RWITG---KDGS-VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGI 72
+ ITG + G+ L R + F PIS H S +G+ S RA + I
Sbjct: 607 QLITGSGIQRGTWKGELTRSVFLFKSFPISVVMRHW----SRAMGMPSAGGRAAYIATFI 662
Query: 73 LGEELIRKTLVPL-----------ISGKEP-QLDFSDPTEYIKALINGITHYERFSPFNS 120
++ L ++G++ + + + G + + S
Sbjct: 663 ASTTILGALSQQLNDLASGRNHREMTGEDAAKFWLGALLKGGGLGLYGDFLLSDHTRYGS 722
Query: 121 -SGWDVLGPWSSQAGKLAIAGKEAVWD--EGTRKQRGKAQAQFGKELVNTFVPFQNLWYA 177
+ +LGP + + + + EG +Q G + GK L +P NLWY
Sbjct: 723 GALASMLGPVAGLVDDVVKIAQGIPLNAVEGKNEQTGGDLVKLGKGL----MPGANLWYL 778
Query: 178 RGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKRN 217
+ A +H + N + + +PG + E + +++ N
Sbjct: 779 KAALDHMIFNQMQEYFSPGYLRKME-------QRSKKEFN 811
>gi|291336673|gb|ADD96216.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377]
Length = 101
Score = 69.1 bits (167), Expect = 4e-10, Method: Composition-based stats.
Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 6/94 (6%)
Query: 106 INGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELV 165
I + S+ +GP ++A ++ A A+ EG + + + +
Sbjct: 9 IYTDFLFGNIQNSTSALATAVGPIPTEAARVLSALNYAIKGEGGKAGKQAYYS------I 62
Query: 166 NTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199
+PF NL+Y + AF++ + + + L+PG
Sbjct: 63 KENIPFLNLFYIKTAFDYMIGYQMMETLSPGSLK 96
>gi|48696644|ref|YP_024423.1| hypothetical protein VP2p19 [Vibrio phage VP2]
gi|40950042|gb|AAR97633.1| hypothetical protein [Vibrio phage VP2]
Length = 782
Score = 68.3 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 38/196 (19%), Positives = 75/196 (38%), Gaps = 12/196 (6%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVI-----GILGEELIRKT 81
L R + F PI+ M+ + G S R A I +LG +I+
Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQ-- 628
Query: 82 LVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGK--LAIA 139
+++GK+P+ SDP +I+ + G + ++ +S G LA
Sbjct: 629 AKDILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYG 687
Query: 140 GKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199
A+ K ++ +PF NLWY + A + + + I + +P
Sbjct: 688 DWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDPEY-D 746
Query: 200 RAEVYRQRQKYKKQRK 215
+ ++ + R+ + ++
Sbjct: 747 KKQLNKMRKMQRTSQQ 762
>gi|48696687|ref|YP_024981.1| hypothetical protein VP5_gp18 [Vibrio phage VP5]
gi|40806150|gb|AAR92068.1| hypothetical protein [Vibrio phage VP5]
Length = 782
Score = 68.3 bits (165), Expect = 7e-10, Method: Composition-based stats.
Identities = 38/196 (19%), Positives = 75/196 (38%), Gaps = 12/196 (6%)
Query: 27 NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVI-----GILGEELIRKT 81
L R + F PI+ M+ + G S R A I +LG +I+
Sbjct: 572 GELHRSLFMFHSFPITTI-MNQWRRVFTGKGYSGAFDRMSAAAIMVGATSVLGVGIIQ-- 628
Query: 82 LVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGK--LAIA 139
+++GK+P+ SDP +I+ + G + ++ +S G LA
Sbjct: 629 AKDILNGKKPR-SMSDPKLWIEGMAQGGSFNYIGDLMRNAASGYSHDMTSYVGGPVLAYG 687
Query: 140 GKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRA 199
A+ K ++ +PF NLWY + A + + + I + +P
Sbjct: 688 DWVAMTAADMAKGDAESAMARTANFATQQIPFNNLWYTKIATDRLLMDRIRRLSDPEY-D 746
Query: 200 RAEVYRQRQKYKKQRK 215
+ ++ + R+ + ++
Sbjct: 747 KKQLNKMRKMQRTSQQ 762
>gi|48697207|ref|YP_024937.1| hypothetical protein BcepC6B_gp17 [Burkholderia phage BcepC6B]
gi|47779013|gb|AAT38376.1| gp17 [Burkholderia phage BcepC6B]
Length = 864
Score = 66.0 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 41/223 (18%), Positives = 81/223 (36%), Gaps = 28/223 (12%)
Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPS----------SLVGVSSQ 61
++ K + G+V L + QF P++ H I +++
Sbjct: 619 LRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGDMRRSGDFRVDGAPALANP 678
Query: 62 VYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTE----YIKALING--------- 108
+ A ALV+ I L++GK+P+ F D + +A G
Sbjct: 679 MAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAGFAGDM 738
Query: 109 ITHYERFSPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNT 167
+ + + + S G + GP S + A V + + ++ +
Sbjct: 739 LVAAFQSADYGSLLGSAIGGPLLSTLFQPLRAVSSNVQ--DAAQGKDTHIGADLLKIAQS 796
Query: 168 FVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKY 210
P NLW+ + +N + +++ + L+PG R + R R +Y
Sbjct: 797 NTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NMNRSRTQY 838
>gi|221201510|ref|ZP_03574549.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|221207934|ref|ZP_03580940.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2]
gi|221172119|gb|EEE04560.1| hypothetical protein BURMUCGD2_2469 [Burkholderia multivorans CGD2]
gi|221178778|gb|EEE11186.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
Length = 869
Score = 66.0 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 42/231 (18%), Positives = 82/231 (35%), Gaps = 33/231 (14%)
Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVG-------------- 57
++ K + G+V L + QF P++ H I +
Sbjct: 619 LRTKVIASATPGTVTGELKKSFMQFKSFPMAMISRHWGRIGNMRRSGDYLVEGAPRAFGI 678
Query: 58 -VSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTE----YIKALING---- 108
+++ + A ALV+ I L++GK+P+ F D + +A G
Sbjct: 679 PLANPMAYAAALVVSTTLIGAISTQAKNLLAGKDPEPMFDDVKHAGGFWTRAFSVGGGAG 738
Query: 109 -----ITHYERFSPFNSS-GWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGK 162
+ + + S G V GP S + A V + +
Sbjct: 739 FAGDMLVAAFESADYGSLLGSAVGGPLLSTLFQPLRAISSNVQ--DAAQGKDTHVGADLL 796
Query: 163 ELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQ 213
++ + P NLW+ + +N + +++ + L+PG R + R R +Y +
Sbjct: 797 KIAQSNTPLVNLWFWKTVWNRLIWDNLAENLSPGVTQR-NMNRSRTQYHNE 846
>gi|262371858|ref|ZP_06065137.1| predicted protein [Acinetobacter junii SH205]
gi|262311883|gb|EEY92968.1| predicted protein [Acinetobacter junii SH205]
Length = 841
Score = 61.0 bits (146), Expect = 1e-07, Method: Composition-based stats.
Identities = 42/211 (19%), Positives = 87/211 (41%), Gaps = 20/211 (9%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPIS-WSRMHLIEIPSSLVGVSSQVYRA 65
+ + ++++ I G+ G++ L R + QF P++ RM + S + A
Sbjct: 626 IEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRMGHRAFAQGDIK-SRVTFLA 684
Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--ITHYERF-----SPF 118
L L LI +T L +GK P+ F+ + K+L+ G ++ P
Sbjct: 685 SLLAYQTLAGALIVQT-QNLANGKNPEPVFTID-FFGKSLLKGGGLSFLGDIMSALSDPT 742
Query: 119 NSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTR--KQRGKAQAQFGKELVNTFVPFQNLW 175
S D + GP Q+ KL + + G + + + + + +P QNLW
Sbjct: 743 GRSASDFISGPLLGQSMKLGM----LLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLW 798
Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206
Y++ + + + + ++++P R + +
Sbjct: 799 YSKLVVDRMLYSKMQNMIDPDYLPRTQQRLE 829
>gi|254251753|ref|ZP_04945071.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158]
gi|124894362|gb|EAY68242.1| hypothetical protein BDAG_00950 [Burkholderia dolosa AUO158]
Length = 865
Score = 59.9 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 41/229 (17%), Positives = 78/229 (34%), Gaps = 37/229 (16%)
Query: 13 IQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVG-----------VSS 60
++ K G++ L + QF PI+ H I S
Sbjct: 620 LRTKVIAAATPGTLQGELQKTFLQFKSFPIAMISRHWGRIGEMRRSGDFRVEGAPTLASP 679
Query: 61 QVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNS 120
Y A ALV+ + L L+ GK+P+ D ++ A + F+
Sbjct: 680 MAYGA-ALVVSTTLLGALAVQLQNLLLGKDPE-PMGDDVKHGGAF-----WFRAFTKGGG 732
Query: 121 SG-------WDVLGPWSSQAGK------LAIAGKEAVWDEGTR-----KQRGKAQAQFGK 162
+G + G ++A L +AV + + +
Sbjct: 733 AGFAGDMLSAMLTGKNPAEAVGSVFGGPLVSTAIQAVTPFSNNAMAAAEGKDTHLSADLL 792
Query: 163 ELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211
+ + +P NLWY + +N + ++I + L+PG +R ++Q +
Sbjct: 793 KFAQSNMPIVNLWYWKTVWNRLIWDNIAENLSPGVTSRNVAKSRQQYHN 841
>gi|226953662|ref|ZP_03824126.1| phage related protein [Acinetobacter sp. ATCC 27244]
gi|226835534|gb|EEH67917.1| phage related protein [Acinetobacter sp. ATCC 27244]
Length = 842
Score = 59.5 bits (142), Expect = 3e-07, Method: Composition-based stats.
Identities = 41/211 (19%), Positives = 87/211 (41%), Gaps = 20/211 (9%)
Query: 9 VGSTIQDKRWIT-GKDGSV-NNLARLMGQFLVMPIS-WSRMHLIEIPSSLVGVSSQVYRA 65
+ + ++++ I G+ G++ L R + QF P++ R+ + S + A
Sbjct: 626 IEAGVRERSIINLGEAGTIQGELGRTLFQFKGFPLAYMFRIGHRAFAQGDIK-SRVTFLA 684
Query: 66 KALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALING--ITHYERF-----SPF 118
L L LI +T L +GK P+ F+ + K+L+ G ++ P
Sbjct: 685 SLLAYQTLAGALIVQT-QNLANGKNPEPVFTID-FFGKSLLKGGGLSFLGDIMSALSDPT 742
Query: 119 NSSGWDVL-GPWSSQAGKLAIAGKEAVWDEGTR--KQRGKAQAQFGKELVNTFVPFQNLW 175
S D + GP Q+ KL + + G + + + + + +P QNLW
Sbjct: 743 GRSASDFISGPLLGQSMKLGM----LLTGMGNNIIEGKESTRMMEVANTLKSNIPLQNLW 798
Query: 176 YARGAFNHFVRNSIDDVLNPGGRARAEVYRQ 206
Y++ + + + + ++++P R + +
Sbjct: 799 YSKLVVDRMLYSKMQNMIDPDYLPRTQQRLE 829
>gi|291336683|gb|ADD96225.1| hypothetical protein Rsph17025_0444 [uncultured organism
MedDCM-OCT-S08-C1350]
Length = 850
Score = 56.4 bits (134), Expect = 3e-06, Method: Composition-based stats.
Identities = 38/210 (18%), Positives = 76/210 (36%), Gaps = 29/210 (13%)
Query: 20 TGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELI 78
+ + G+V + M + PI+ HL VG+ + +++G I
Sbjct: 636 SAQPGTVKGEIVNSMLMYKNFPITLGMTHLSR-GFQQVGLKGKAKYLVPMIVGGAVMGSI 694
Query: 79 RKTLVPLISGKEPQLDFSDPTE-----YIKALINGITH-------YERFSPFNSSG-WDV 125
+ + +GK P + P + ++ A+I G + + + S +
Sbjct: 695 AYEIKQIAAGKTP----TKPEDMGVRYWLNAIIYGGGLGIFGDFLFSDQNRYGGSFSKTL 750
Query: 126 LGPWSSQAGK---LAIAGK-EAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAF 181
GP +S G L + + E T + + + P +LWYAR A
Sbjct: 751 AGPVASFIGDSINLTFGNAAQLISGEKTNAGKE------LAAFIQRYTPGSSLWYARVAL 804
Query: 182 NHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211
+ +SI+ ++NP + + K +
Sbjct: 805 ERILFDSIERLINPDFDSDNRRNINKLKSR 834
>gi|157372110|ref|YP_001480099.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568]
gi|157323874|gb|ABV42971.1| hypothetical protein Spro_3875 [Serratia proteamaculans 568]
Length = 850
Score = 52.9 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 33/212 (15%), Positives = 70/212 (33%), Gaps = 35/212 (16%)
Query: 26 VNNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVYRAKA------------------ 67
+ R GQF S+ + + +++ +++
Sbjct: 635 LGEAIRFGGQFKSFTGSFMQNTIGREIYGRGYTPAELGQSRFTSLANAMRNGNGEKMGLA 694
Query: 68 -LVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKAL-------INGITHYERFSPFN 119
L I + + L+ G+ P+ +D ++ A I G + ++ F
Sbjct: 695 QLFIWMTALGYVSMQTKLLLKGQTPR--PADAKTFLAAAAQGGGLGIMGDFLFGEYNRFG 752
Query: 120 SSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARG 179
G +S + + + + R + G A+A + PF NL R
Sbjct: 753 -------GGLASSLAGPTVGDLDQIRNLFLRARDGDAKAADLLKFGIDHTPFMNLHVVRP 805
Query: 180 AFNHFVRNSIDDVLNPGGRARAEVYRQRQKYK 211
A N+ + N + L+PG R ++++
Sbjct: 806 AMNYLILNRAQEWLSPGSLERYRQRVEKEQGN 837
>gi|262043648|ref|ZP_06016757.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
gi|259038986|gb|EEW40148.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
Length = 974
Score = 52.6 bits (124), Expect = 4e-05, Method: Composition-based stats.
Identities = 40/225 (17%), Positives = 77/225 (34%), Gaps = 41/225 (18%)
Query: 22 KDGSV-NNLARLMGQFLVMPISWSR----MHLIEIPSSLVGVSSQ-VYRAKAL------- 68
+ G+ + R QF S+ + L +S +R AL
Sbjct: 750 QRGTAYGEMLRFAWQFKSFTASFMQNAIGRELYGRGYDFGSLSQNNTFRNNALIRAMRNG 809
Query: 69 ---VIGIL-------GEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITH------- 111
++GI + ++ G+ P+ + + + A+ G
Sbjct: 810 NGELMGIAQLFLWATAFGYLSMQTKLMLRGQTPR-PADNVSTWTAAMAQGGGLGILGDFL 868
Query: 112 YERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPF 171
+ ++ F ++ P +S AG A + V G KQ A + +N P+
Sbjct: 869 FGEYNRFGNT------PATSLAGPFASDAAQLVNLFGLTKQGDAKAADYFNFAINHT-PY 921
Query: 172 QNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQKYKKQRKR 216
NL R + + N + + ++PG R Y+QR K ++
Sbjct: 922 MNLHVVRPVMDFLILNQMREWMSPGSLQR---YQQRVKEEQGNDF 963
>gi|262043550|ref|ZP_06016663.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
gi|259039084|gb|EEW40242.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
rhinoscleromatis ATCC 13884]
Length = 143
Score = 49.1 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 24/140 (17%), Positives = 49/140 (35%), Gaps = 18/140 (12%)
Query: 80 KTLVPLISGKEPQLDFSDPTEYIKALINGITH-------YERFSPFN-SSGWDVLGPWSS 131
L+ G+ P+ +D ++ A G + + ++GP +S
Sbjct: 1 MQSKLLLKGQTPR--PADAKTFLAAASQGGGLGILGDFMFGEVNRMGAGPVTSLMGPAAS 58
Query: 132 QAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVPFQNLWYARGAFNHFVRNSIDD 191
A + ++ + + PF N+++ R A N + N I D
Sbjct: 59 NADSIITLLQQTTRGDADLGDWYRTALD--------NTPFLNVFWLRTAMNGLILNRIQD 110
Query: 192 VLNPGGRARAEVYRQRQKYK 211
L+PG R + +R++
Sbjct: 111 ALDPGSLERYQRRVEREQGN 130
>gi|190893672|ref|YP_001980214.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652]
gi|190698951|gb|ACE93036.1| hypothetical protein RHECIAT_CH0004107 [Rhizobium etli CIAT 652]
Length = 460
Score = 49.1 bits (115), Expect = 5e-04, Method: Composition-based stats.
Identities = 22/106 (20%), Positives = 48/106 (45%), Gaps = 12/106 (11%)
Query: 5 ARGSVGSTIQDKRWITGKDGSV-NNLARLMGQFLVMPISWSRMHLIEIPSSLVGVSSQVY 63
RG++ +Q G++ R QF P+++ H++ + G++++ Y
Sbjct: 355 IRGAMTGGLQ--------RGTIIGEAVRSATQFKSFPMTYMMTHMMRALTQ--GMANRTY 404
Query: 64 RAKALVIGILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGI 109
R L + + + LI+G++PQ + +DP + ++ I G
Sbjct: 405 RTTQLALTMTIAGAEMSQMQSLIAGRDPQ-NMADPRFWEQSFIRGG 449
>gi|218514216|ref|ZP_03511056.1| hypothetical protein Retl8_11184 [Rhizobium etli 8C-3]
Length = 73
Score = 44.8 bits (104), Expect = 0.008, Method: Composition-based stats.
Identities = 12/49 (24%), Positives = 26/49 (53%)
Query: 161 GKELVNTFVPFQNLWYARGAFNHFVRNSIDDVLNPGGRARAEVYRQRQK 209
+ + + P +LWY + A + + ++I +++P RA + Y +R K
Sbjct: 2 LADHLKAWTPGSSLWYTKIATDRLIFDNIQAMIDPNYRASFDRYERRMK 50
>gi|242783432|ref|XP_002480186.1| GTP cyclohydrolase II, putative [Talaromyces stipitatus ATCC 10500]
gi|218720333|gb|EED19752.1| GTP cyclohydrolase II, putative [Talaromyces stipitatus ATCC 10500]
Length = 451
Score = 40.6 bits (93), Expect = 0.14, Method: Composition-based stats.
Identities = 30/138 (21%), Positives = 49/138 (35%), Gaps = 11/138 (7%)
Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF------SDPTEYIKALINGIT 110
G S +Y A A+ G + P + EP DF SDP + + G
Sbjct: 84 GGSYSIYNALAIAAG-----DLPTDFKPDFNNTEPTFDFPQQPAWSDPKKIVSLDPFGHD 138
Query: 111 HYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVP 170
++F + GWD+ + + +A E EG + G ++ T V
Sbjct: 139 IVKQFKSYLDVGWDLRPSMAITRANMRLAEIEKAVSEGQIEVDGSIVVDKNGDVRVTKVA 198
Query: 171 FQNLWYARGAFNHFVRNS 188
+ +WY G F +
Sbjct: 199 VEPVWYLPGVAERFGVDE 216
>gi|212527336|ref|XP_002143825.1| GTP cyclohydrolase II, putative [Penicillium marneffei ATCC 18224]
gi|210073223|gb|EEA27310.1| GTP cyclohydrolase II, putative [Penicillium marneffei ATCC 18224]
Length = 494
Score = 39.5 bits (90), Expect = 0.38, Method: Composition-based stats.
Identities = 29/138 (21%), Positives = 48/138 (34%), Gaps = 11/138 (7%)
Query: 57 GVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDF------SDPTEYIKALINGIT 110
G S +Y A A+ G + P + EP DF SDP + + G
Sbjct: 127 GGSYSIYNALAIAAG-----DLPTDFKPDFNNTEPTFDFPVQPAWSDPKKIVSLDPFGHD 181
Query: 111 HYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKELVNTFVP 170
+ F + GWD+ + + ++ E EG + G ++ T V
Sbjct: 182 IVKHFKSYLDVGWDLRPSMAITRANMRLSEIEKAVSEGQIEVDGSIVIGKNGDVRVTKVA 241
Query: 171 FQNLWYARGAFNHFVRNS 188
+ +WY G F +
Sbjct: 242 VEPVWYLPGVAERFGVDE 259
>gi|294661369|ref|YP_003573245.1| hypothetical protein Aasi_1895 [Candidatus Amoebophilus asiaticus
5a2]
gi|227336520|gb|ACP21117.1| hypothetical protein Aasi_1895 [Candidatus Amoebophilus asiaticus
5a2]
Length = 585
Score = 38.7 bits (88), Expect = 0.60, Method: Composition-based stats.
Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 2/120 (1%)
Query: 39 MPISWSRMHLIEIPSSLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQLDFSDP 98
PIS S + + VY+ + + EEL K+ L G + + +P
Sbjct: 224 YPISISSRNYATEGNKSEQGVWDVYKKELSIKNYTQEELRTKSFPYLFHGGKLDTTYLNP 283
Query: 99 TEYIKALINGITHYERFSPFNSSGWD--VLGPWSSQAGKLAIAGKEAVWDEGTRKQRGKA 156
T + ++ E F D ++ P KL +E + ++ K
Sbjct: 284 TTFYNLMVRAGFQEEDFKEGKHGFQDKVLVKPIILTKTKLNECHEELRELINSTLKKAKY 343
>gi|310798539|gb|EFQ33432.1| hypothetical protein GLRG_08711 [Glomerella graminicola M1.001]
Length = 1103
Score = 37.9 bits (86), Expect = 0.85, Method: Composition-based stats.
Identities = 21/125 (16%), Positives = 40/125 (32%), Gaps = 9/125 (7%)
Query: 36 FLVMPISWSRMHLIEIPS--SLVGVSSQVYRAKALVIGILGEELIRKTLVPLISGKEPQL 93
F + RM P+ S + +Y G +E + + L+ G P L
Sbjct: 964 FKTQSMVLMRMFYFVEPADGSAAKIQGPIYSPDQAAAGTSNKEFLANFVANLLRGAFPNL 1023
Query: 94 DFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWSSQAGKLAIAGKEAVWDEGTRKQR 153
+ +++ L T Y++F L ++ E E +++R
Sbjct: 1024 QPAQIQTFVEGLFTLNTQYDKFRLNLRDFLISLKEFAGD-------NAELFQVEKEQQER 1076
Query: 154 GKAQA 158
A
Sbjct: 1077 DAKAA 1081
>gi|170048775|ref|XP_001870771.1| bromodomain-containing protein 8 [Culex quinquefasciatus]
gi|167870763|gb|EDS34146.1| bromodomain-containing protein 8 [Culex quinquefasciatus]
Length = 917
Score = 37.9 bits (86), Expect = 0.96, Method: Composition-based stats.
Identities = 21/136 (15%), Positives = 44/136 (32%), Gaps = 9/136 (6%)
Query: 71 GILGEELIRKTLVPLISGKEPQLDFSDPTEYIKALINGITHYERFSPFNSSGWDVLGPWS 130
G+ ++ L++G P ++ + A + P S + P
Sbjct: 243 GMQAVAGRSPSITNLLTGNSPGMNIQGKNLFPTAGSTSTQLQDDIKPIEGSSSYQIAP-- 300
Query: 131 SQAGKLAIAGKEAVWDEGTRKQRGKAQAQFGKEL---VNTFVPFQNLWYARGAFNHFVRN 187
KL ++ V D+ T G Q +++ + P ++L F +
Sbjct: 301 -NLTKLLDTKQQVVDDKPTDSGEGAVQVDKAEDMEIDADNVDPAKDLM---AVFQELMPE 356
Query: 188 SIDDVLNPGGRARAEV 203
+ ++LN E
Sbjct: 357 ELVEILNENNGMILED 372
>gi|291336674|gb|ADD96217.1| hypothetical protein [uncultured organism MedDCM-OCT-S06-C2377]
Length = 333
Score = 36.8 bits (83), Expect = 2.5, Method: Composition-based stats.
Identities = 11/72 (15%), Positives = 23/72 (31%), Gaps = 10/72 (13%)
Query: 27 NNLARLMGQFLVMPISWSRMHL------IEIPSSLVGVSSQVYRAKALVIGILGEELIRK 80
R M QF P ++ + + + + + + LV G +
Sbjct: 263 GEALRFMTQFKAFPFAFYQKMIGRETAAWKDGNKM----NAALSMAQLVGGSALFGYMAM 318
Query: 81 TLVPLISGKEPQ 92
T ++ GK +
Sbjct: 319 TAKDILKGKNLR 330
Database: nr
Posted date: May 13, 2011 4:10 AM
Number of letters in database: 999,999,932
Number of sequences in database: 2,987,209
Database: /data/usr2/db/fasta/nr.01
Posted date: May 13, 2011 4:17 AM
Number of letters in database: 999,998,956
Number of sequences in database: 2,896,973
Database: /data/usr2/db/fasta/nr.02
Posted date: May 13, 2011 4:23 AM
Number of letters in database: 999,999,979
Number of sequences in database: 2,907,862
Database: /data/usr2/db/fasta/nr.03
Posted date: May 13, 2011 4:29 AM
Number of letters in database: 999,999,513
Number of sequences in database: 2,932,190
Database: /data/usr2/db/fasta/nr.04
Posted date: May 13, 2011 4:33 AM
Number of letters in database: 792,586,372
Number of sequences in database: 2,260,650
Lambda K H
0.308 0.118 0.280
Lambda K H
0.267 0.0361 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,150,439,952
Number of Sequences: 13984884
Number of extensions: 110355439
Number of successful extensions: 265952
Number of sequences better than 10.0: 58
Number of HSP's better than 10.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 265819
Number of HSP's gapped (non-prelim): 115
length of query: 218
length of database: 4,792,584,752
effective HSP length: 133
effective length of query: 85
effective length of database: 2,932,595,180
effective search space: 249270590300
effective search space used: 249270590300
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 78 (34.8 bits)