BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 041680
(290 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356510499|ref|XP_003523975.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Glycine
max]
Length = 305
Score = 339 bits (870), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 171/281 (60%), Positives = 208/281 (74%), Gaps = 5/281 (1%)
Query: 5 LLCLILVSEAWPVKCQ-YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPN 63
LL L+ + ++W VK Q + + ILAGQSNMAGRGGV N+T T TWDG+VPPQ +PN
Sbjct: 4 LLLLVFLIQSWAVKAQQVYDRNIFILAGQSNMAGRGGVLNNTGTGIATWDGVVPPQSRPN 63
Query: 64 PSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGG 123
PS+L+L A L WV A EPL ADID KTNGVGPG+ FAN+VL K P+FG+IGLVPCAIGG
Sbjct: 64 PSVLKLDAHLTWVEAREPLDADIDSRKTNGVGPGMAFANSVLEKHPDFGLIGLVPCAIGG 123
Query: 124 TNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
+NIS+W +G LY QMI+RA+ +LR GGTIRA+LWYQGE+DTVNL DA+ Y+ R FF
Sbjct: 124 SNISEWERGKELYFQMIKRAKASLRDGGTIRALLWYQGETDTVNLHDAQSYQRRVHKFFL 183
Query: 184 DLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
D+R DLQSPLLPII+VALASG GP IEIVR+AQL DL N+R VDA GLPL+PDGLHL+T
Sbjct: 184 DVRDDLQSPLLPIIQVALASGSGPHIEIVRQAQLGIDLLNLRTVDAHGLPLQPDGLHLST 243
Query: 244 PAQGSTLNSWSNEALRV----NLSLLVFRILEGSCRISKQA 280
PAQ +N L+ N++ V IL + R+ A
Sbjct: 244 PAQAHLGQMMANAFLQFVPSSNVNYKVSPILNEAIRLYNYA 284
>gi|224137652|ref|XP_002327179.1| predicted protein [Populus trichocarpa]
gi|222835494|gb|EEE73929.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 172/255 (67%), Positives = 197/255 (77%), Gaps = 8/255 (3%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
Q + ILAGQSNMAGRGGV N+T+ +WDGIVP QCQPNPSILRL+A L WV AHEPLH
Sbjct: 23 QNIFILAGQSNMAGRGGVVNNTKNGIPSWDGIVPVQCQPNPSILRLSASLTWVQAHEPLH 82
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADID NKTNGVGPG+ FANA+LTKVPNFG IGLVPCAIGGT+IS+W KG LY+Q+++R
Sbjct: 83 ADIDYNKTNGVGPGMSFANAILTKVPNFGSIGLVPCAIGGTSISEWAKGGFLYDQLVRRT 142
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
Q AL+ GG I A+LWYQGESDT EDA YK R D FF DLR+DL P LPII+VALAS
Sbjct: 143 QFALQRGGVIGAMLWYQGESDTQIREDADAYKGRLDRFFIDLRADLGYPTLPIIQVALAS 202
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ---GSTLNSWSNEALRV 260
GEGP++EIVR AQL +LPNV+CVDA GLPLEPD +HLTTPAQ G TL +A
Sbjct: 203 GEGPYVEIVRNAQLGINLPNVQCVDAKGLPLEPDRVHLTTPAQVQLGQTL----TDAFLQ 258
Query: 261 NLSLLVFRILEGSCR 275
+LS + I SCR
Sbjct: 259 SLSSPI-HIANNSCR 272
>gi|255538182|ref|XP_002510156.1| conserved hypothetical protein [Ricinus communis]
gi|223550857|gb|EEF52343.1| conserved hypothetical protein [Ricinus communis]
Length = 300
Score = 337 bits (865), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 161/246 (65%), Positives = 186/246 (75%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
M + L +L V Q + + ILAGQSNMAGRGGV NDT+T L WDGIVPPQC
Sbjct: 1 MLSLLFMALLAQANISVTSQQLPKNIFILAGQSNMAGRGGVVNDTKTGILRWDGIVPPQC 60
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCA 120
QP PS+ RL+ WVLAHEPLH+DID NKTNG+GPG+ FANAVLTK P GV+GLVPCA
Sbjct: 61 QPEPSVFRLSGDFTWVLAHEPLHSDIDYNKTNGIGPGMAFANAVLTKDPAIGVVGLVPCA 120
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDM 180
IGGT ISQW KG LY+Q++QR +VAL GG +RA+LWYQGESDT+ EDA YK R +
Sbjct: 121 IGGTAISQWEKGGFLYDQLVQRTRVALYSGGVLRAMLWYQGESDTLIEEDADSYKGRLEK 180
Query: 181 FFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLH 240
FFTD+R+DLQ P LPI +VALASGEGP I+ +R+AQ LPNV CVDA GLPLEPD LH
Sbjct: 181 FFTDVRADLQHPFLPIFQVALASGEGPVIDTIREAQKGIKLPNVHCVDAKGLPLEPDRLH 240
Query: 241 LTTPAQ 246
LTTPAQ
Sbjct: 241 LTTPAQ 246
>gi|356518106|ref|XP_003527723.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Glycine
max]
Length = 298
Score = 326 bits (836), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 159/235 (67%), Positives = 188/235 (80%), Gaps = 5/235 (2%)
Query: 13 EAWPVKCQY-QQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTA 71
+AWPVK Q + + ILAGQSNMAGRGGV N+T T WDG+V PQ +PNPS+L+L A
Sbjct: 13 QAWPVKPQQAYDRNIFILAGQSNMAGRGGVVNNTAT----WDGVVSPQSRPNPSVLKLDA 68
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
L WV A EPL ADID KTNGVGPG+ FAN VL K P FG+IGLVPCAIGG+NIS+W +
Sbjct: 69 HLTWVAAREPLDADIDSAKTNGVGPGMAFANWVLEKHPEFGLIGLVPCAIGGSNISEWER 128
Query: 132 GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQS 191
G LY QMI+RA+ +LR GGTIRA+LWYQGE+DTVNL DA+LY+ R FF D+R DL+S
Sbjct: 129 GKELYNQMIKRAKASLRDGGTIRALLWYQGETDTVNLHDAQLYQTRVHKFFLDVRDDLRS 188
Query: 192 PLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
PLLPII+VALASG GP+IE+VR+AQL DL N+R VDA GLPL+PDGLHL+TPAQ
Sbjct: 189 PLLPIIQVALASGSGPYIEMVRQAQLGIDLLNLRTVDAHGLPLQPDGLHLSTPAQ 243
>gi|145339433|ref|NP_190869.3| uncharacterized protein [Arabidopsis thaliana]
gi|110738676|dbj|BAF01263.1| hypothetical protein [Arabidopsis thaliana]
gi|332645504|gb|AEE79025.1| uncharacterized protein [Arabidopsis thaliana]
Length = 297
Score = 319 bits (818), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 149/223 (66%), Positives = 183/223 (82%), Gaps = 5/223 (2%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV NDT TN WDG++PP+C+ NPSILRLT+KL+W A EPLH D
Sbjct: 31 IFILAGQSNMAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSKLEWKEAKEPLHVD 90
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
ID+NKTNGVGPG+PFAN V+ + FG +GLVPC+IGGT +SQW+KG LYE+ ++RA+
Sbjct: 91 IDINKTNGVGPGMPFANRVVNR---FGQVGLVPCSIGGTKLSQWQKGEFLYEETVKRAKA 147
Query: 146 ALR--GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
A+ GGG+ RAVLWYQGESDTV++ DA +YK+R FF+DLR+DLQ P LPII+VALA+
Sbjct: 148 AMASGGGGSYRAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRNDLQHPNLPIIQVALAT 207
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G GP+++ VRKAQL +DL NV CVDA GLPLEPDGLHLTT +Q
Sbjct: 208 GAGPYLDAVRKAQLKTDLENVYCVDARGLPLEPDGLHLTTSSQ 250
>gi|297820028|ref|XP_002877897.1| hypothetical protein ARALYDRAFT_485676 [Arabidopsis lyrata subsp.
lyrata]
gi|297323735|gb|EFH54156.1| hypothetical protein ARALYDRAFT_485676 [Arabidopsis lyrata subsp.
lyrata]
Length = 296
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 147/222 (66%), Positives = 181/222 (81%), Gaps = 4/222 (1%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV NDT TN WDG++PP+C+ NPSILRLTAKL+W A EPLH D
Sbjct: 31 IFILAGQSNMAGRGGVYNDTATNNTVWDGVIPPECRSNPSILRLTAKLEWKEAKEPLHVD 90
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
IDVNKTNG+GPG+ FAN V+T+ FG +GLVPC+IGGT +SQW+KG LYE+ ++R++
Sbjct: 91 IDVNKTNGIGPGMSFANRVITR---FGQVGLVPCSIGGTKLSQWQKGQFLYEETVRRSKA 147
Query: 146 AL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASG 204
A+ GGG+ +AVLWYQGESDTV++ DA +YK+R FF DLR+DL P LPII+VALA+G
Sbjct: 148 AVASGGGSYQAVLWYQGESDTVDMVDASVYKKRLVKFFNDLRNDLHQPNLPIIQVALATG 207
Query: 205 EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
GP+++ VRKAQL +DL NV CVDA GLPLEPDGLHLTT +Q
Sbjct: 208 AGPYLDAVRKAQLKTDLENVYCVDARGLPLEPDGLHLTTSSQ 249
>gi|225458723|ref|XP_002283036.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Vitis
vinifera]
Length = 270
Score = 309 bits (792), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 151/227 (66%), Positives = 182/227 (80%), Gaps = 8/227 (3%)
Query: 20 QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+ + ILAGQSNMAGRGGV N T WDGIVP +CQPNPSILRLTA L WV A
Sbjct: 23 RLHNDNIFILAGQSNMAGRGGVINGT------WDGIVPSECQPNPSILRLTAGLTWVEAR 76
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQM 139
EPLHADID NKT G+GPG+ FANAVL + P FG++GLVPCA+G TNIS+W +G+ LY Q+
Sbjct: 77 EPLHADIDTNKTCGIGPGMAFANAVL-RDPAFGIVGLVPCAVGATNISEWSRGTYLYTQL 135
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
++RA+ +L+ GG IRA+LWYQGESD+ + E AK YK + + F DLR+DL+SP+LP+I+V
Sbjct: 136 VRRAKASLQHGGKIRALLWYQGESDSKSPEYAKSYKGKLEKFILDLRTDLRSPMLPVIQV 195
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
ALASG GPFI+IVR+AQL DLPNV CVDAMGLPLEPDG+HLTTPAQ
Sbjct: 196 ALASG-GPFIKIVREAQLGVDLPNVTCVDAMGLPLEPDGIHLTTPAQ 241
>gi|357465631|ref|XP_003603100.1| hypothetical protein MTR_3g102390 [Medicago truncatula]
gi|355492148|gb|AES73351.1| hypothetical protein MTR_3g102390 [Medicago truncatula]
Length = 267
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 141/212 (66%), Positives = 167/212 (78%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRGGV NDT T TWDG+VP QCQPNPSI++L A LKWV AHEPLH DID KTNGV
Sbjct: 1 MAGRGGVVNDTTTGVTTWDGVVPLQCQPNPSIMKLNANLKWVEAHEPLHEDIDTLKTNGV 60
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIR 154
GPG+ FA VL K G++GLVPCAIGGTNIS+W +G LY M++R + +LR G IR
Sbjct: 61 GPGMAFAKHVLEKNSGLGLVGLVPCAIGGTNISEWERGKVLYNHMMKRVKASLRDDGNIR 120
Query: 155 AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK 214
A+LW+QGE+DTV+L DA+ Y+ R FF D+R DLQSPLLPII+VALASG GP+IEIVR+
Sbjct: 121 ALLWFQGETDTVSLTDAQSYQARVHKFFLDVRDDLQSPLLPIIQVALASGSGPYIEIVRQ 180
Query: 215 AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
AQL DL N++ VDA GLPL+PD LHL+TPAQ
Sbjct: 181 AQLGIDLLNLKTVDAKGLPLQPDRLHLSTPAQ 212
>gi|449508201|ref|XP_004163248.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 276
Score = 296 bits (757), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 146/246 (59%), Positives = 178/246 (72%), Gaps = 5/246 (2%)
Query: 6 LCLILVSEAWPVKCQYQQ----QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ 61
LCL+ ++ + QQ + +LAGQSNMAGRGGVTN T T+ TWDG+VPPQC
Sbjct: 4 LCLLFLTTVAQIPSTSQQPSPPTDIFLLAGQSNMAGRGGVTNSTLTHHPTWDGVVPPQCS 63
Query: 62 PNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPN-FGVIGLVPCA 120
P P ILRL A L WV A EPLHADID KTNG+GPG+PFAN +L P VIGLVPCA
Sbjct: 64 PTPYILRLAADLTWVEAREPLHADIDFLKTNGIGPGMPFANTILMDKPGGRTVIGLVPCA 123
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDM 180
+GGT+I +W+KGS+LY ++ RA ++ GG I+A+LWYQGESDT N ED++LY R
Sbjct: 124 MGGTSIKEWQKGSNLYNHLLSRADASVLSGGKIKALLWYQGESDTENAEDSELYGGRLKK 183
Query: 181 FFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLH 240
FFT +RSDL+ PLLPII+V +ASGEG + E VR+ Q DL NV VDA+GLPLEPDGLH
Sbjct: 184 FFTGIRSDLKIPLLPIIQVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLH 243
Query: 241 LTTPAQ 246
LTT +Q
Sbjct: 244 LTTTSQ 249
>gi|449447271|ref|XP_004141392.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 300
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/246 (59%), Positives = 178/246 (72%), Gaps = 5/246 (2%)
Query: 6 LCLILVSEAWPVKCQYQQ----QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ 61
LCL+ ++ + QQ + +LAGQSNMAGRGGVTN T T+ TWDG+VPPQC
Sbjct: 4 LCLLFLTTVAQIPSTSQQPSPPTDIFLLAGQSNMAGRGGVTNSTLTHHPTWDGVVPPQCS 63
Query: 62 PNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPN-FGVIGLVPCA 120
P P ILRL A L WV A EPLHADID KTNG+GPG+PFAN +L P VIGLVPCA
Sbjct: 64 PTPYILRLAADLTWVEAREPLHADIDFLKTNGIGPGMPFANTILMDKPGGRTVIGLVPCA 123
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDM 180
+GGT+I +W+KGS+LY ++ RA ++ GG I+A+LWYQGESDT N ED++LY R
Sbjct: 124 MGGTSIKEWQKGSNLYNHLLSRADASVLSGGKIKALLWYQGESDTENAEDSELYGGRLKK 183
Query: 181 FFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLH 240
FFT +RSDL+ PLLPII+V +ASGEG + E VR+ Q DL NV VDA+GLPLEPDGLH
Sbjct: 184 FFTGIRSDLKIPLLPIIQVGIASGEGEYKEGVRRGQFGIDLVNVMIVDALGLPLEPDGLH 243
Query: 241 LTTPAQ 246
LTT +Q
Sbjct: 244 LTTTSQ 249
>gi|357470245|ref|XP_003605407.1| hypothetical protein MTR_4g031010 [Medicago truncatula]
gi|355506462|gb|AES87604.1| hypothetical protein MTR_4g031010 [Medicago truncatula]
Length = 292
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 136/213 (63%), Positives = 166/213 (77%), Gaps = 1/213 (0%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
M GRGGV NDT T TWD +VPPQ QPNPSIL+L A L+WV A EPLH DID KTNG+
Sbjct: 1 MGGRGGVVNDTTTGVATWDSVVPPQSQPNPSILKLNAHLEWVEAQEPLHEDIDTLKTNGI 60
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA-LRGGGTI 153
GPG+ FAN VL K FG++GLVPCA GGTNIS+W +G LY+ M++R + + L GG I
Sbjct: 61 GPGMVFANHVLEKNLGFGLVGLVPCATGGTNISEWERGKVLYKNMMKRVKASLLDDGGNI 120
Query: 154 RAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVR 213
+A+LW+QGE+DTV+L DA+ Y+ R FF D+R DLQSPLLPII+VALASG GP+IEIVR
Sbjct: 121 QALLWFQGETDTVSLSDAQSYQTRVHKFFLDVRDDLQSPLLPIIQVALASGSGPYIEIVR 180
Query: 214 KAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+AQL DL N++ VDA GLPL+PDGLHL++ AQ
Sbjct: 181 QAQLGIDLLNLKTVDAKGLPLQPDGLHLSSTAQ 213
>gi|255538184|ref|XP_002510157.1| conserved hypothetical protein [Ricinus communis]
gi|223550858|gb|EEF52344.1| conserved hypothetical protein [Ricinus communis]
Length = 263
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 142/247 (57%), Positives = 180/247 (72%), Gaps = 8/247 (3%)
Query: 2 FAWLLCLILVS-EAWPV-KCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQ 59
F L CLI V ++P+ + ILAGQSNMAGRGGV K W+G VPP+
Sbjct: 4 FCKLFCLIFVLLSSYPILATALFPNDIFILAGQSNMAGRGGV------EKGKWNGNVPPE 57
Query: 60 CQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
C+ NPSILRL+A+LKW +A EPLHADIDV KT GVGPG+ FAN+V GV+GLVPC
Sbjct: 58 CRSNPSILRLSAELKWGVAREPLHADIDVGKTCGVGPGMAFANSVKANDLRIGVVGLVPC 117
Query: 120 AIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
A+GGT ISQW +G+ LY++++ RA +++ GG IRA+LWYQGESDTV +DA+ YK +
Sbjct: 118 AVGGTKISQWARGTRLYQELVSRANESVKYGGNIRAILWYQGESDTVWKKDAEAYKGNFE 177
Query: 180 MFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL 239
F +LRSDL +P LP+I+VA+ASGEG FIE+VR+AQL +PNVRC+DA GLPL+ D L
Sbjct: 178 RFIANLRSDLNTPYLPVIQVAVASGEGQFIEMVRRAQLGIKMPNVRCIDAKGLPLKSDHL 237
Query: 240 HLTTPAQ 246
HLTT +Q
Sbjct: 238 HLTTMSQ 244
>gi|356553982|ref|XP_003545329.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Glycine
max]
Length = 276
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 138/246 (56%), Positives = 177/246 (71%), Gaps = 9/246 (3%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
+ +W LC+++V+ + + + ILAGQSNMAGRGGV WDG VP +C
Sbjct: 6 VLSWFLCVLVVAARGGLGAV--SRDIFILAGQSNMAGRGGVFGGK------WDGDVPEEC 57
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCA 120
+P+P + RL+A L+W A EPLHADIDV KT GVGPG+ FAN V+ G++GLVPCA
Sbjct: 58 RPSPWVFRLSAGLEWEEAREPLHADIDVGKTCGVGPGMAFANEVVKARGAGGLVGLVPCA 117
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDM 180
+GGT I QW +G+ LY++++QRA A+ GGGTIRAVLWYQGESDTV +DA+ YK++ +
Sbjct: 118 VGGTKIGQWSRGTRLYDELVQRAMQAI-GGGTIRAVLWYQGESDTVRKKDAEGYKDKMER 176
Query: 181 FFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLH 240
F DLRSDL P L +I+VALASGEG FIE VR+AQ+ LPNV+CVDA GL L+PD LH
Sbjct: 177 FIMDLRSDLNLPSLLVIQVALASGEGKFIEKVRRAQMGITLPNVKCVDAKGLRLKPDKLH 236
Query: 241 LTTPAQ 246
LTT +Q
Sbjct: 237 LTTMSQ 242
>gi|357437699|ref|XP_003589125.1| hypothetical protein MTR_1g018750 [Medicago truncatula]
gi|355478173|gb|AES59376.1| hypothetical protein MTR_1g018750 [Medicago truncatula]
Length = 268
Score = 273 bits (699), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 139/247 (56%), Positives = 174/247 (70%), Gaps = 11/247 (4%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
+++ LC+++V+ C + + ILAGQSNMAGRGGV N WDG +PP+C
Sbjct: 7 IWSMFLCVLVVTP----HCGKATKDIFILAGQSNMAGRGGVLNGK------WDGNIPPEC 56
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCA 120
+PNPSIL+L KLKW AHEPLHADIDV KT G+GPGL FAN V+ V+GLVPCA
Sbjct: 57 KPNPSILKLNTKLKWEEAHEPLHADIDVGKTCGIGPGLAFANEVVRMSGGECVVGLVPCA 116
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
+GGT I +WR GS LY ++++R+ +++ G G IRAVLWYQGESDTV EDA+ YK R +
Sbjct: 117 VGGTRIEEWRNGSHLYNELVRRSIESVKDGDGVIRAVLWYQGESDTVREEDAERYKYRME 176
Query: 180 MFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL 239
+LR DLQ P L +I+VALASGEG FIE VR AQL LPNV+CVDA GL L+ D L
Sbjct: 177 NLIENLRLDLQLPSLLVIQVALASGEGKFIEKVRHAQLGIKLPNVKCVDAKGLHLKTDKL 236
Query: 240 HLTTPAQ 246
HLTT ++
Sbjct: 237 HLTTMSE 243
>gi|225458721|ref|XP_002283028.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Vitis
vinifera]
Length = 270
Score = 270 bits (689), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 132/223 (59%), Positives = 159/223 (71%), Gaps = 6/223 (2%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + ILAGQSNMAGRGGV + WDG VPP+C+PNPSILRL +L+W AHEPLH
Sbjct: 37 KDIFILAGQSNMAGRGGVRHGK------WDGNVPPECRPNPSILRLNPQLQWEEAHEPLH 90
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
I KT GVGPGL FAN + K GV+GLVPCA+GGT IS W +G++LY ++++R
Sbjct: 91 TGIGPPKTQGVGPGLAFANEIRAKGSMVGVVGLVPCAVGGTKISAWARGTTLYNELVRRT 150
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ ++ GGG +RA+LWYQGESDTV EDA+ YK + DLRSDL P L I+VAL S
Sbjct: 151 KASVSGGGQLRAILWYQGESDTVRSEDAEAYKGNLEKLIIDLRSDLSHPTLLFIQVALGS 210
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
GEG FIE VR+ QL LPNV+CVDA GL LEPD LHLTT AQ
Sbjct: 211 GEGKFIETVRRGQLGIRLPNVKCVDAKGLRLEPDKLHLTTIAQ 253
>gi|297798488|ref|XP_002867128.1| hydrolase [Arabidopsis lyrata subsp. lyrata]
gi|297312964|gb|EFH43387.1| hydrolase [Arabidopsis lyrata subsp. lyrata]
Length = 262
Score = 266 bits (679), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 130/223 (58%), Positives = 161/223 (72%), Gaps = 2/223 (0%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
Q+ IL+GQSNMAGRGGV D N+ WD IVPP+C PN SILRL+A L+W AHEPLH
Sbjct: 25 QIFILSGQSNMAGRGGVVKDHHHNRWVWDKIVPPECAPNSSILRLSADLRWEEAHEPLHV 84
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVP-NFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
DID K G+GPG+PFANAV ++ + VIGLVPCA GGT I QW +G+ LYE+M++R
Sbjct: 85 DIDTGKVCGIGPGMPFANAVKNRLKTDSAVIGLVPCAAGGTAIKQWERGTHLYERMVKRT 144
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ + + GG I+AVLWYQGESD +++ DA+ Y D +LR DL P LPII+VA+AS
Sbjct: 145 EESRKCGGEIKAVLWYQGESDVLDIHDAESYGSNMDRLIKNLRHDLNLPSLPIIQVAIAS 204
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G G +I+ VR+AQL L NV CVDA GLPL+ D LHLTT AQ
Sbjct: 205 G-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQ 246
>gi|225428900|ref|XP_002282529.1| PREDICTED: receptor protein kinase-like protein At4g34220-like
[Vitis vinifera]
Length = 1004
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/223 (57%), Positives = 161/223 (72%), Gaps = 8/223 (3%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+Q+ IL+GQSNMAGRGGV + WDG+VPP+C P+ SILRL A+L W A EPLH
Sbjct: 768 KQIFILSGQSNMAGRGGVNGHHK-----WDGVVPPECSPDSSILRLNAQLHWESAREPLH 822
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADID K GVGPG+ FANAV +V GV+GLVPCA+GGT I +W +G LYE M+ RA
Sbjct: 823 ADIDTKKACGVGPGMSFANAVRKRV---GVLGLVPCAVGGTAIKEWARGQPLYENMVNRA 879
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ +++ GG I+A+LWYQGESDT + DAK YK+ + ++R DL SP LPII+VA+AS
Sbjct: 880 KESVKSGGEIKALLWYQGESDTSSYNDAKSYKDNMESLIQNVRQDLGSPSLPIIQVAIAS 939
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G+ ++E VR+AQ D PNV CVDA GLPL+ D LHLTT AQ
Sbjct: 940 GDSKYMERVREAQKEIDFPNVVCVDAKGLPLKEDHLHLTTEAQ 982
>gi|21594009|gb|AAM65927.1| unknown [Arabidopsis thaliana]
Length = 260
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 130/223 (58%), Positives = 160/223 (71%), Gaps = 2/223 (0%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
Q+ IL+GQSNMAGRGGV D N+ WD I+PP+C PN SILRL+A L+W AHEPLH
Sbjct: 23 QIFILSGQSNMAGRGGVVKDHHHNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPLHV 82
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVP-NFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
DID K GVGPG+ FANAV +V + VIGLVPCA GGT I +W +GS LYE+M++R
Sbjct: 83 DIDTGKVCGVGPGMAFANAVKNRVETDSAVIGLVPCASGGTAIKEWERGSHLYERMVKRT 142
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ + + GG I+AVLWYQGESD +++ DA+ Y D +LR DL P LPII+VA+AS
Sbjct: 143 EESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAIAS 202
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G G +I+ VR+AQL L NV CVDA GLPL+ D LHLTT AQ
Sbjct: 203 G-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQ 244
>gi|18418402|ref|NP_567960.1| uncharacterized protein [Arabidopsis thaliana]
gi|30689964|ref|NP_849493.1| uncharacterized protein [Arabidopsis thaliana]
gi|109940187|sp|Q8L9J9.2|CAES_ARATH RecName: Full=Probable carbohydrate esterase At4g34215
gi|332660941|gb|AEE86341.1| uncharacterized protein [Arabidopsis thaliana]
gi|332660942|gb|AEE86342.1| uncharacterized protein [Arabidopsis thaliana]
Length = 260
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 129/224 (57%), Positives = 160/224 (71%), Gaps = 2/224 (0%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
Q+ IL+GQSNMAGRGGV D N+ WD I+PP+C PN SILRL+A L+W AHEPLH
Sbjct: 22 NQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPLH 81
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVP-NFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID K GVGPG+ FANAV ++ + VIGLVPCA GGT I +W +GS LYE+M++R
Sbjct: 82 VDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVKR 141
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
+ + + GG I+AVLWYQGESD +++ DA+ Y D +LR DL P LPII+VA+A
Sbjct: 142 TEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAIA 201
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG G +I+ VR+AQL L NV CVDA GLPL+ D LHLTT AQ
Sbjct: 202 SG-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQ 244
>gi|75766300|pdb|2APJ|A Chain A, X-Ray Structure Of Protein From Arabidopsis Thaliana
At4g34215 At 1.6 Angstrom Resolution
gi|75766301|pdb|2APJ|B Chain B, X-Ray Structure Of Protein From Arabidopsis Thaliana
At4g34215 At 1.6 Angstrom Resolution
gi|75766302|pdb|2APJ|C Chain C, X-Ray Structure Of Protein From Arabidopsis Thaliana
At4g34215 At 1.6 Angstrom Resolution
gi|75766303|pdb|2APJ|D Chain D, X-Ray Structure Of Protein From Arabidopsis Thaliana
At4g34215 At 1.6 Angstrom Resolution
Length = 260
Score = 260 bits (664), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 128/224 (57%), Positives = 159/224 (70%), Gaps = 2/224 (0%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
Q+ IL+GQ NMAGRGGV D N+ WD I+PP+C PN SILRL+A L+W AHEPLH
Sbjct: 22 NQIFILSGQXNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPLH 81
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVP-NFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID K GVGPG+ FANAV ++ + VIGLVPCA GGT I +W +GS LYE+M++R
Sbjct: 82 VDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVKR 141
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
+ + + GG I+AVLWYQGESD +++ DA+ Y D +LR DL P LPII+VA+A
Sbjct: 142 TEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVAIA 201
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG G +I+ VR+AQL L NV CVDA GLPL+ D LHLTT AQ
Sbjct: 202 SG-GGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQ 244
>gi|302142266|emb|CBI19469.3| unnamed protein product [Vitis vinifera]
Length = 223
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 125/212 (58%), Positives = 150/212 (70%), Gaps = 6/212 (2%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRGGV + WDG VPP+C+PNPSILRL +L+W AHEPLH I KT GV
Sbjct: 1 MAGRGGVRHGK------WDGNVPPECRPNPSILRLNPQLQWEEAHEPLHTGIGPPKTQGV 54
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIR 154
GPGL FAN + K GV+GLVPCA+GGT IS W +G++LY ++++R + ++ GGG +R
Sbjct: 55 GPGLAFANEIRAKGSMVGVVGLVPCAVGGTKISAWARGTTLYNELVRRTKASVSGGGQLR 114
Query: 155 AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK 214
A+LWYQGESDTV EDA+ YK + DLRSDL P L I+VAL SGEG FIE VR+
Sbjct: 115 AILWYQGESDTVRSEDAEAYKGNLEKLIIDLRSDLSHPTLLFIQVALGSGEGKFIETVRR 174
Query: 215 AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
QL LPNV+CVDA GL LEPD LHLTT AQ
Sbjct: 175 GQLGIRLPNVKCVDAKGLRLEPDKLHLTTIAQ 206
>gi|224060568|ref|XP_002300236.1| predicted protein [Populus trichocarpa]
gi|222847494|gb|EEE85041.1| predicted protein [Populus trichocarpa]
Length = 235
Score = 253 bits (647), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 120/224 (53%), Positives = 158/224 (70%), Gaps = 3/224 (1%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRT-NKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL 82
+Q+ IL+GQSNMAGRGGV D N WD +VPP+CQP+ I R +AKL W AHEPL
Sbjct: 1 KQIFILSGQSNMAGRGGVCKDHHHHNHQYWDKLVPPECQPHQDIFRFSAKLHWEQAHEPL 60
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
HADID K GVGPG+ FAN V K+ V+GLVPCA+GGT I++W +G LYE M++R
Sbjct: 61 HADIDSKKVCGVGPGMSFANMVREKMRV--VVGLVPCAVGGTAITRWGRGEVLYENMVKR 118
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
A+ ++ GG I+ +LWYQGESDT ++ DA++Y+ + ++R DL P LPI+ +
Sbjct: 119 AKESVEDGGEIKGLLWYQGESDTSDIHDAEVYQGNMEKLIENVREDLGLPSLPIVMATIT 178
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG+G +++ VR+AQL +LPNV CVDAMGL L+ D LHLTT AQ
Sbjct: 179 SGDGKYVDKVREAQLRINLPNVVCVDAMGLDLKDDHLHLTTEAQ 222
>gi|30102980|gb|AAP21393.1| unknown protein [Oryza sativa Japonica Group]
gi|108712200|gb|ABF99995.1| expressed protein [Oryza sativa Japonica Group]
gi|125588704|gb|EAZ29368.1| hypothetical protein OsJ_13438 [Oryza sativa Japonica Group]
Length = 259
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 127/221 (57%), Positives = 154/221 (69%), Gaps = 7/221 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ IL GQSNMAGRGGV WDG+VPP+C PNPSILRL+ +L+W AHEPLH
Sbjct: 27 VFILGGQSNMAGRGGVVGSH------WDGMVPPECAPNPSILRLSPQLRWEEAHEPLHNG 80
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
ID N+T GVGPG+ FANA+L + F VIGLVPCA+GGT ++ W KG+ LY +++R++V
Sbjct: 81 IDSNRTCGVGPGMSFANALL-RSGQFPVIGLVPCAVGGTRMADWAKGTDLYSDLVRRSRV 139
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE 205
AL GG I AVLWYQGESDTV DA Y R M +LR+DL P L +I+V LASG
Sbjct: 140 ALETGGRIGAVLWYQGESDTVRWADANEYARRMAMLVRNLRADLAMPHLLLIQVGLASGL 199
Query: 206 GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G + E+VR+AQ L NVR VDA GLPLE LHL+T AQ
Sbjct: 200 GQYTEVVREAQKGIKLRNVRFVDAKGLPLEDGHLHLSTQAQ 240
>gi|115456711|ref|NP_001051956.1| Os03g0857500 [Oryza sativa Japonica Group]
gi|113550427|dbj|BAF13870.1| Os03g0857500, partial [Oryza sativa Japonica Group]
Length = 252
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 127/221 (57%), Positives = 154/221 (69%), Gaps = 7/221 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ IL GQSNMAGRGGV WDG+VPP+C PNPSILRL+ +L+W AHEPLH
Sbjct: 20 VFILGGQSNMAGRGGVVGSH------WDGMVPPECAPNPSILRLSPQLRWEEAHEPLHNG 73
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
ID N+T GVGPG+ FANA+L + F VIGLVPCA+GGT ++ W KG+ LY +++R++V
Sbjct: 74 IDSNRTCGVGPGMSFANALL-RSGQFPVIGLVPCAVGGTRMADWAKGTDLYSDLVRRSRV 132
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE 205
AL GG I AVLWYQGESDTV DA Y R M +LR+DL P L +I+V LASG
Sbjct: 133 ALETGGRIGAVLWYQGESDTVRWADANEYARRMAMLVRNLRADLAMPHLLLIQVGLASGL 192
Query: 206 GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G + E+VR+AQ L NVR VDA GLPLE LHL+T AQ
Sbjct: 193 GQYTEVVREAQKGIKLRNVRFVDAKGLPLEDGHLHLSTQAQ 233
>gi|356574280|ref|XP_003555277.1| PREDICTED: receptor protein kinase-like protein At4g34220-like
[Glycine max]
Length = 1118
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 123/227 (54%), Positives = 160/227 (70%), Gaps = 5/227 (2%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL 82
++Q+ IL+GQSNMAGRGGV D N+ WDG+VPP+ + +PSILRL+A L+W A+EPL
Sbjct: 877 KRQIFILSGQSNMAGRGGVIRDA-NNRKRWDGVVPPESRSDPSILRLSATLQWEPANEPL 935
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
H DID K GVGPG+ FANA+L + G +GLVPCA+GGT + +W +G LYE M++R
Sbjct: 936 HVDIDSRKACGVGPGMVFANALLRRRVVVGELGLVPCAVGGTAMKEWARGEELYENMVKR 995
Query: 143 AQVALR---GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
A+ +++ I+AVLW+QGESD +N EDA YK + ++R DL P LPII+V
Sbjct: 996 AKESVKERENSSEIKAVLWFQGESDAINEEDAAAYKVNMETLIHNVRQDLNLPSLPIIQV 1055
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
ALASG +IE VR+AQ + DLPNV CVDA GL L D LHLTT +Q
Sbjct: 1056 ALASGSD-YIEKVREAQKAIDLPNVICVDAKGLQLMEDNLHLTTESQ 1101
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 50/85 (58%), Gaps = 12/85 (14%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
M L L ++ A PV CQ SNMAG+GG N+ WDG+VPP+
Sbjct: 758 MKEALQILDKIAGAAPVNCQ------------SNMAGQGGGGIRDANNRKRWDGVVPPES 805
Query: 61 QPNPSILRLTAKLKWVLAHEPLHAD 85
+P+PSILRL+A L+W LA+EPLH D
Sbjct: 806 RPDPSILRLSATLQWELANEPLHVD 830
>gi|449438359|ref|XP_004136956.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 260
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 120/223 (53%), Positives = 155/223 (69%), Gaps = 8/223 (3%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+Q+ IL+GQSNMAGRGGV R WDG+VPP+ P+PSI RL+AK W A EPLH
Sbjct: 18 KQIFILSGQSNMAGRGGVLKKLRR----WDGVVPPEAHPHPSIFRLSAKKHWEAACEPLH 73
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADID KT GVGPG+ FAN V +V G + LVPCA+GGT I +W +G LYE+M++RA
Sbjct: 74 ADIDTKKTCGVGPGMVFANGVRERV---GTVALVPCAVGGTAIREWARGEKLYEEMVKRA 130
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ +++GGG I+A+LW+QGESDT DA Y+ + ++R DL P LPII+VALAS
Sbjct: 131 RDSVKGGGEIKAILWFQGESDTSTEHDADAYQGNMEALVANVRRDLALPSLPIIQVALAS 190
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G + + VR+AQL + N+ CVDAMGL L+ D LHLTT +Q
Sbjct: 191 GL-KYTDKVREAQLGMKMENLVCVDAMGLELQEDNLHLTTHSQ 232
>gi|326507094|dbj|BAJ95624.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 238
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 123/224 (54%), Positives = 154/224 (68%), Gaps = 9/224 (4%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAG GGV ++ WDG+VPP+C P+PSILRL+A L W AHEPLHA
Sbjct: 2 RIFLLSGQSNMAGHGGV------HQRRWDGVVPPECAPDPSILRLSASLAWEEAHEPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV--PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID KT GVGPG+ FA A+L ++ P +GLVPCA+GGT I +W +G LYEQM++R
Sbjct: 56 DIDTTKTCGVGPGMAFARAILPELQPPGTAGVGLVPCAVGGTAIREWARGEHLYEQMVRR 115
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
A+ A G I AVLWYQGESD + + Y+ + ++R+DL P LP I+VALA
Sbjct: 116 ARAATECG-EIEAVLWYQGESDAESDAETAAYQGNVERLIANIRADLGMPHLPFIQVALA 174
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG IE VR+AQLS +L NV VDAMGLPL D LHLTT AQ
Sbjct: 175 SGNKRNIEKVREAQLSINLLNVVTVDAMGLPLNEDNLHLTTEAQ 218
>gi|449519880|ref|XP_004166962.1| PREDICTED: LOW QUALITY PROTEIN: probable carbohydrate esterase
At4g34215-like [Cucumis sativus]
Length = 260
Score = 237 bits (605), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 119/223 (53%), Positives = 151/223 (67%), Gaps = 8/223 (3%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+Q+ IL+GQSNMAGRGGV R WDG+VPP+ P+PSI RL+AK W A EPLH
Sbjct: 18 KQIFILSGQSNMAGRGGVLKKLRR----WDGVVPPEAHPHPSIFRLSAKKHWEAACEPLH 73
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADID KT GVGPG+ FAN V +V G + LVPCA+GGT I +W +G LYE+M++R
Sbjct: 74 ADIDTKKTCGVGPGMVFANGVRERV---GTVALVPCAVGGTAIREWARGEKLYEEMVKRX 130
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+ GGG I+A+LW+QGESDT DA Y+ + ++R DL P LPII+VALAS
Sbjct: 131 ERQREGGGEIKAILWFQGESDTSTEHDADAYQGNMEALVANVRRDLALPSLPIIQVALAS 190
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G + + VR+AQL + N+ CVDAMGL L+ D LHLTT +Q
Sbjct: 191 GL-KYTDKVREAQLGMKMENLVCVDAMGLELQEDNLHLTTHSQ 232
>gi|218194149|gb|EEC76576.1| hypothetical protein OsI_14408 [Oryza sativa Indica Group]
Length = 224
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/212 (57%), Positives = 147/212 (69%), Gaps = 7/212 (3%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRGGV WDG+VPP+C PNPSILRL+ +L+W AHEPLH ID N+T GV
Sbjct: 1 MAGRGGVVGSH------WDGMVPPECAPNPSILRLSPQLRWEEAHEPLHNGIDSNRTCGV 54
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIR 154
GPG+ FANA+L + F VIGLVPCA+GGT ++ W KG+ LY +++R++VAL GG I
Sbjct: 55 GPGMSFANALL-RSGQFPVIGLVPCAVGGTRMADWAKGTDLYSDLVRRSRVALETGGRIG 113
Query: 155 AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK 214
AVLWYQGESDTV DA Y R M +LR+DL P L +I+V LASG G + E+VR+
Sbjct: 114 AVLWYQGESDTVRWADANEYARRMAMLVRNLRADLAMPHLLLIQVGLASGLGQYTEVVRE 173
Query: 215 AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
AQ L NVR VDA GLPLE LHL+T AQ
Sbjct: 174 AQKGIKLRNVRFVDAKGLPLEDGHLHLSTQAQ 205
>gi|326526507|dbj|BAJ97270.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 265
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 121/222 (54%), Positives = 146/222 (65%), Gaps = 7/222 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV+ WDG+VPP C P+ S+LR + L+W A EPLH
Sbjct: 31 IFILAGQSNMAGRGGVSGTH------WDGVVPPDCAPSASVLRFSPSLRWEQAREPLHQG 84
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGV-IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQ 144
ID N+T GVGPG+ FANA+L G + LVPCA+GGT +++W KGS LY M++RA+
Sbjct: 85 IDGNRTCGVGPGMSFANALLRSGGARGAAVALVPCAVGGTRMAEWAKGSELYADMVRRAR 144
Query: 145 VALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASG 204
VA+ GG I AVLWYQGESDTV DA Y R DLR DL P L +I+V LASG
Sbjct: 145 VAVETGGRIGAVLWYQGESDTVRWADASEYARRMGALVRDLRQDLAMPHLLLIQVGLASG 204
Query: 205 EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G + E+VR+AQ L NVR VDAMGLP + LHL T AQ
Sbjct: 205 LGQYTEVVREAQKGLKLRNVRFVDAMGLPFQDGHLHLNTQAQ 246
>gi|242075338|ref|XP_002447605.1| hypothetical protein SORBIDRAFT_06g006110 [Sorghum bicolor]
gi|241938788|gb|EES11933.1| hypothetical protein SORBIDRAFT_06g006110 [Sorghum bicolor]
Length = 243
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 124/229 (54%), Positives = 152/229 (66%), Gaps = 14/229 (6%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV ++ WDG+VPP C P+PSILRL+A L+W A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGV------HRRHWDGVVPPDCAPDPSILRLSAALQWEEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV----PNFGV---IGLVPCAIGGTNISQWRKGSSLYE 137
DID KT G+GPG+ FA AVL ++ P G IGLVPCA+GGT I +W +G LYE
Sbjct: 56 DIDTTKTCGIGPGMAFARAVLPRLQEDTPGAGTRTGIGLVPCAVGGTAIREWSRGEHLYE 115
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
QM+ RA+VA G G I AVLWYQGESD + D Y E + ++R+DL P LP I
Sbjct: 116 QMVCRARVAA-GYGEIEAVLWYQGESDAESDADTGAYLENVERLIGNVRADLGMPQLPFI 174
Query: 198 RVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+VALASG IE VR AQ S +LPNV VD MG+ L D LHL T +Q
Sbjct: 175 QVALASGNKRNIEKVRNAQFSVNLPNVVTVDPMGMALNEDNLHLATESQ 223
>gi|326511549|dbj|BAJ91919.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/226 (51%), Positives = 153/226 (67%), Gaps = 10/226 (4%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNM GRGG T + R WDG+VP +C P+P LRL+ L+W A EPLH
Sbjct: 55 VFILAGQSNMGGRGGATLNNR-----WDGVVPRECAPSPRTLRLSPSLRWEEAREPLHEG 109
Query: 86 IDVNKTNGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
IDV GVGPG+PFA+A+L P V+GLVPCA GGT I+ W +GS LY++M+ RA
Sbjct: 110 IDVGNVLGVGPGMPFAHALLRAPACPKGAVVGLVPCAQGGTPIANWSRGSDLYDRMVTRA 169
Query: 144 QVAL---RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
+ A+ +G G I A+LW+QGE+DT+ EDA Y R + D+R DL P L +I+V
Sbjct: 170 RAAVAGTKGKGRIAAMLWFQGETDTIRREDALAYTARMEALIRDVRRDLGIPNLLVIQVG 229
Query: 201 LASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+G+G F+++VRKAQ + PN+R VDAMGLP+ D HLTTPAQ
Sbjct: 230 IATGQGKFVDLVRKAQRAVRAPNLRYVDAMGLPVANDFTHLTTPAQ 275
>gi|326503556|dbj|BAJ86284.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 279
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/251 (49%), Positives = 161/251 (64%), Gaps = 10/251 (3%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
M A L L+L S A + ILAGQSNM GRGG T + R WDG+VP +C
Sbjct: 18 MRALPLVLLLASTAVTASAARTPTLVFILAGQSNMGGRGGATLNNR-----WDGVVPREC 72
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTK--VPNFGVIGLVP 118
P+P LRL+ L+W A EPLH IDV GVGPG+PFA+A+L P V+GLVP
Sbjct: 73 APSPRTLRLSPSLRWEEAREPLHEGIDVGNVLGVGPGMPFAHALLRSPACPKGAVVGLVP 132
Query: 119 CAIGGTNISQWRKGSSLYEQMIQRAQVAL---RGGGTIRAVLWYQGESDTVNLEDAKLYK 175
CA GGT I+ W +GS LY++M+ RA+ A+ +G G I A+LW+QGE+DT+ EDA Y
Sbjct: 133 CAQGGTPIANWSRGSDLYDRMVTRARAAVAGTKGKGRIAAMLWFQGETDTIRREDALAYT 192
Query: 176 ERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLE 235
R + D+R DL P L +I+V +A+G+G F+++VRKAQ + PN+R VDAMGLP+
Sbjct: 193 ARMEALIRDVRRDLGIPNLLVIQVGIATGQGKFVDLVRKAQRAVRAPNLRYVDAMGLPVA 252
Query: 236 PDGLHLTTPAQ 246
D HLTTPAQ
Sbjct: 253 NDFTHLTTPAQ 263
>gi|242037335|ref|XP_002466062.1| hypothetical protein SORBIDRAFT_01g000530 [Sorghum bicolor]
gi|241919916|gb|EER93060.1| hypothetical protein SORBIDRAFT_01g000530 [Sorghum bicolor]
Length = 278
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/237 (49%), Positives = 156/237 (65%), Gaps = 22/237 (9%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +LAGQSNM GRGG TN T WDG+VPP C P+P ILRL+ L+W A EPLHA
Sbjct: 32 VFLLAGQSNMGGRGGATNGT------WDGVVPPDCAPSPRILRLSPSLRWEEAREPLHAG 85
Query: 86 IDVNKTNGVGPGLPFANAVLTK---VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
ID++ GVGPG+PFA+A+L + VP V+GLVPCA G T I+ W +G+ LY++M++R
Sbjct: 86 IDLHNVLGVGPGMPFAHALLRRHGRVPPHAVVGLVPCAQGATPIASWSRGTPLYDRMLKR 145
Query: 143 AQVALR-------------GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
A+ AL G + A+LWYQGE+DT+ +DA +Y R + F D+R DL
Sbjct: 146 ARAALANNNNNNNNNNNNAGSSRLAALLWYQGEADTIRRQDADVYTSRMEAFVRDVRRDL 205
Query: 190 QSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
P L +I+V LA+G+G F++IVR+AQ L NV+ VDA GLP+ D HLTTPAQ
Sbjct: 206 GMPDLLVIQVGLATGQGKFVDIVREAQRRVSLHNVKYVDAKGLPVASDYTHLTTPAQ 262
>gi|7529725|emb|CAB86905.1| putative protein [Arabidopsis thaliana]
Length = 169
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 108/167 (64%), Positives = 135/167 (80%), Gaps = 5/167 (2%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRGGV NDT TN WDG++PP+C+ NPSILRLT+KL+W A EPLH DID+NKTNGV
Sbjct: 1 MAGRGGVYNDTATNTTVWDGVIPPECRSNPSILRLTSKLEWKEAKEPLHVDIDINKTNGV 60
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALR--GGGT 152
GPG+PFAN V+ + FG +GLVPC+IGGT +SQW+KG LYE+ ++RA+ A+ GGG+
Sbjct: 61 GPGMPFANRVVNR---FGQVGLVPCSIGGTKLSQWQKGEFLYEETVKRAKAAMASGGGGS 117
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RAVLWYQGESDTV++ DA +YK+R FF+DLR+DLQ P LPII+V
Sbjct: 118 YRAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRNDLQHPNLPIIQV 164
>gi|194702336|gb|ACF85252.1| unknown [Zea mays]
gi|195648735|gb|ACG43835.1| receptor protein kinase-like protein [Zea mays]
gi|224033897|gb|ACN36024.1| unknown [Zea mays]
gi|413932369|gb|AFW66920.1| Receptor protein kinase-like protein [Zea mays]
Length = 265
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 121/223 (54%), Positives = 152/223 (68%), Gaps = 8/223 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV + WDG+VP C P+P++LRL+ L+W A EPLHA
Sbjct: 30 VFILAGQSNMAGRGGVVANR------WDGVVPGDCAPSPAVLRLSPDLRWEEAREPLHAG 83
Query: 86 IDV-NKTNGVGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ID N GVGPG+ FANA+L G V+GLVPCA+GGT +++W +G+ LY +M++RA
Sbjct: 84 IDAANHAVGVGPGMAFANALLRSGRAGGAVVGLVPCAVGGTRMAEWGRGTELYAEMLRRA 143
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+VA+ GG I A+LWYQGESDTV DA Y R M DLR+DL P L +I+V LAS
Sbjct: 144 RVAVETGGRIGALLWYQGESDTVRWSDATEYGRRMGMLVRDLRADLGIPHLLVIQVGLAS 203
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G G + ++VR AQ L NVR VDAMGLPL+ LHL+T AQ
Sbjct: 204 GLGQYTQVVRDAQKGIKLRNVRFVDAMGLPLQDGHLHLSTQAQ 246
>gi|255555299|ref|XP_002518686.1| conserved hypothetical protein [Ricinus communis]
gi|223542067|gb|EEF43611.1| conserved hypothetical protein [Ricinus communis]
Length = 265
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 112/228 (49%), Positives = 151/228 (66%), Gaps = 5/228 (2%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+++ +L+GQSNMAGRGGV + WDGIVP +C+P+ ILRLTA L+WV A EPLH
Sbjct: 12 KRIFLLSGQSNMAGRGGVNKHPHQHHKHWDGIVPQECKPHQDILRLTANLRWVTAQEPLH 71
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLV-----PCAIGGTNISQWRKGSSLYEQ 138
ADID K GVGPG+ FAN+V + G G PCA+GGT I +W +G LY+
Sbjct: 72 ADIDSKKVCGVGPGMSFANSVRDQGHAGGDGGGEVVGLVPCAVGGTAIKEWGRGEKLYDM 131
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
M++RA+ +++ GG I +LWYQGESDT DA Y+ + ++R DL P LPI++
Sbjct: 132 MVKRAKESVKDGGEIECLLWYQGESDTYTEHDADAYQGNMEKLVANVREDLGLPSLPIVQ 191
Query: 199 VALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
VA+ SG+ ++E VR+AQL ++ NV CVDA GL L+ D LHLTT +Q
Sbjct: 192 VAITSGDEKYLEKVREAQLKMNISNVVCVDAKGLQLKDDNLHLTTHSQ 239
>gi|242032175|ref|XP_002463482.1| hypothetical protein SORBIDRAFT_01g000550 [Sorghum bicolor]
gi|241917336|gb|EER90480.1| hypothetical protein SORBIDRAFT_01g000550 [Sorghum bicolor]
Length = 269
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 121/223 (54%), Positives = 152/223 (68%), Gaps = 8/223 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV + WDG+VP C P+P++LRL+ L+W A EPLHA
Sbjct: 34 IFILAGQSNMAGRGGVVANR------WDGVVPGDCAPSPAVLRLSPDLRWEEAREPLHAG 87
Query: 86 IDVNKTN-GVGPGLPFANAVL-TKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ID + GVGPG+ FANA+L + V+GLVPCA+GGT ++QW KG+ LY +M++RA
Sbjct: 88 IDADHHAVGVGPGMAFANALLRSGHAGSPVVGLVPCAVGGTRMAQWGKGTDLYAEMLRRA 147
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
+VA+ GG I A+LWYQGESDTV DA Y R M DLR+DL P L +I+V LAS
Sbjct: 148 RVAVETGGRIGALLWYQGESDTVRWSDATEYGRRMAMLVRDLRADLGIPHLLVIQVGLAS 207
Query: 204 GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G G + ++VR AQ L NVR VDAMGLPL+ LHL+T AQ
Sbjct: 208 GLGQYTQVVRDAQKGIKLRNVRFVDAMGLPLQDGHLHLSTQAQ 250
>gi|116311023|emb|CAH67955.1| H0117D06-OSIGBa0088B06.7 [Oryza sativa Indica Group]
gi|125547553|gb|EAY93375.1| hypothetical protein OsI_15173 [Oryza sativa Indica Group]
Length = 237
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 124/224 (55%), Positives = 149/224 (66%), Gaps = 10/224 (4%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P PS+LRLTA L WV A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGV------HHRRWDGVVPPECAPCPSVLRLTAALDWVEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV--PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID KT GVGPG+ FA AVL ++ P GV GLVPCA+GGT I +W +G LY+QM++R
Sbjct: 56 DIDTAKTCGVGPGMAFARAVLPRLDPPGSGV-GLVPCAVGGTAIREWARGERLYDQMVRR 114
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
A+ A G I AVLWYQGESD + Y + ++R DL P LP I+VALA
Sbjct: 115 ARAAAECG-EIEAVLWYQGESDAESDAATAAYAGNLETLIANVREDLGMPQLPFIQVALA 173
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG IE VRKAQL +LPNV VDA GL L D LHLTT +Q
Sbjct: 174 SGNKKNIEKVRKAQLGINLPNVVTVDAFGLSLNEDHLHLTTESQ 217
>gi|195657565|gb|ACG48250.1| receptor protein kinase-like protein [Zea mays]
gi|224032835|gb|ACN35493.1| unknown [Zea mays]
gi|414587837|tpg|DAA38408.1| TPA: Receptor protein kinase-like protein [Zea mays]
Length = 241
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/244 (50%), Positives = 154/244 (63%), Gaps = 13/244 (5%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P+PSILRL++ +W A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGVHHKH------WDGVVPPECAPDPSILRLSSAQQWEEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPN-----FGVIGLVPCAIGGTNISQWRKGSSLYEQM 139
DID KT G+GPG+ FA AVL+ + IGLVPCA+GGT I +W G LYEQM
Sbjct: 56 DIDTTKTCGIGPGMAFARAVLSSLQEDTPGAAAQIGLVPCAVGGTAIREWSLGKHLYEQM 115
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
+ RA+VA G I A+LWYQGESD + D Y E + ++R+DL P LP I+V
Sbjct: 116 VSRARVATLYG-EIEAILWYQGESDAESDADTSAYLENVERLICNVRADLGMPQLPFIQV 174
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALR 259
ALASG IE VR AQ S +LPNV VD MG+ L D LHLTT +Q L EA
Sbjct: 175 ALASGNKRNIEKVRNAQFSVNLPNVVTVDPMGMALNEDKLHLTTESQ-VKLGKMLAEAYI 233
Query: 260 VNLS 263
+N S
Sbjct: 234 LNFS 237
>gi|115457508|ref|NP_001052354.1| Os04g0276600 [Oryza sativa Japonica Group]
gi|58532036|emb|CAE05089.3| OSJNBa0009K15.9 [Oryza sativa Japonica Group]
gi|113563925|dbj|BAF14268.1| Os04g0276600 [Oryza sativa Japonica Group]
gi|215695517|dbj|BAG90708.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 237
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 123/224 (54%), Positives = 148/224 (66%), Gaps = 10/224 (4%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P PS+LRLTA L WV A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGV------HHRRWDGVVPPECAPCPSVLRLTAALDWVEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV--PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID KT GVGPG+ FA AVL ++ P GV GLVPCA+GGT I +W +G LY+QM++R
Sbjct: 56 DIDTAKTCGVGPGMAFARAVLPRLDPPGSGV-GLVPCAVGGTAIREWARGERLYDQMVRR 114
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
A+ A G I AV WYQGESD + Y + ++R DL P LP I+VALA
Sbjct: 115 ARAAAE-CGEIEAVQWYQGESDAESDAATAAYAGNLETLIANVREDLGMPQLPFIQVALA 173
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG IE VRKAQL +LPNV VDA GL L D LHLTT +Q
Sbjct: 174 SGNKKNIEKVRKAQLGINLPNVVTVDAFGLSLNEDHLHLTTESQ 217
>gi|242048404|ref|XP_002461948.1| hypothetical protein SORBIDRAFT_02g011020 [Sorghum bicolor]
gi|241925325|gb|EER98469.1| hypothetical protein SORBIDRAFT_02g011020 [Sorghum bicolor]
Length = 269
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 116/229 (50%), Positives = 154/229 (67%), Gaps = 14/229 (6%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNM+GRGG TN T WDGIVPP+C P+ ILRL+ L+W A EPLH
Sbjct: 31 VFILAGQSNMSGRGGATNGT------WDGIVPPECAPSGRILRLSPALRWEEAREPLHDG 84
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFG---VIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
IDV G+GPG+PFA+AVL + V+GLVPCA GGT I+ W +G+ LYE+M+ R
Sbjct: 85 IDVGNVVGIGPGMPFAHAVLAATSSGSDSVVVGLVPCAQGGTPIANWTRGTELYERMVTR 144
Query: 143 AQVAL---RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
A+ A+ G G + VLW+QGE+DT+ EDA+LY+ R + D+R DL P L +I+V
Sbjct: 145 ARAAVAECSGRGELAGVLWFQGEADTMRREDAELYRRRMETLVHDVRRDLGRPDLLVIQV 204
Query: 200 ALASGE--GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+ + G F+++VR+AQ + LPNV+ VDAMGLP+ D HLT AQ
Sbjct: 205 GIATAQYNGKFLDVVREAQKAVTLPNVKYVDAMGLPIASDHTHLTMEAQ 253
>gi|226499498|ref|NP_001146996.1| receptor protein kinase-like protein [Zea mays]
gi|195606294|gb|ACG24977.1| receptor protein kinase-like protein [Zea mays]
Length = 243
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 125/246 (50%), Positives = 155/246 (63%), Gaps = 15/246 (6%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P+PSILRL++ +W A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGVHHKH------WDGVVPPECAPDPSILRLSSAQQWEEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV----PNFGV---IGLVPCAIGGTNISQWRKGSSLYE 137
DID KT G+GPG+ FA AVL+++ P IGLVPCA+GGT I +W G LYE
Sbjct: 56 DIDTTKTCGIGPGMAFARAVLSRLQEDTPGAATQIGIGLVPCAVGGTAIREWSLGKHLYE 115
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
QM+ RA+VA G I A+LWYQGESD + D Y E ++R+DL P LP I
Sbjct: 116 QMVSRARVATLYG-EIEAILWYQGESDAESDADTSAYLENVKRLICNVRADLGMPQLPFI 174
Query: 198 RVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEA 257
+VALASG IE VR AQ S +LPNV VD MG+ L D LHLTT +Q L EA
Sbjct: 175 QVALASGNKRNIEKVRNAQFSVNLPNVVTVDPMGMALNEDKLHLTTESQ-VKLGKMLAEA 233
Query: 258 LRVNLS 263
+N S
Sbjct: 234 YILNFS 239
>gi|414884494|tpg|DAA60508.1| TPA: hypothetical protein ZEAMMB73_597600 [Zea mays]
Length = 270
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 117/230 (50%), Positives = 155/230 (67%), Gaps = 15/230 (6%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNM+GRGG TN T WDGIVPP+C P+ I+RL+ L+W A EPLHA
Sbjct: 32 VFILAGQSNMSGRGGATNGT------WDGIVPPECAPSDRIVRLSPALRWEEAREPLHAG 85
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFG----VIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
+DV GVGPG+PFA+AVL V+GLVPCA GGT I+ W +G+ LYE+M+
Sbjct: 86 VDVGNVLGVGPGMPFAHAVLASEGAAAEPPVVVGLVPCAQGGTPIANWSRGTELYERMVT 145
Query: 142 RAQVAL---RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
RA+ A+ G G + A+LWYQGE+DT+ +DA+LY+ R + D+R DL P L +I+
Sbjct: 146 RARAAVAECSGRGHLAALLWYQGEADTMRRQDAELYQRRMETLVRDVRCDLGRPDLLVIQ 205
Query: 199 VALASGE--GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
V +A+ + G F+ +VR+AQ + LPNV+ VDAMGLP+ D HLTT AQ
Sbjct: 206 VGIATAQYNGKFLGVVREAQKAVKLPNVKYVDAMGLPIASDHTHLTTEAQ 255
>gi|357115381|ref|XP_003559467.1| PREDICTED: LOW QUALITY PROTEIN: probable carbohydrate esterase
At4g34215-like [Brachypodium distachyon]
Length = 272
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 124/225 (55%), Positives = 150/225 (66%), Gaps = 11/225 (4%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +LAGQSNMAGRGGVT WDG+VPP P+PS+LRLTA L+W A EPLH
Sbjct: 35 VFVLAGQSNMAGRGGVTG------ARWDGVVPPDSAPSPSVLRLTADLRWEEAREPLHQG 88
Query: 86 IDV---NKTNGVGPGLPFANAVL-TKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
IDV N+ GVGPG+ FANAVL + + +GLVPCA+ GT +++W KGS LY M++
Sbjct: 89 IDVGGGNRAVGVGPGMAFANAVLRSGRLDGAAVGLVPCAVXGTRMAEWGKGSELYGDMVR 148
Query: 142 RAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL 201
RA+VA+ GG I AVLWY GESDTV DA L R M DLR+DL P L +I+V L
Sbjct: 149 RARVAVETGGRIGAVLWYXGESDTVRWADAIL-TPRMAMLXRDLRADLAMPHLLLIQVGL 207
Query: 202 ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
ASG G + E+VR+AQ L NVR VDAMGLP + LHL T AQ
Sbjct: 208 ASGLGQYTEVVREAQKGLRLHNVRFVDAMGLPFQDGHLHLNTQAQ 252
>gi|125589694|gb|EAZ30044.1| hypothetical protein OsJ_14101 [Oryza sativa Japonica Group]
Length = 237
Score = 223 bits (567), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 122/224 (54%), Positives = 147/224 (65%), Gaps = 10/224 (4%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P PS+LRLTA L WV A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGV------HHRRWDGVVPPECAPCPSVLRLTAALDWVEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKV--PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID KT GVGPG+ FA AVL ++ P GV GLVP A+GGT I +W +G LY+QM++R
Sbjct: 56 DIDTAKTCGVGPGMAFARAVLPRLDPPGSGV-GLVPWAVGGTAIREWARGERLYDQMVRR 114
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
A+ A G I AV WYQGESD + Y + ++R DL P LP I+VALA
Sbjct: 115 ARAAAE-CGEIEAVQWYQGESDAESDAATAAYAGNLETLIANVREDLGMPQLPFIQVALA 173
Query: 203 SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG IE VRKAQL +LPNV VDA GL L D LHLTT +Q
Sbjct: 174 SGNKKNIEKVRKAQLGINLPNVVTVDAFGLSLNEDHLHLTTESQ 217
>gi|219363025|ref|NP_001136877.1| uncharacterized protein LOC100217031 [Zea mays]
gi|194697446|gb|ACF82807.1| unknown [Zea mays]
Length = 227
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 114/214 (53%), Positives = 144/214 (67%), Gaps = 8/214 (3%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDV-NKTNG 93
MAGRGGV + WDG+VP C P+P++LRL+ L+W A EPLHA ID N G
Sbjct: 1 MAGRGGVVANR------WDGVVPGDCAPSPAVLRLSPDLRWEEAREPLHAGIDAANHAVG 54
Query: 94 VGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGT 152
VGPG+ FANA+L G V+GLVPCA+GGT +++W +G+ LY +M++RA+VA+ GG
Sbjct: 55 VGPGMAFANALLRSGRAGGAVVGLVPCAVGGTRMAEWGRGTELYAEMLRRARVAVETGGR 114
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
I A+LWYQGESDTV DA Y R M DLR+DL P L +I+V LASG G + ++V
Sbjct: 115 IGALLWYQGESDTVRWSDATEYGRRMGMLVRDLRADLGIPHLLVIQVGLASGLGQYTQVV 174
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
R AQ L NVR VDAMGLPL+ LHL+T AQ
Sbjct: 175 RDAQKGIKLRNVRFVDAMGLPLQDGHLHLSTQAQ 208
>gi|226509714|ref|NP_001150914.1| receptor protein kinase-like protein precursor [Zea mays]
gi|195642928|gb|ACG40932.1| receptor protein kinase-like protein [Zea mays]
Length = 268
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 117/227 (51%), Positives = 151/227 (66%), Gaps = 12/227 (5%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +LAGQSNM GRGG TN T WDG+VPP C P+P ILRL+ L+W A EPLHA
Sbjct: 30 VFLLAGQSNMGGRGGATNGT------WDGVVPPACAPSPRILRLSPSLRWEEAREPLHAG 83
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFG----VIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
ID++ GVGPG+PFA+A+L G V+GLVPCA G T I+ W +G+ LY++M+
Sbjct: 84 IDLHNVLGVGPGMPFAHALLRSWRRSGRRPAVVGLVPCAQGATPIASWSRGTPLYDRMLA 143
Query: 142 RAQVALRGGGTIR--AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ G R A+LWYQGE+DT+ +DA Y R + D+R DL P L +I+V
Sbjct: 144 RARAAVARGPATRLAALLWYQGEADTIRRQDADAYTPRMEALVRDVRRDLGMPDLLVIQV 203
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
LA+G+G F++IVR+AQ L NVR VDA GLP+ D HLTTPAQ
Sbjct: 204 GLATGQGRFVDIVREAQRRVSLRNVRYVDAKGLPVANDYTHLTTPAQ 250
>gi|414874027|tpg|DAA52584.1| TPA: hypothetical protein ZEAMMB73_890704 [Zea mays]
Length = 274
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/227 (51%), Positives = 151/227 (66%), Gaps = 12/227 (5%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +LAGQSNM GRGG TN T WDG+VPP C P+P ILRL+ L+W A EPLHA
Sbjct: 36 VFLLAGQSNMGGRGGATNGT------WDGVVPPACAPSPRILRLSPSLRWEEAREPLHAG 89
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFG----VIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
ID++ GVGPG+PFA+A+L G V+GL+PCA G T I+ W +G+ LY++M+
Sbjct: 90 IDLHNVLGVGPGMPFAHALLRSWRRSGRRPAVVGLIPCAQGATPIASWSRGTPLYDRMLA 149
Query: 142 RAQVALRGGGTIR--AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ G R A+LWYQGE+DT+ +DA Y R + D+R DL P L +I+V
Sbjct: 150 RARAAVARGPATRLAALLWYQGEADTIRRQDADAYTPRMEALVRDVRRDLGMPDLLVIQV 209
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
LA+G+G F++IVR+AQ L NVR VDA GLP+ D HLTTPAQ
Sbjct: 210 GLATGQGRFVDIVREAQRRVSLRNVRYVDAKGLPVANDYTHLTTPAQ 256
>gi|125546521|gb|EAY92660.1| hypothetical protein OsI_14409 [Oryza sativa Indica Group]
Length = 264
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/226 (49%), Positives = 150/226 (66%), Gaps = 11/226 (4%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +L GQSNM GRGG TN WDG+VPP+C P+P ILRL+ +L+W A EPLHA
Sbjct: 30 IFLLGGQSNMGGRGGATNGP------WDGVVPPECAPSPRILRLSPELRWEEAREPLHAG 83
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR--- 142
IDV+ GVGPG+ FA+A+ +P VIGLVPCA GGT I+ W +G+ LYE+M+ R
Sbjct: 84 IDVHNVLGVGPGMSFAHALFRAIPPSTVIGLVPCAQGGTPIANWTRGTELYERMVARGRA 143
Query: 143 --AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
A G + A+LWYQGE+DT+ EDA++Y + + D+R DL P L +I+V
Sbjct: 144 AMATAGAGAGARMGALLWYQGEADTIRREDAEVYARKMEGMVRDVRRDLALPELLVIQVG 203
Query: 201 LASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+G+G F+E VR+AQ + LP ++ VDA GLP+ D HLTTPAQ
Sbjct: 204 IATGQGKFVEPVREAQKAVRLPFLKYVDAKGLPIANDYTHLTTPAQ 249
>gi|115456713|ref|NP_001051957.1| Os03g0857600 [Oryza sativa Japonica Group]
gi|30102977|gb|AAP21390.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108712202|gb|ABF99997.1| expressed protein [Oryza sativa Japonica Group]
gi|113550428|dbj|BAF13871.1| Os03g0857600 [Oryza sativa Japonica Group]
gi|215686426|dbj|BAG87711.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215704718|dbj|BAG94746.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 266
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 112/226 (49%), Positives = 150/226 (66%), Gaps = 11/226 (4%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +L GQSNM GRGG TN WDG+VPP+C P+P ILRL+ +L+W A EPLHA
Sbjct: 32 IFLLGGQSNMGGRGGATNGP------WDGVVPPECAPSPRILRLSPELRWEEAREPLHAG 85
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR--- 142
IDV+ GVGPG+ FA+A+ +P VIGLVPCA GGT I+ W +G+ LYE+M+ R
Sbjct: 86 IDVHNVLGVGPGMSFAHALFRAIPPSTVIGLVPCAQGGTPIANWTRGTELYERMVGRGRA 145
Query: 143 --AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
A G + A+LWYQGE+DT+ EDA++Y + + D+R DL P L +I+V
Sbjct: 146 AMATAGAGAGARMGALLWYQGEADTIRREDAEVYARKMEGMVRDVRRDLALPELLVIQVG 205
Query: 201 LASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+G+G F+E VR+AQ + LP ++ VDA GLP+ D HLTTPAQ
Sbjct: 206 IATGQGKFVEPVREAQKAVRLPFLKYVDAKGLPIANDYTHLTTPAQ 251
>gi|224137648|ref|XP_002327178.1| predicted protein [Populus trichocarpa]
gi|222835493|gb|EEE73928.1| predicted protein [Populus trichocarpa]
Length = 198
Score = 217 bits (552), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 101/177 (57%), Positives = 128/177 (72%), Gaps = 6/177 (3%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
Q + ILAGQSNMAGRGGV + WDG VPP+C+PNPS LRL+AKL W AHEPLH
Sbjct: 16 QDIFILAGQSNMAGRGGVEHGK------WDGNVPPECRPNPSTLRLSAKLTWEEAHEPLH 69
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADIDV KT G+GPG+ F + + GV+GLVPCA+GGT IS+W +G+ LY Q++ RA
Sbjct: 70 ADIDVGKTCGIGPGMAFVDGLRANGSRIGVVGLVPCAVGGTKISKWARGTQLYSQLVSRA 129
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
+++ GGTIRA+LWYQGESDTV EDA YK + T+LR+DL P LP+I+++
Sbjct: 130 GASVKDGGTIRAILWYQGESDTVTKEDADAYKGNMETLITNLRTDLNIPSLPVIQMS 186
>gi|326522672|dbj|BAJ88382.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 274
Score = 216 bits (551), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/232 (49%), Positives = 152/232 (65%), Gaps = 16/232 (6%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNM GRGG T+ R WDG+VPP+C P+P LRL+ L+W A EPLHA
Sbjct: 33 VFILAGQSNMGGRGGATSGNR-----WDGVVPPECAPSPRTLRLSPSLRWEEAREPLHAG 87
Query: 86 IDVNKTNGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
+D GVGPG+PFA+A+L P V+GLVPCA GGT I+ W +GS LY++M+ RA
Sbjct: 88 VDAGNVVGVGPGMPFAHALLRSPACPRGAVVGLVPCAQGGTPIANWSRGSELYDRMVTRA 147
Query: 144 QVALRGGGT---IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
+VA G GT I A+LW+QGE+DT+ EDA Y R + F D+R DL P L +I+V
Sbjct: 148 RVAGAGTGTGKKIAALLWFQGEADTLRREDALAYAGRMESFVHDVRRDLALPNLLVIQVG 207
Query: 201 LASG------EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+ +G ++++VRK Q + + N++ VDAMGLP+ D HLTT AQ
Sbjct: 208 IATAQWQGNKQGKWLDLVRKEQRAVRVANLKYVDAMGLPIANDITHLTTQAQ 259
>gi|326522823|dbj|BAJ88457.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523543|dbj|BAJ92942.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 274
Score = 216 bits (551), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 114/232 (49%), Positives = 152/232 (65%), Gaps = 16/232 (6%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNM GRGG T+ R WDG+VPP+C P+P LRL+ L+W A EPLHA
Sbjct: 33 VFILAGQSNMGGRGGATSGNR-----WDGVVPPECAPSPRTLRLSPSLRWEEAREPLHAG 87
Query: 86 IDVNKTNGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
+D GVGPG+PFA+A+L P V+GLVPCA GGT I+ W +GS LY++M+ RA
Sbjct: 88 VDAGNVVGVGPGMPFAHALLRSPACPRGAVVGLVPCAQGGTPIANWSRGSELYDRMVTRA 147
Query: 144 QVALRGGGT---IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
+VA G GT I A+LW+QGE+DT+ EDA Y R + F D+R DL P L +I+V
Sbjct: 148 RVAGAGTGTGKKIAALLWFQGEADTLRREDALAYAGRMESFVHDVRRDLALPNLLVIQVG 207
Query: 201 LASG------EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+ +G ++++VRK Q + + N++ VDAMGLP+ D HLTT AQ
Sbjct: 208 IATAQWQGNKQGKWLDLVRKEQRAVRVANLKYVDAMGLPIANDITHLTTQAQ 259
>gi|2911040|emb|CAA17550.1| receptor protein kinase-like protein [Arabidopsis thaliana]
gi|7270372|emb|CAB80139.1| receptor protein kinase-like protein [Arabidopsis thaliana]
Length = 980
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 100/180 (55%), Positives = 128/180 (71%), Gaps = 1/180 (0%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
Q+ IL+GQSNMAGRGGV D N+ WD I+PP+C PN SILRL+A L+W AHEPLH
Sbjct: 797 NQIFILSGQSNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAHEPLH 856
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVP-NFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
DID K GVGPG+ FANAV ++ + VIGLVPCA GGT I +W +GS LYE+M++R
Sbjct: 857 VDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYERMVKR 916
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA 202
+ + + GG I+AVLWYQGESD +++ DA+ Y D +LR DL P LPII+V+L+
Sbjct: 917 TEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQVSLS 976
>gi|357167782|ref|XP_003581330.1| PREDICTED: probable carbohydrate esterase At4g34215-like
[Brachypodium distachyon]
Length = 247
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 120/233 (51%), Positives = 149/233 (63%), Gaps = 18/233 (7%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P PSILRL+A L W A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGV------HHRRWDGVVPPECAPLPSILRLSAALDWEEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGV-----------IGLVPCAIGGTNISQWRKGS 133
DID KT GVGPG+ FA A+L ++ +GLVPCA+GGT I +W +G
Sbjct: 56 DIDKAKTCGVGPGMAFARAILPQLQPPAPAPAPGAAAGAGVGLVPCAVGGTAIREWARGE 115
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
LYEQM++RA+ A G I A+LWYQGESD + A Y+ + ++R DL P
Sbjct: 116 PLYEQMVRRARAATEYG-EIEALLWYQGESDAESDAAAAAYQGNVERLIANVREDLGMPE 174
Query: 194 LPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
LP I+VALASG E VRKAQLS +LPNV VDA+GL L D LHLTT +Q
Sbjct: 175 LPFIQVALASGNKRNFEKVRKAQLSINLPNVVTVDAIGLALNDDNLHLTTESQ 227
>gi|168007564|ref|XP_001756478.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692517|gb|EDQ78874.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 120/237 (50%), Positives = 153/237 (64%), Gaps = 16/237 (6%)
Query: 25 QLIILAGQSNMAGRGG----VTNDTRTNKLTWDGIVPPQCQPNP-SILRLTAKLKWVLAH 79
++ IL+GQSNM+GRGG V D T++ WDGIVP +C P SILRL L+W AH
Sbjct: 9 EIFILSGQSNMSGRGGMQTIVAKDGSTSR-KWDGIVPAECAAEPGSILRLNKNLEWEEAH 67
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLT----KV-PNFGVIGLVPCAIGGTNISQWRKGSS 134
EP H DID +K GVGPGL FA ++L KV P IGLVPCAIGGT+I QW KG
Sbjct: 68 EPTHIDIDTSKACGVGPGLVFAASLLRARKYKVKPTGPQIGLVPCAIGGTSIVQWEKGRV 127
Query: 135 LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLL 194
LY MIQR + AL GGT++A+LWYQGESD V A Y++R FF +R+DL + L
Sbjct: 128 LYNHMIQRTKAALEKGGTLKALLWYQGESDAVEKSLADHYEQRLVTFFNHVRTDLNNHNL 187
Query: 195 PIIRVAL---ASGEGPFIEIVRKAQLSS--DLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
PII+VA+ A+ ++ VR AQ ++ + ++ VDA+GLPL D +HLTT AQ
Sbjct: 188 PIIQVAINWPAAPHPEYVNKVRSAQRAALDHVKHLHLVDALGLPLLSDHIHLTTEAQ 244
>gi|87240753|gb|ABD32611.1| hypothetical protein MtrDRAFT_AC150207g1v2 [Medicago truncatula]
gi|87241431|gb|ABD33289.1| hypothetical protein MtrDRAFT_AC158501g26v2 [Medicago truncatula]
Length = 205
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 106/199 (53%), Positives = 137/199 (68%), Gaps = 11/199 (5%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
+++ LC+++V+ C + + ILAGQSNMAGRGGV N WDG +PP+C
Sbjct: 7 IWSMFLCVLVVTP----HCGKATKDIFILAGQSNMAGRGGVLNGK------WDGNIPPEC 56
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCA 120
+PNPSIL+L KLKW AHEPLHADIDV KT G+GPGL FAN V+ V+GLVPCA
Sbjct: 57 KPNPSILKLNTKLKWEEAHEPLHADIDVGKTCGIGPGLAFANEVVRMSGGECVVGLVPCA 116
Query: 121 IGGTNISQWRKGSSLYEQMIQRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
+GGT I +WR GS LY ++++R+ +++ G G IRAVLWYQGESDTV EDA+ YK R +
Sbjct: 117 VGGTRIEEWRNGSHLYNELVRRSIESVKDGDGVIRAVLWYQGESDTVREEDAERYKYRME 176
Query: 180 MFFTDLRSDLQSPLLPIIR 198
+LR DLQ P L +I+
Sbjct: 177 NLIENLRLDLQLPSLLVIQ 195
>gi|357166181|ref|XP_003580626.1| PREDICTED: probable carbohydrate esterase At4g34215-like
[Brachypodium distachyon]
Length = 300
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/244 (47%), Positives = 153/244 (62%), Gaps = 29/244 (11%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + +PP +P ILRL+A +WV A
Sbjct: 46 HRPKLLFLLAGQSNMAGRGALPAS-----------LPPPYATHPRILRLSAARRWVAASP 94
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKV-----------PNFG------VIGLVPCAIGG 123
PLHADID +KT G+GP +PFA+ VL+ V P V+GLVPCA+GG
Sbjct: 95 PLHADIDTHKTCGLGPAMPFAHRVLSSVSADSAPSSVSDPGAASDDDPLVLGLVPCAVGG 154
Query: 124 TNISQWRKGSSLYEQMIQRAQVAL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
T I W +G LYE + R + A+ GGGT+ AVLW+QGESDT+ ++DA+ Y + +
Sbjct: 155 TRIWMWARGQPLYEAAVVRTRAAVADGGGTLGAVLWFQGESDTIEMDDARSYGGKMERLV 214
Query: 183 TDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLT 242
DLR+DL P L +I+V LASGEG + +IVR+AQ + +LPNV VDAMGLPL D LHL+
Sbjct: 215 ADLRADLGLPNLLVIQVGLASGEGNYTDIVREAQKNINLPNVILVDAMGLPLRDDQLHLS 274
Query: 243 TPAQ 246
T AQ
Sbjct: 275 TEAQ 278
>gi|302142265|emb|CBI19468.3| unnamed protein product [Vitis vinifera]
Length = 185
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 98/149 (65%), Positives = 125/149 (83%), Gaps = 2/149 (1%)
Query: 98 LPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVL 157
+ FANAVL + P FG++GLVPCA+G TNIS+W +G+ LY Q+++RA+ +L+ GG IRA+L
Sbjct: 1 MAFANAVL-RDPAFGIVGLVPCAVGATNISEWSRGTYLYTQLVRRAKASLQHGGKIRALL 59
Query: 158 WYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQL 217
WYQGESD+ + E AK YK + + F DLR+DL+SP+LP+I+VALASG GPFI+IVR+AQL
Sbjct: 60 WYQGESDSKSPEYAKSYKGKLEKFILDLRTDLRSPMLPVIQVALASG-GPFIKIVREAQL 118
Query: 218 SSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
DLPNV CVDAMGLPLEPDG+HLTTPAQ
Sbjct: 119 GVDLPNVTCVDAMGLPLEPDGIHLTTPAQ 147
>gi|90265156|emb|CAH67782.1| H0201G08.9 [Oryza sativa Indica Group]
Length = 282
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 155/244 (63%), Gaps = 12/244 (4%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + L +P +LRL A +WV A
Sbjct: 45 HRPKLLFLLAGQSNMAGRGALARPLPPPYLP-----------HPRLLRLAASRRWVPAAP 93
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMI 140
PLHADID +KT G+GP +PFA+ +L + + V+GLVPCA+GGT I W +G LYE I
Sbjct: 94 PLHADIDTHKTCGLGPAMPFAHRLLLLLHSDEVLGLVPCAVGGTRIWMWARGQPLYEAAI 153
Query: 141 QRAQVAL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ GGG I AVLW+QGESDT+ L+DA+ Y + + DLR+DL P L +I+V
Sbjct: 154 DRARAAVADGGGAIGAVLWFQGESDTIELDDARSYGAKMERLVADLRADLHLPNLLVIQV 213
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALR 259
LASGEG + +IVR+AQ + +LPNV VDAMGLPL D LHL+T AQ N + L+
Sbjct: 214 GLASGEGNYTDIVREAQKNINLPNVLLVDAMGLPLRDDQLHLSTEAQLQLGNMLAEAYLK 273
Query: 260 VNLS 263
N S
Sbjct: 274 FNSS 277
>gi|218194221|gb|EEC76648.1| hypothetical protein OsI_14598 [Oryza sativa Indica Group]
Length = 285
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 155/244 (63%), Gaps = 12/244 (4%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + L +P +LRL A +WV A
Sbjct: 48 HRPKLLFLLAGQSNMAGRGALARPLPPPYLP-----------HPRLLRLAASRRWVPAAP 96
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMI 140
PLHADID +KT G+GP +PFA+ +L + + V+GLVPCA+GGT I W +G LYE I
Sbjct: 97 PLHADIDTHKTCGLGPAMPFAHRLLLLLHSDEVLGLVPCAVGGTRIWMWARGQPLYEAAI 156
Query: 141 QRAQVAL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ GGG I AVLW+QGESDT+ L+DA+ Y + + DLR+DL P L +I+V
Sbjct: 157 DRARAAVADGGGAIGAVLWFQGESDTIELDDARSYGAKMERLVADLRADLHLPNLLVIQV 216
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALR 259
LASGEG + +IVR+AQ + +LPNV VDAMGLPL D LHL+T AQ N + L+
Sbjct: 217 GLASGEGNYTDIVREAQKNINLPNVLLVDAMGLPLRDDQLHLSTEAQLQLGNMLAEAYLK 276
Query: 260 VNLS 263
N S
Sbjct: 277 FNSS 280
>gi|449530291|ref|XP_004172129.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 288
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/225 (47%), Positives = 146/225 (64%), Gaps = 15/225 (6%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + I AGQSNMAGRGGV N+ + N L WDG+VPP+CQ PSILRL +W +A EPLH
Sbjct: 26 KNIFIFAGQSNMAGRGGVENNNKGN-LMWDGLVPPECQSEPSILRLNPDRQWEIAREPLH 84
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGGTNISQWRKGSS-----LYE 137
ID+N+T G+GPG+PFA+ +L KV PN G +GLVPCA GGT I QW K S Y+
Sbjct: 85 LGIDINRTPGIGPGMPFAHELLAKVGPNAGAVGLVPCARGGTLIGQWVKNPSNPSATFYQ 144
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
I+R + + + GG +RA+ W+QGESD + A YK+ FFTD+R+D++ LPII
Sbjct: 145 NFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRNDIKPRFLPII 204
Query: 198 RVALA------SGEGPFIEIVRKAQ--LSSDLPNVRCVDAMGLPL 234
V +A + + VR+AQ +S +LP+V +D++ LP+
Sbjct: 205 VVKIALYDFMMQHDTHNLPAVREAQDAVSKELPDVVAIDSLELPI 249
>gi|449446514|ref|XP_004141016.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 273
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 108/233 (46%), Positives = 144/233 (61%), Gaps = 10/233 (4%)
Query: 18 KCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVL 77
K + ILAGQSNMAGRGGV+ D T+K+ WDG +P +C+ N SI RL A + W
Sbjct: 9 KATTSPNNIFILAGQSNMAGRGGVSLDPTTDKMVWDGYIPLECESNDSIFRLNADMVWEQ 68
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANAVLT-KVPNFGVIGLVPCAIGGTNISQWRKGSSLY 136
AHEPLH DIDV KTNG+GPG+ FAN +L G IGLVPCAIGG+++ +W KG++ Y
Sbjct: 69 AHEPLHWDIDVVKTNGIGPGMAFANELLAIGGKRIGAIGLVPCAIGGSHLKEWVKGTNRY 128
Query: 137 EQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPI 196
+ +++R + + + GGT++ +LWYQGESD E+A Y+ FF DLR+D P LPI
Sbjct: 129 DNLVERIRASEKNGGTVQGILWYQGESDAAVEEEAMCYERELTKFFIDLRADTNHPELPI 188
Query: 197 IRVALASGEG------PFIEIVRKA--QLSSDLPNVRCVDA-MGLPLEPDGLH 240
I V L + + F E V A ++ LPNV VD M + DGL+
Sbjct: 189 ILVKLVTHDFFLSPNISFKEEVCNALEAVTHRLPNVTMVDGPMAVGNFDDGLN 241
>gi|449482786|ref|XP_004156403.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 288
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 109/244 (44%), Positives = 153/244 (62%), Gaps = 17/244 (6%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
+LC++L + + + + ILAGQSNMAGRGGV N+ + N L WDG+VPP+CQP P
Sbjct: 9 ILCVMLYGPS--LSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQP 65
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGG 123
SILRL L+W +A EPLH ID+ +T G+GPG+ FA+ +L K PN G +GLVPCA GG
Sbjct: 66 SILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGG 125
Query: 124 TNISQWRKGSS-----LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
T I QW K S Y+ I+R + + + GG +RA+ W+QGESD + A YK+
Sbjct: 126 TLIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNL 185
Query: 179 DMFFTDLRSDLQSPLLPIIRVALA------SGEGPFIEIVRKAQ--LSSDLPNVRCVDAM 230
FFTD+R D++ LPII V +A + + VR+AQ +S +LP+V +D++
Sbjct: 186 KKFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVSKELPDVVAIDSL 245
Query: 231 GLPL 234
LP+
Sbjct: 246 KLPI 249
>gi|307135858|gb|ADN33727.1| hypothetical protein [Cucumis melo subsp. melo]
Length = 291
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/244 (45%), Positives = 150/244 (61%), Gaps = 17/244 (6%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
LLC +L + + Q + IL GQSNMAGRGGV ++ + K WDG++PP C+PNP
Sbjct: 12 LLCAMLFGPS--LSGAVSPQNIFILGGQSNMAGRGGVEKNS-SGKFEWDGVIPPDCKPNP 68
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGG 123
SILRL A +W +A EPLH DIDV K NG+ PG+ FA+ +L K P GV+GLVP AIGG
Sbjct: 69 SILRLNAARQWEVAREPLHWDIDVMKANGISPGMGFAHELLVKAGPRAGVVGLVPTAIGG 128
Query: 124 TNISQWRKGSS-----LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
T I QW K S Y+ +++R Q + + GG +RA+LW+QGESD E+A YK+
Sbjct: 129 TFIRQWLKNDSYPNATYYQNLVERIQASDKEGGVVRALLWFQGESDAAVKEEAINYKDNL 188
Query: 179 DMFFTDLRSDLQSPLLPIIRVALA------SGEGPFIEIVRKAQ--LSSDLPNVRCVDAM 230
F DLR D+Q LP+I V +A + + IVR AQ +S ++P+V +D+
Sbjct: 189 KTFIMDLRRDIQPRFLPVIIVKIALYDFLRANATDNLSIVRAAQEAVSKEVPDVSIIDSW 248
Query: 231 GLPL 234
LP+
Sbjct: 249 KLPM 252
>gi|413917772|gb|AFW57704.1| hypothetical protein ZEAMMB73_046701 [Zea mays]
Length = 285
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 115/240 (47%), Positives = 153/240 (63%), Gaps = 12/240 (5%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+++LAGQSNMAGRG + ++PPQ +P+P +LRL A +WV+A PLHAD
Sbjct: 53 VVLLAGQSNMAGRGLAPS-----------LLPPQFRPHPRVLRLAASRRWVVAAPPLHAD 101
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
ID +K G+GP +PFA+ +L V+GLVPCA+GGT I W KG LYE + R +
Sbjct: 102 IDTHKACGLGPAMPFAHRLLHAASPDLVLGLVPCAVGGTRIWMWAKGEPLYEAAVARGRA 161
Query: 146 ALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASG 204
A+ GG T+ AVLW+QGESDT+ L+DA Y R + D R+DL P L +I+V LASG
Sbjct: 162 AVAAGGGTLGAVLWFQGESDTIELDDATAYGGRMERLVNDFRADLGMPNLLVIQVGLASG 221
Query: 205 EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRVNLSL 264
EG + +IVR+AQ + LPNV VDA+GLPL D LHL+T AQ + L+ N S+
Sbjct: 222 EGNYTDIVREAQRNIKLPNVVLVDAIGLPLRDDQLHLSTEAQLRLGDMLGQAFLKFNSSM 281
>gi|449497121|ref|XP_004160318.1| PREDICTED: LOW QUALITY PROTEIN: probable carbohydrate esterase
At4g34215-like [Cucumis sativus]
Length = 199
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 123/187 (65%), Gaps = 1/187 (0%)
Query: 18 KCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVL 77
K + ILAGQSNMAGRGG D T+K+ WDG +P +C+ N SI RL A + W
Sbjct: 9 KATTSPNNIFILAGQSNMAGRGGFHXDPTTDKMVWDGYIPLECESNDSIFRLNADMVWEQ 68
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANAVLT-KVPNFGVIGLVPCAIGGTNISQWRKGSSLY 136
AHEPLH DIDV KTNG+GPG+ FAN +L G IGLVPCAIGG+++ +W KG++ Y
Sbjct: 69 AHEPLHWDIDVVKTNGIGPGMAFANELLAIGGKRIGAIGLVPCAIGGSHLKEWVKGTNRY 128
Query: 137 EQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPI 196
+ +++R + + + GGT++ +LWYQGESD E+A Y+ FF DLR+D P LPI
Sbjct: 129 DNLVERIRASEKNGGTVQGILWYQGESDAAVEEEAMCYERELTKFFLDLRADTNHPELPI 188
Query: 197 IRVALAS 203
I V L +
Sbjct: 189 ILVKLVT 195
>gi|302807241|ref|XP_002985333.1| hypothetical protein SELMODRAFT_122285 [Selaginella moellendorffii]
gi|302810988|ref|XP_002987184.1| hypothetical protein SELMODRAFT_125405 [Selaginella moellendorffii]
gi|300145081|gb|EFJ11760.1| hypothetical protein SELMODRAFT_125405 [Selaginella moellendorffii]
gi|300146796|gb|EFJ13463.1| hypothetical protein SELMODRAFT_122285 [Selaginella moellendorffii]
Length = 247
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 109/222 (49%), Positives = 146/222 (65%), Gaps = 8/222 (3%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC-QPNPSILRLTAKLKWVLAHEPLHA 84
++IL+GQSNMAGRGGV + WDG VP + PN +I RL L+W A EPLH
Sbjct: 16 VVILSGQSNMAGRGGV--HAVGQRREWDGFVPQESWAPNGTIKRLNVDLEWEDAAEPLHR 73
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQ 144
DID K G+GPGL F A++ + + +GLVPCA G T+I++W KGS LYE+MI+RA+
Sbjct: 74 DIDTGKVCGIGPGLTFGAALINQQRSR-FLGLVPCAKGATSITEWTKGSFLYERMIKRAK 132
Query: 145 VALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASG 204
A+R GG +RA+LWYQGE+DT++ A+ YK + F ++RSDL LP I+V SG
Sbjct: 133 EAIRKGGVLRALLWYQGETDTLSEHLARNYKRALEAFIGNVRSDLGWDQLPFIQV---SG 189
Query: 205 EGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
F+ +VR+AQ + NV VDA GL L+ DG+HLTT +Q
Sbjct: 190 SLDFV-LVRQAQQQIHIANVFYVDAHGLALQEDGVHLTTASQ 230
>gi|222628255|gb|EEE60387.1| hypothetical protein OsJ_13540 [Oryza sativa Japonica Group]
Length = 285
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 116/244 (47%), Positives = 155/244 (63%), Gaps = 12/244 (4%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + L +P +LRL A +WV A
Sbjct: 48 HRPKLLFLLAGQSNMAGRGALARPLPPPYLP-----------HPRLLRLAASRRWVPAAP 96
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMI 140
PLHADID +KT G+GP +PFA+ +L + + V+GLVPCA+GGT I W +G LYE +
Sbjct: 97 PLHADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLGLVPCAVGGTRIWMWARGQPLYEAAV 156
Query: 141 QRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ GGG I AVLW+QGESDT+ L+DA+ Y + + DLR+DL P L +I+V
Sbjct: 157 ARARAAVADGGGAIGAVLWFQGESDTIELDDARSYGGKMERLVADLRADLHLPNLLVIQV 216
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALR 259
LASGEG + +IVR+AQ + ++PNV VDAMGLPL D LHL+T AQ N + L+
Sbjct: 217 GLASGEGNYTDIVREAQKNINIPNVLLVDAMGLPLRDDQLHLSTEAQLQLGNMLAEAYLK 276
Query: 260 VNLS 263
N S
Sbjct: 277 FNSS 280
>gi|224105611|ref|XP_002313872.1| predicted protein [Populus trichocarpa]
gi|222850280|gb|EEE87827.1| predicted protein [Populus trichocarpa]
Length = 189
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 91/176 (51%), Positives = 123/176 (69%), Gaps = 2/176 (1%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + +LAGQSNM+GRGGV D+ N+ WD VP +CQP+P+ILRL+AKLKW A E +H
Sbjct: 9 KTIFVLAGQSNMSGRGGVIKDSHNNQKLWDRAVPLECQPHPNILRLSAKLKWEPASEQIH 68
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ADID K GVGPG+ FANAV ++ GV+GLVPCA+GGT I +W +G LYE M++RA
Sbjct: 69 ADIDTKKACGVGPGMSFANAVRERIT--GVVGLVPCAVGGTAIKEWARGEELYENMVKRA 126
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
+ +++ GG I+ +LW+QGESDT +A Y+ ++R DL P LPII+V
Sbjct: 127 KESVKDGGEIKGLLWFQGESDTSTQIEADAYQGNMKKLIENVREDLGLPSLPIIQV 182
>gi|38345580|emb|CAE01778.2| OSJNBa0027H06.16 [Oryza sativa Japonica Group]
Length = 282
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 116/244 (47%), Positives = 155/244 (63%), Gaps = 12/244 (4%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + L +P +LRL A +WV A
Sbjct: 45 HRPKLLFLLAGQSNMAGRGALARPLPPPYLP-----------HPRLLRLAASRRWVPAAP 93
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMI 140
PLHADID +KT G+GP +PFA+ +L + + V+GLVPCA+GGT I W +G LYE +
Sbjct: 94 PLHADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLGLVPCAVGGTRIWMWARGQPLYEAAV 153
Query: 141 QRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ GGG I AVLW+QGESDT+ L+DA+ Y + + DLR+DL P L +I+V
Sbjct: 154 ARARAAVADGGGAIGAVLWFQGESDTIELDDARSYGGKMERLVADLRADLHLPNLLVIQV 213
Query: 200 ALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALR 259
LASGEG + +IVR+AQ + ++PNV VDAMGLPL D LHL+T AQ N + L+
Sbjct: 214 GLASGEGNYTDIVREAQKNINIPNVLLVDAMGLPLRDDQLHLSTEAQLQLGNMLAEAYLK 273
Query: 260 VNLS 263
N S
Sbjct: 274 FNSS 277
>gi|242072212|ref|XP_002446042.1| hypothetical protein SORBIDRAFT_06g000860 [Sorghum bicolor]
gi|241937225|gb|EES10370.1| hypothetical protein SORBIDRAFT_06g000860 [Sorghum bicolor]
Length = 293
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 115/259 (44%), Positives = 151/259 (58%), Gaps = 19/259 (7%)
Query: 14 AWPVKCQYQQQQLI-ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAK 72
A+P +LI +LAGQSNMAGRG +P +LRL A
Sbjct: 42 AFPASPYATAPKLIFLLAGQSNMAGRGVAPLPLPPPFRP-----------HPRVLRLAAS 90
Query: 73 LKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFG------VIGLVPCAIGGTNI 126
L+WV+A PLHADID +K G+GP +PFA+ +L V+GLVPCA+GGT I
Sbjct: 91 LRWVVAAPPLHADIDTHKACGLGPAMPFAHRLLLHASAAADSESDLVLGLVPCAVGGTRI 150
Query: 127 SQWRKGSSLYEQMIQRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDL 185
W KG LY+ + R + A+ GGG + AVLW+QGESDT+ L+DA Y R + DL
Sbjct: 151 WMWAKGEPLYDSAVARTRAAVAAGGGKLGAVLWFQGESDTIELDDATAYGGRMERLVNDL 210
Query: 186 RSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
R+DL P L +I+V LASGEG + +IVR+AQ + +PNV VDA+GLPL D LHL+T A
Sbjct: 211 RADLGIPNLLVIQVGLASGEGNYTDIVREAQRNIKVPNVILVDAIGLPLRDDQLHLSTEA 270
Query: 246 QGSTLNSWSNEALRVNLSL 264
Q + L+ N S+
Sbjct: 271 QLQLGDMLGQAFLKFNSSM 289
>gi|223949923|gb|ACN29045.1| unknown [Zea mays]
gi|413932370|gb|AFW66921.1| hypothetical protein ZEAMMB73_339368 [Zea mays]
Length = 206
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 94/178 (52%), Positives = 121/178 (67%), Gaps = 8/178 (4%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRGGV + WDG+VP C P+P++LRL+ L+W A EPLHA
Sbjct: 30 VFILAGQSNMAGRGGVVANR------WDGVVPGDCAPSPAVLRLSPDLRWEEAREPLHAG 83
Query: 86 IDV-NKTNGVGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
ID N GVGPG+ FANA+L G V+GLVPCA+GGT +++W +G+ LY +M++RA
Sbjct: 84 IDAANHAVGVGPGMAFANALLRSGRAGGAVVGLVPCAVGGTRMAEWGRGTELYAEMLRRA 143
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL 201
+VA+ GG I A+LWYQGESDTV DA Y R M DLR+DL P L +I+V +
Sbjct: 144 RVAVETGGRIGALLWYQGESDTVRWSDATEYGRRMGMLVRDLRADLGIPHLLVIQVGV 201
>gi|449482789|ref|XP_004156404.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 252
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/214 (45%), Positives = 135/214 (63%), Gaps = 15/214 (7%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRGGV N+ + KL WDG+VPP+CQP PSILRL +W +A EPLH ID+ +T G+
Sbjct: 1 MAGRGGVENNAQ-GKLQWDGLVPPECQPQPSILRLNPDRQWEIAREPLHLGIDIKRTPGI 59
Query: 95 GPGLPFANAVLTKV-PNFGVIGLVPCAIGGTNISQWRKGSS-----LYEQMIQRAQVALR 148
GPG+ FA+ +L K PN G +GLVPCA GGT I +W K S Y+ I+R + + +
Sbjct: 60 GPGIAFAHELLAKAGPNAGAVGLVPCARGGTLIEEWVKNPSNPSATFYQNFIERIKASDK 119
Query: 149 GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--RVALASGEG 206
GG +RA+ W+QGESD + A YK+ FFTD+R D++ LPII ++AL
Sbjct: 120 DGGVVRALFWFQGESDAAMNDTAIRYKDNLKKFFTDIRDDIKPRYLPIIVVKIALYDFFR 179
Query: 207 PF----IEIVRKAQ--LSSDLPNVRCVDAMGLPL 234
P + VR+AQ +S +L +V +D++ LP+
Sbjct: 180 PHDTHNLPAVREAQEAVSKELADVVAIDSLKLPI 213
>gi|449525471|ref|XP_004169741.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 288
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 138/225 (61%), Gaps = 15/225 (6%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ IL+GQSNMAGRGGV + T L WDG++PP +P P ILRL A +W A EPL+
Sbjct: 26 NNIFILSGQSNMAGRGGVEKNA-TGNLHWDGVIPPDSEPTPCILRLNAARQWEEAREPLN 84
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGGTNISQWRKGS-----SLYE 137
DIDV K NG+ PG+ FA+ +L K P GV+GLVP AIGGT I QW K + + Y+
Sbjct: 85 FDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQ 144
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
+++R + + + GG +RA+LW+QGESD + A YK+ DLR+DL+ LP+I
Sbjct: 145 HLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVI 204
Query: 198 RVALASGE------GPFIEIVRKAQ--LSSDLPNVRCVDAMGLPL 234
V +A + + VR AQ +S+++P+V +D+ LP+
Sbjct: 205 LVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPM 249
>gi|449450528|ref|XP_004143014.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 320
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 138/225 (61%), Gaps = 15/225 (6%)
Query: 24 QQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ IL+GQSNMAGRGGV + T L WDG++PP +P P ILRL A +W A EPL+
Sbjct: 58 NNIFILSGQSNMAGRGGVEKNA-TGNLHWDGVIPPDSEPTPCILRLNAARQWEEAREPLN 116
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGGTNISQWRKGS-----SLYE 137
DIDV K NG+ PG+ FA+ +L K P GV+GLVP AIGGT I QW K + + Y+
Sbjct: 117 FDIDVKKENGISPGMGFAHEILRKAGPRAGVVGLVPTAIGGTVIRQWMKNTTDPNATYYQ 176
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
+++R + + + GG +RA+LW+QGESD + A YK+ DLR+DL+ LP+I
Sbjct: 177 HLVERIKASDKDGGVVRALLWFQGESDAAVKDYAINYKDNLKTLINDLRNDLKPRFLPVI 236
Query: 198 RVALASGE------GPFIEIVRKAQ--LSSDLPNVRCVDAMGLPL 234
V +A + + VR AQ +S+++P+V +D+ LP+
Sbjct: 237 LVKIAIYDFFAVNGTDNLSTVRAAQEAVSNEVPDVSIIDSWKLPM 281
>gi|326497465|dbj|BAK05822.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 112/245 (45%), Positives = 146/245 (59%), Gaps = 30/245 (12%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG T+ L +P +LRL A +WV A
Sbjct: 49 HRPKLLFLLAGQSNMAGRGAPTSPLPPPYLP-----------HPRLLRLAADRRWVAASP 97
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFG------------------VIGLVPCAIG 122
PLHADID +KT G+ P +PFA+ +L P+ V+GLVPCA+G
Sbjct: 98 PLHADIDTHKTCGLSPAMPFAHRLLLSSPSSANPAPSSVSGPAGEEDGRLVLGLVPCAVG 157
Query: 123 GTNISQWRKGSSLYEQMIQRAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMF 181
GT I W +G LYE + R + A+ GGG + AVLW+QGESDT+ ++DA+ Y + +
Sbjct: 158 GTRIWMWARGEPLYEAAVARTRAAVAGGGGELGAVLWFQGESDTIEVDDARAYGGKMERL 217
Query: 182 FTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHL 241
DLR DL P L +I+V LASGEG + +IVR AQ S +LPNV VDAMGLPL D LHL
Sbjct: 218 VADLREDLGLPNLLVIQVGLASGEGNYTDIVRDAQKSINLPNVILVDAMGLPLSNDQLHL 277
Query: 242 TTPAQ 246
+T AQ
Sbjct: 278 STEAQ 282
>gi|125588705|gb|EAZ29369.1| hypothetical protein OsJ_13439 [Oryza sativa Japonica Group]
Length = 253
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/226 (43%), Positives = 133/226 (58%), Gaps = 24/226 (10%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +L GQSNM GRGG TN WDG+VPP P HA
Sbjct: 32 IFLLGGQSNMGGRGGATNGP------WDGVVPPDSGGRKR-------------GSPFHAG 72
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
IDV+ GVGPG+ FA+A+ +P VIGLVPCA GGT I+ W +G+ LYE+M+ R +
Sbjct: 73 IDVHNVLGVGPGMSFAHALFRAIPPSTVIGLVPCAQGGTPIANWTRGTELYERMVGRGRA 132
Query: 146 ALRGGGTIRA-----VLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
A+ G +LWYQGE+DT+ EDA++Y + + D+R DL P L +I+V
Sbjct: 133 AMATAGAGAGARMGALLWYQGEADTIRREDAEVYARKMEGMVRDVRRDLALPELLVIQVG 192
Query: 201 LASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+A+G+G F+E VR+AQ + LP ++ VDA GLP+ D HLTTPAQ
Sbjct: 193 IATGQGKFVEPVREAQKAVRLPFLKYVDAKGLPIANDYTHLTTPAQ 238
>gi|414587838|tpg|DAA38409.1| TPA: hypothetical protein ZEAMMB73_482423 [Zea mays]
Length = 218
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/179 (50%), Positives = 116/179 (64%), Gaps = 12/179 (6%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L+GQSNMAGRGGV + WDG+VPP+C P+PSILRL++ +W A EPLHA
Sbjct: 2 RIFVLSGQSNMAGRGGVHHKH------WDGVVPPECAPDPSILRLSSAQQWEEAREPLHA 55
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPN-----FGVIGLVPCAIGGTNISQWRKGSSLYEQM 139
DID KT G+GPG+ FA AVL+ + IGLVPCA+GGT I +W G LYEQM
Sbjct: 56 DIDTTKTCGIGPGMAFARAVLSSLQEDTPGAAAQIGLVPCAVGGTAIREWSLGKHLYEQM 115
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ RA+VA G I A+LWYQGESD + D Y E + ++R+DL P LP I+
Sbjct: 116 VSRARVATL-YGEIEAILWYQGESDAESDADTSAYLENVERLICNVRADLGMPQLPFIQ 173
>gi|296090449|emb|CBI40268.3| unnamed protein product [Vitis vinifera]
Length = 168
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 83/149 (55%), Positives = 108/149 (72%), Gaps = 3/149 (2%)
Query: 98 LPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVL 157
+ FANAV +V GV+GLVPCA+GGT I +W +G LYE M+ RA+ +++ GG I+A+L
Sbjct: 1 MSFANAVRKRV---GVLGLVPCAVGGTAIKEWARGQPLYENMVNRAKESVKSGGEIKALL 57
Query: 158 WYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQL 217
WYQGESDT + DAK YK+ + ++R DL SP LPII+VA+ASG+ ++E VR+AQ
Sbjct: 58 WYQGESDTSSYNDAKSYKDNMESLIQNVRQDLGSPSLPIIQVAIASGDSKYMERVREAQK 117
Query: 218 SSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
D PNV CVDA GLPL+ D LHLTT AQ
Sbjct: 118 EIDFPNVVCVDAKGLPLKEDHLHLTTEAQ 146
>gi|413932371|gb|AFW66922.1| hypothetical protein ZEAMMB73_339368 [Zea mays]
Length = 168
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 87/169 (51%), Positives = 113/169 (66%), Gaps = 8/169 (4%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDV-NKTNG 93
MAGRGGV + WDG+VP C P+P++LRL+ L+W A EPLHA ID N G
Sbjct: 1 MAGRGGVVANR------WDGVVPGDCAPSPAVLRLSPDLRWEEAREPLHAGIDAANHAVG 54
Query: 94 VGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGT 152
VGPG+ FANA+L G V+GLVPCA+GGT +++W +G+ LY +M++RA+VA+ GG
Sbjct: 55 VGPGMAFANALLRSGRAGGAVVGLVPCAVGGTRMAEWGRGTELYAEMLRRARVAVETGGR 114
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL 201
I A+LWYQGESDTV DA Y R M DLR+DL P L +I+V +
Sbjct: 115 IGALLWYQGESDTVRWSDATEYGRRMGMLVRDLRADLGIPHLLVIQVGV 163
>gi|359490112|ref|XP_003634034.1| PREDICTED: LOW QUALITY PROTEIN: probable carbohydrate esterase
At4g34215-like [Vitis vinifera]
Length = 177
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 106/172 (61%), Gaps = 10/172 (5%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
I++GQ NMAGR V + + WD +V P+C P+ SI RL A+L W A EPLHADID
Sbjct: 12 IISGQINMAGRDDVNDHHK-----WDEVVLPECNPDSSIPRLNAQLHWEFAREPLHADID 66
Query: 88 VNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVAL 147
K G+GP + F N V +V V+GLV C +GGT I +W G LYE M+ RA+ ++
Sbjct: 67 TKKACGMGPRMSFTNTVRKRV----VVGLVSCTVGGTAIKEWAPGQPLYENMVNRAKESM 122
Query: 148 RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
+ G I+A+LWYQ E DT + + K YK+ + ++R DL P LPII+V
Sbjct: 123 KSGWEIKALLWYQEERDTSSHNNTKSYKDNMESLIQNVRQDL-XPSLPIIQV 173
>gi|297722739|ref|NP_001173733.1| Os04g0110400 [Oryza sativa Japonica Group]
gi|255675120|dbj|BAH92461.1| Os04g0110400 [Oryza sativa Japonica Group]
Length = 252
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 115/184 (62%), Gaps = 12/184 (6%)
Query: 21 YQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
++ + L +LAGQSNMAGRG + L +P +LRL A +WV A
Sbjct: 48 HRPKLLFLLAGQSNMAGRGALARPLPPPYLP-----------HPRLLRLAASRRWVPAAP 96
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMI 140
PLHADID +KT G+GP +PFA+ +L + + V+GLVPCA+GGT I W +G LYE +
Sbjct: 97 PLHADIDTHKTCGLGPAMPFAHRLLLQTDSEEVLGLVPCAVGGTRIWMWARGQPLYEAAV 156
Query: 141 QRAQVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
RA+ A+ GGG I AVLW+QGESDT+ L+DA+ Y + + DLR+DL P L +I+V
Sbjct: 157 ARARAAVADGGGAIGAVLWFQGESDTIELDDARSYGGKMERLVADLRADLHLPNLLVIQV 216
Query: 200 ALAS 203
L S
Sbjct: 217 NLFS 220
>gi|449525474|ref|XP_004169742.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 174
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/131 (54%), Positives = 91/131 (69%), Gaps = 4/131 (3%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
+LC++L + + + + ILAGQSNMAGRGGV N+ + N L WDG+VPP+CQP P
Sbjct: 9 ILCVMLYGPS--LSGAASPKNIFILAGQSNMAGRGGVENNAQGN-LQWDGLVPPECQPQP 65
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGG 123
SILRL L+W +A EPLH ID+ +T G+GPG+ FA+ +L KV PN G +GLVPCA GG
Sbjct: 66 SILRLNPGLQWEIAREPLHLGIDIKRTPGIGPGIAFAHELLVKVGPNAGAVGLVPCARGG 125
Query: 124 TNISQWRKGSS 134
T I QW K S
Sbjct: 126 TLIEQWIKNPS 136
>gi|449450530|ref|XP_004143015.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 223
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 109/183 (59%), Gaps = 15/183 (8%)
Query: 66 ILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGGT 124
IL KL W+ A EPLH ID+ +T G+GPG+ FA+ +L K PN G +GLVPCA GGT
Sbjct: 3 ILTPQPKLLWI-AREPLHLGIDIKRTPGIGPGIAFAHELLVKAGPNAGAVGLVPCARGGT 61
Query: 125 NISQWRKGSS-----LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
I QW K S Y+ I+R + + + GG +RA+ W+QGESD + A YK+
Sbjct: 62 LIEQWIKNPSNPSATFYQNFIERIKASDKDGGVVRALFWFQGESDAAMNDTAIRYKDNLK 121
Query: 180 MFFTDLRSDLQSPLLPIIRVALA------SGEGPFIEIVRKAQ--LSSDLPNVRCVDAMG 231
FFTD+R D++ LPII V +A + + VR+AQ +S +LP+V +D++
Sbjct: 122 KFFTDIRDDIKPRFLPIIVVKIALYDFFRQHDTHNLPAVREAQEAVSKELPDVVAIDSLK 181
Query: 232 LPL 234
LP+
Sbjct: 182 LPI 184
>gi|223938605|ref|ZP_03630496.1| protein of unknown function DUF303 acetylesterase putative
[bacterium Ellin514]
gi|223892724|gb|EEF59194.1| protein of unknown function DUF303 acetylesterase putative
[bacterium Ellin514]
Length = 266
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 130/255 (50%), Gaps = 32/255 (12%)
Query: 16 PVKCQYQQQQLIILAGQSNMAGRGGVT-NDTRTNKLTWDGIVPPQCQPNPSILRLTAKLK 74
P K ++Q + +L GQSNMAGRG V DT T+ P +L L
Sbjct: 29 PSKGKFQ---IYLLMGQSNMAGRGKVGLEDTTTH---------------PRVLLLNTNNT 70
Query: 75 WVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS 134
W LA EP+ D + GVGPGL F ++ K N IGLVPCA+GGT +S+W++G
Sbjct: 71 WELAMEPVTKDRKAGR--GVGPGLAFGKSMAEKNSNV-TIGLVPCAVGGTPLSRWQRGGD 127
Query: 135 LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLL 194
LY + RA+VA++ G + VLW+QGE+D+ + A+ Y +R D R+D+ L
Sbjct: 128 LYSNAVARAKVAVK-DGALAGVLWHQGENDSSDKGLAESYGKRLSEMIHDFRTDVGQTNL 186
Query: 195 PIIRVALAS-------GEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
P++ + + PF V +A QL +P+ CV++ GL D +H T +
Sbjct: 187 PVVVGQIGEFLYERGPDKTPFARTVNEALKQLPGMVPHTACVESHGLDHLGDKVHFNTES 246
Query: 246 QGSTLNSWSNEALRV 260
Q ++ E LR+
Sbjct: 247 QHEMGRKYAAEMLRL 261
>gi|147807958|emb|CAN66317.1| hypothetical protein VITISV_038126 [Vitis vinifera]
Length = 130
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 80/108 (74%)
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
M+ RA+ +++ GG I+A+LWYQGESDT + DAK YK+ + ++R DL SP LPII+
Sbjct: 1 MVNRAKESVKSGGEIKALLWYQGESDTSSYNDAKSYKDNMESLIQNVRQDLGSPSLPIIQ 60
Query: 199 VALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
VA+ASG+ ++E VR+AQ D+PNV CVDA GLPL+ D LHLTT AQ
Sbjct: 61 VAIASGDSKYMERVREAQKEIDIPNVVCVDAKGLPLKEDHLHLTTEAQ 108
>gi|147854812|emb|CAN82802.1| hypothetical protein VITISV_002090 [Vitis vinifera]
Length = 130
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 60/108 (55%), Positives = 79/108 (73%)
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
M+ RA+ +++ GG I+A+LWYQGESDT + DAK YK+ + ++R DL SP LPII+
Sbjct: 1 MVNRAKESVKSGGEIKALLWYQGESDTSSYNDAKSYKDNMESLIQNVRQDLGSPSLPIIQ 60
Query: 199 VALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
VA+ASG+ ++E VR+AQ D PNV CVDA GLPL+ D LHLTT AQ
Sbjct: 61 VAIASGDSKYMERVREAQKEIDFPNVVCVDAKGLPLKEDHLHLTTEAQ 108
>gi|325103456|ref|YP_004273110.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324972304|gb|ADY51288.1| protein of unknown function DUF303 acetylesterase [Pedobacter
saltans DSM 12145]
Length = 269
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 136/258 (52%), Gaps = 38/258 (14%)
Query: 13 EAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAK 72
E +K Y L +L GQSNMAGRG + + T + + L A
Sbjct: 37 ETIDLKSGYD---LYLLVGQSNMAGRGVIEAEDTT--------------EHNRVFMLNAA 79
Query: 73 LKWVLAHEPLHADIDVNKTN-GVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
++VLA EPLH D K+N GVGPGL F A+ P IGL+P A+GGT IS W
Sbjct: 80 DEFVLAKEPLHFD----KSNRGVGPGLAFGKAMAEANPKI-KIGLIPAAVGGTKISYWEP 134
Query: 132 GSS--LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
G+S LYE+ I++A+VA++ GT++ ++W QGESD+ N +DA LYKER T R DL
Sbjct: 135 GNSRGLYEEAIRKAKVAMK-YGTLKGIVWQQGESDS-NTKDAPLYKERLLKLLTAFRKDL 192
Query: 190 QSPLLPIIRVALASGEGPFI-----EIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLT 242
+ LPI+ G G F+ ++V K+ + ++++ N +A L D LH
Sbjct: 193 GNNNLPIV----IGGLGDFLKSSQYKVVNKSLQETANEIGNAGFSEASTLGHIGDRLHFN 248
Query: 243 TPAQGSTLNSWSNEALRV 260
+ AQ N+ + L++
Sbjct: 249 SKAQRENGNNMAKAMLKL 266
>gi|116625011|ref|YP_827167.1| hypothetical protein Acid_5941 [Candidatus Solibacter usitatus
Ellin6076]
gi|116228173|gb|ABJ86882.1| protein of unknown function DUF303, acetylesterase putative
[Candidatus Solibacter usitatus Ellin6076]
Length = 252
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/234 (33%), Positives = 115/234 (49%), Gaps = 27/234 (11%)
Query: 22 QQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
Q ++ +L GQSNMAGRG V R QP P + L ++WV A +P
Sbjct: 17 QPHEIFLLIGQSNMAGRGVVEEQDR--------------QPIPRVFMLNKAMEWVPAIDP 62
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
+H D GVG F + PN IGLVP A GGT++ +W+ G LYE+ ++
Sbjct: 63 VH--FDKPDIAGVGLARTFGKVLAAADPN-ASIGLVPAAFGGTSLEEWKVGGKLYEEAVR 119
Query: 142 RAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL 201
RA+ A+ G +R +LW+QGE+D E A Y++R T LR+DL P +P++ L
Sbjct: 120 RAKFAM-SSGKLRGILWHQGEADAGKKELASSYRQRFSAMITQLRADLGEPDVPVVVGQL 178
Query: 202 -------ASGEGPFIEIVRK--AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
A+ PF +V + A + +P+ V + GL D LH +Q
Sbjct: 179 GEFLSESATPRSPFASVVDEQLATVPLTVPHSAFVSSNGLTSNADHLHFDARSQ 232
>gi|225164091|ref|ZP_03726373.1| hypothetical protein ObacDRAFT_6689 [Diplosphaera colitermitum
TAV2]
gi|224801297|gb|EEG19611.1| hypothetical protein ObacDRAFT_6689 [Diplosphaera colitermitum
TAV2]
Length = 282
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 117/236 (49%), Gaps = 34/236 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L +L GQSNMAGRG +T P P+P +L L +WV EPLH D
Sbjct: 46 LYLLVGQSNMAGRGKLT--------------PADRAPDPRVLVLGKDDQWVRQGEPLHFD 91
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQW------RKGSSLYEQM 139
K GVG G FA + + P VIGL+PCA+GGT S+W + G LYE
Sbjct: 92 ---KKEAGVGLGFTFAKRMADRSPGV-VIGLIPCAVGGTPQSRWMPGTDGKAGGDLYEAA 147
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
++RA++A + G ++ +LW+QGES+ +L A+ Y E + R DL P P +
Sbjct: 148 VRRAKIAQQAG-RLKGILWHQGESECGSLTKAQAYAEGLALIVAGFRRDLNVPDAPFVAG 206
Query: 200 AL-------ASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
L + G+ P+ +IV + +L + +P V + GL + D LH AQ
Sbjct: 207 ELGEFLYTRSGGKSPYAKIVNEQIDRLPTLVPGTAVVSSAGLAHKGDELHFDADAQ 262
>gi|171910491|ref|ZP_02925961.1| hypothetical protein VspiD_04945 [Verrucomicrobium spinosum DSM
4136]
Length = 650
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 122/249 (48%), Gaps = 39/249 (15%)
Query: 11 VSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLT 70
V+E+ P K + L +L GQSNMAGRG + + R ++ +L+ +
Sbjct: 407 VAESMPEKETFD---LYLLIGQSNMAGRGLLPLEDRLSR--------------ERVLKFS 449
Query: 71 AKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWR 130
A+ W EPLH D G G G+ FA + P IGL+PCA+GGT + +W
Sbjct: 450 ARNAWAPGVEPLHTDKPA--VAGAGLGMSFARQMAEAKPKV-TIGLIPCAVGGTPLDRWV 506
Query: 131 KGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQ 190
KG LY + RA+ A++ G ++ +LW+QGE+D+ + E A Y +R DLR+DL
Sbjct: 507 KGGDLYAAALVRAREAMK-SGNLKGILWHQGEADSGSEEKAGSYAQRLAGMVKDLRADLG 565
Query: 191 SPLLPIIRVALASGEGPFIEIVRK--------------AQLSSDLPNVRCVDAMGLPLEP 236
+ +P + L G F+E K A L +PN VD+ GL +
Sbjct: 566 AGDVPFVAGEL----GEFLERTNKEGRPSFWPVVNEQLATLPGLVPNADVVDSAGLKHKG 621
Query: 237 DGLHLTTPA 245
DG+H TP+
Sbjct: 622 DGVHFDTPS 630
>gi|436836251|ref|YP_007321467.1| putative carbohydrate esterase [Fibrella aestuarina BUZ 2]
gi|384067664|emb|CCH00874.1| putative carbohydrate esterase [Fibrella aestuarina BUZ 2]
Length = 268
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 89/247 (36%), Positives = 122/247 (49%), Gaps = 32/247 (12%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +LAGQSNMAGRG T+K QPNP IL L +WV+A EPLH
Sbjct: 36 HLYLLAGQSNMAGRGA---PAETDK-----------QPNPHILMLNQANQWVVATEPLH- 80
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D GVGPGL FA A+L IGL+P A+GG+ I W+ G S Y+
Sbjct: 81 -FDKPSVVGVGPGLAFARAMLA-ADTTAYIGLIPVAVGGSAIDSWQPGGYHDQTKSYPYD 138
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
++RA++AL GT+R +LW+QGESD+ E Y ++ R +L +P +P++
Sbjct: 139 DALRRAKIALP-SGTLRGILWHQGESDS-KPELVAGYDQKLITLINRFRQELAAPNVPVV 196
Query: 198 RVALAS---GEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNS 252
L + P + L + LP V C++A GL + D H TP+ L
Sbjct: 197 VGTLGDFYVRQNPAAAQINAQLRNLPTRLPVVACIEATGLTDKGDQTHFDTPS-ARELGR 255
Query: 253 WSNEALR 259
EA+R
Sbjct: 256 RYAEAMR 262
>gi|149177229|ref|ZP_01855835.1| probable acetyl xylan esterase AxeA [Planctomyces maris DSM 8797]
gi|148843943|gb|EDL58300.1| probable acetyl xylan esterase AxeA [Planctomyces maris DSM 8797]
Length = 278
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 25/246 (10%)
Query: 20 QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+ ++ + +L GQSNMAGRG V D +NK +P +L+L WV A
Sbjct: 46 EKEKFHIYLLIGQSNMAGRGKV--DPASNKA------------HPRVLKLDKAGNWVPAT 91
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQM 139
+PLH D K GVGPG F + P IGL+P A+GGT +S+W KG LYE+
Sbjct: 92 DPLH--FDKPKIAGVGPGSGFGPVIADAYPEV-TIGLIPAAVGGTPLSRWVKGGDLYERA 148
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
++ A+ + G I+ +W+QGE D+ N + Y++R DLR+DL P +P +
Sbjct: 149 VKLAKENQK-KGVIKGAIWHQGEGDSSNPKLYNSYQKRLSGMIADLRTDLGEPDMPFVMG 207
Query: 200 ALASGE---GPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWS 254
L GE P V +A ++ ++P + GLP + D +H ++ ++
Sbjct: 208 EL--GEFFTRPGAPTVNQALHGIAKEVPATAVASSKGLPAKSDQVHFNAESEREFGKRYA 265
Query: 255 NEALRV 260
+ L++
Sbjct: 266 AQMLKL 271
>gi|225164610|ref|ZP_03726855.1| hypothetical protein ObacDRAFT_6207 [Diplosphaera colitermitum
TAV2]
gi|224800776|gb|EEG19127.1| hypothetical protein ObacDRAFT_6207 [Diplosphaera colitermitum
TAV2]
Length = 301
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 77/232 (33%), Positives = 112/232 (48%), Gaps = 32/232 (13%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L +L GQSNM+GRG VT P QP+ +L L +W+L EP+H D
Sbjct: 53 LYLLVGQSNMSGRGRVT--------------PADSQPDTRVLVLGKDGEWLLQGEPVHFD 98
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
+ VG G FA + P IGL+PCA+G T +W G LYE+ ++RA +
Sbjct: 99 ---TRNAAVGLGFAFAKRMADHSPGV-TIGLIPCAVGATPQKRWMPGGDLYEEAVRRAGI 154
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE 205
A + G +R +LW+QGES+T +L +K Y E R DL +P +P + L GE
Sbjct: 155 AQQ-SGRLRGILWHQGESETGSLVRSKAYGENLAKIVEGFRRDLNAPGVPFVAGEL--GE 211
Query: 206 GPFIEIVRKA-----------QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+++ +A +L + +PN + + GL DG H AQ
Sbjct: 212 FLYMKSEERAANAKIVNEQINRLPALVPNTAVIPSAGLGHRGDGTHFNAEAQ 263
>gi|373853828|ref|ZP_09596627.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
gi|372473355|gb|EHP33366.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
Length = 296
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 122/249 (48%), Gaps = 31/249 (12%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L +L GQSNMAGRG +T+ R P+P +L + W L EP+H
Sbjct: 63 LYLLVGQSNMAGRGPLTDADRA--------------PDPRVLVFGPEDAWQLQGEPVH-- 106
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
D K GVG G FA + + P IGL+PCA+GGT S+W G LYE+ ++RA++
Sbjct: 107 FDKPKAAGVGLGFTFAKLMAAQKPGV-TIGLIPCAVGGTPQSRWMPGGDLYEEAVRRARL 165
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL---- 201
A + G +R +LW+QGES+ + A+ Y R DL +P +P + L
Sbjct: 166 A-QPSGKLRGILWHQGESECGSETKARAYAANLAKIVAGFRRDLDAPDVPFVAGELGEFL 224
Query: 202 ---ASGEGPFIEIVRKAQLSSDLPNV----RCVDAMGLPLEPDGLHLTTPAQGSTLNSWS 254
++ + P+ +V + Q+ S LP + V + GL + D LH + AQ ++
Sbjct: 225 YTRSANKSPWARVVNE-QIDS-LPTLVAAAATVPSHGLAHKGDELHFGSAAQREFGKRYA 282
Query: 255 NEALRVNLS 263
+R+ +
Sbjct: 283 EAMIRLQTA 291
>gi|391229092|ref|ZP_10265298.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
gi|391218753|gb|EIP97173.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
Length = 299
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 122/249 (48%), Gaps = 31/249 (12%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L +L GQSNMAGRG +T+ R P+P +L + W L EP+H
Sbjct: 66 LYLLVGQSNMAGRGPLTDADRA--------------PDPRVLVFGPEDAWQLQGEPVH-- 109
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
D K GVG G FA + + P IGL+PCA+GGT S+W G LYE+ ++RA++
Sbjct: 110 FDKPKAAGVGLGFTFAKLMAAQKPGV-TIGLIPCAVGGTPQSRWMPGGDLYEEAVRRARL 168
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL---- 201
A + G +R +LW+QGES+ + A+ Y R DL +P +P + L
Sbjct: 169 A-QPSGKLRGILWHQGESECGSETKARAYAANLAKIVAGFRRDLGAPDVPFVAGELGEFL 227
Query: 202 ---ASGEGPFIEIVRKAQLSSDLPNV----RCVDAMGLPLEPDGLHLTTPAQGSTLNSWS 254
++ + P+ +V + Q+ S LP + V + GL + D LH + AQ ++
Sbjct: 228 YTRSANKSPWARVVNE-QIDS-LPTLVAAAATVPSHGLAHKGDELHFGSAAQREFGKRYA 285
Query: 255 NEALRVNLS 263
+R+ +
Sbjct: 286 EAMIRLQTA 294
>gi|254445610|ref|ZP_05059086.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
gi|198259918|gb|EDY84226.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
Length = 265
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 116/234 (49%), Gaps = 34/234 (14%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
LI+LAGQSNMAGRG + P+ + NP +L L + +WV+A +PLH
Sbjct: 36 HLILLAGQSNMAGRGDMEG--------------PRVESNPQVLALDKEGRWVVAKDPLHW 81
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSL-------YE 137
D V GVG GL FA L P IGL+P A GG+ IS W G+ Y+
Sbjct: 82 DKSV---AGVGLGLSFAREYLKDHPGV-TIGLIPAACGGSPISSWEAGAYFDQTDSHPYD 137
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDT-VNLEDAKLYKERSDMFFTDLRSDLQSPLLPI 196
++R A + GT++ VLW+QGESD+ L D LY+ + + R + LP+
Sbjct: 138 DALKRVSRATQ-DGTLKGVLWHQGESDSHEGLSD--LYEAKLEGLIKRFRVEWDREDLPV 194
Query: 197 IRVALASGE---GPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I L E G IE V +A +++ L +V V + L + D LH ++ A
Sbjct: 195 ILGQLGQFEVKWGKHIEEVNRATKRVAKRLEHVGFVSSKNLESKGDALHFSSAA 248
>gi|430747851|ref|YP_007206980.1| hypothetical protein Sinac_7238 [Singulisphaera acidiphila DSM
18658]
gi|430019571|gb|AGA31285.1| protein of unknown function (DUF303) [Singulisphaera acidiphila DSM
18658]
Length = 539
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/258 (35%), Positives = 124/258 (48%), Gaps = 67/258 (25%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +LAGQSNM G G +T+ + PP + + L KWV A EPLH
Sbjct: 151 DLWVLAGQSNMEGVGNLTD-----------VTPPSDR----VAALGMDGKWVKAEEPLHW 195
Query: 85 DIDV---------------------NKTNGVGPGLPF--ANAVLTKVPNFGVIGLVPCAI 121
+D ++T G G GLPF A A T VP +GLV CA
Sbjct: 196 LVDSPDPVHSGNPDDREARSKAAHRDRTKGAGLGLPFGVAMAAATNVP----VGLVVCAH 251
Query: 122 GGTNISQW---RKG---SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYK 175
GGT++ QW RKG +SLY MI++ ++A GG +R +LWYQGESD + AK +
Sbjct: 252 GGTSMEQWDPARKGEGGNSLYGSMIRQIKLA---GGKVRGILWYQGESDAMQPAAAK-FA 307
Query: 176 ERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFI--------EIVRKAQ--LSSDLPNVR 225
E F +R+DL P LP V + G F+ +VR AQ ++ +PN
Sbjct: 308 ENFTKFIGAVRADLDQPELPFYYVQI----GRFVAAVDPQGWHVVRDAQRLIADKVPNTA 363
Query: 226 CVDAMGLPLEPDGLHLTT 243
V A+ L L+ D +H+ T
Sbjct: 364 VVTAIDLELD-DLIHVGT 380
>gi|56962379|ref|YP_174104.1| hypothetical protein ABC0603 [Bacillus clausii KSM-K16]
gi|56908616|dbj|BAD63143.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 283
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 111/230 (48%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG V + VPP + +LR +W + EPL+ D
Sbjct: 4 ILLIGQSNMAGRGFVKD------------VPPIYNEHIHMLR---NGRWQMMAEPLNFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+GP FA A T P IGL+PCA GG++I +W S L I A A
Sbjct: 49 HVS---GIGPAASFAQAWTTDHPG-ESIGLIPCAEGGSSIDEWTMDSPLTRHAISEATFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
I A+LW+QGESD+ E K Y+ + FT LR +L P +PII L G
Sbjct: 105 TETSELI-AILWHQGESDSFG-ERFKTYENKLLSLFTHLREELNVPDIPIIIGELGHYLG 162
Query: 205 EGPF----IEIVRKAQ----LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
E F +E + Q ++ + N V + GL PDG+H+ +Q
Sbjct: 163 ERGFGENAVEFKQINQILYKIAHNEENCYFVTSKGLTANPDGIHIDAISQ 212
>gi|311748107|ref|ZP_07721892.1| probable acetyl xylan esterase AxeA [Algoriphagus sp. PR1]
gi|126574751|gb|EAZ79132.1| probable acetyl xylan esterase AxeA [Algoriphagus sp. PR1]
Length = 274
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 117/239 (48%), Gaps = 33/239 (13%)
Query: 18 KCQYQQQQLIILAGQSNMAGRGGVTN-DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWV 76
K + + L +L GQSNMAGRG V DT ++ P + L + + WV
Sbjct: 30 KSEKENFHLYLLMGQSNMAGRGLVEAIDTLSH---------------PRVWMLDSTMNWV 74
Query: 77 LAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSL- 135
LA +P+H D V GVG GL F + + P+ IGL+P A+GG++I+ W K S
Sbjct: 75 LARDPMHFDKPVA---GVGLGLTFGKIMANENPSVK-IGLIPTAVGGSSINAWFKDSIHN 130
Query: 136 ------YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
Y MI RA+ AL G GT++ +LW+QGESDT N E Y + L+ DL
Sbjct: 131 QTKTFPYNDMIDRAKKAL-GDGTLKGILWHQGESDTRNEESIANYPAKFYAMIDSLQKDL 189
Query: 190 QSPLLPIIRVALAS---GEGPFIEIVRK--AQLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+PI+ + G P + + +Q++S+ P + V + GL + D H +
Sbjct: 190 GIEPVPIVMGEIGHFFYGRAPLAKNMNDTFSQIASENPCIDLVRSDGLNHKGDSTHFDS 248
>gi|449133716|ref|ZP_21769240.1| protein of unknown function acetylesterase [Rhodopirellula europaea
6C]
gi|448887592|gb|EMB17957.1| protein of unknown function acetylesterase [Rhodopirellula europaea
6C]
Length = 286
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 118/243 (48%), Gaps = 32/243 (13%)
Query: 16 PVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKW 75
P + Q L +LAGQSNMAGRG ++++ QP+P +L L +W
Sbjct: 42 PEQLQPTDLHLFLLAGQSNMAGRGKISDED--------------LQPHPRVLVLNKAGEW 87
Query: 76 VLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG--- 132
V A PLH D GVG G FA + P +GL+PCA+GG+++ W+ G
Sbjct: 88 VPAVAPLH--FDKPGIAGVGLGRTFAIDYAEQNPQI-TVGLIPCAVGGSSLDAWQPGGFH 144
Query: 133 ----SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSD 188
S Y+ ++R + AL G ++ +LW+QGESD+ + +K Y+ + D F R +
Sbjct: 145 KSTQSHPYDDCMKRMRQALN-AGELKGILWHQGESDSTPTK-SKTYQSKLDELFERFRKE 202
Query: 189 LQSPLLPIIRVALAS-GEGPFIE---IVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLT 242
SP +PI+ L E P+ E +V +A L + N V + GL + D H +
Sbjct: 203 FDSPDVPIVIGQLGQFPEKPWDESRQLVDQAHQTLPERMTNTAFVHSDGLQHKGDQTHFS 262
Query: 243 TPA 245
A
Sbjct: 263 AEA 265
>gi|354807829|ref|ZP_09041283.1| acetylxylan esterase related enzyme [Lactobacillus curvatus CRL
705]
gi|354513672|gb|EHE85665.1| acetylxylan esterase related enzyme [Lactobacillus curvatus CRL
705]
Length = 283
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 110/230 (47%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + + VP +LR +W + EP+H D
Sbjct: 5 ILLVGQSNMAGRGFIQD------------VPGLRHERVKMLR---NGRWQMMAEPIHFDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
+V GVGP FA A + P+ +GL+PCA GG+ I +W L I A+ A
Sbjct: 50 EVA---GVGPAASFAAAWVQAHPD-EELGLIPCAEGGSTIDEWASDELLMRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
I VLW+QGESD++N + Y + F LR+ L P LPII
Sbjct: 106 QESSELI-GVLWHQGESDSLN-GGYQTYAAKLTAVFNHLRAALDQPDLPIIAGQLPAFLG 163
Query: 198 RVALASGEGPFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+V + F EI R+ AQ+ + P+ V+A L PDG+H+ + +Q
Sbjct: 164 KVGFGASATEFNEINREMAQVVAQDPHSYLVNAAELTANPDGIHIDSASQ 213
>gi|404417114|ref|ZP_10998922.1| hypothetical protein SARL_04556 [Staphylococcus arlettae CVD059]
gi|403490548|gb|EJY96085.1| hypothetical protein SARL_04556 [Staphylococcus arlettae CVD059]
Length = 283
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 104/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG +T V P +L+ +W EP+H D
Sbjct: 4 ILLVGQSNMAGRGFMTE------------VEPIINERIKVLK---NGRWQFMEEPIHQDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA + PN +GL+PCA GGT+I W L I A A
Sbjct: 49 AVA---GIGPAAAFAQLWVEAHPN-ETLGLIPCADGGTSIDDWAPDQILTRHAISEAHFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I VLW+QGESD+ N + + Y+E+ F T LR L P LP+I L G
Sbjct: 105 METSELI-GVLWHQGESDSNN-DKFQNYQEKLQQFITHLRQALGQPELPVILGGLGDYLG 162
Query: 205 EGPFIEIVRKAQ--------LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F + + Q +S P+ V GL PDG+H+ +Q
Sbjct: 163 QSGFGQSATQYQEINKIIQSVSHSEPHCHFVTGQGLQPNPDGIHINARSQ 212
>gi|392965995|ref|ZP_10331414.1| protein of unknown function DUF303 acetylesterase putative
[Fibrisoma limi BUZ 3]
gi|387845059|emb|CCH53460.1| protein of unknown function DUF303 acetylesterase putative
[Fibrisoma limi BUZ 3]
Length = 260
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 110/239 (46%), Gaps = 43/239 (17%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+L +L GQSNMAGRG + QP + T + WV A EP+H
Sbjct: 26 RLFLLIGQSNMAGRGLPEAQDQ--------------QPVDRVWMFTKEDTWVPAREPMH- 70
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D GVGPG F + PN IGL+PCA+GG+ I W+ G S Y+
Sbjct: 71 -FDKPAVVGVGPGFAFGRRLAEAFPNEN-IGLIPCAVGGSGIDVWQPGAYYEPTKSYPYD 128
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
++RA+ AL G G + +LW+QGESD+ E A Y + LR +L +P +P +
Sbjct: 129 DALRRAKKAL-GNGELAGILWHQGESDS-QPEKAPAYGAKLAELIQRLRRELNAPNVPFV 186
Query: 198 RVALASGEGPFIEIVRK-----------AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
L G F IVR+ Q+ +P+ CV + GL + D H TP+
Sbjct: 187 VGTL----GDF--IVRRNPDAGVINATLQQMPGRVPDTYCVVSEGLTHKGDSTHFDTPS 239
>gi|440715172|ref|ZP_20895727.1| protein of unknown function acetylesterase [Rhodopirellula baltica
SWK14]
gi|436439894|gb|ELP33287.1| protein of unknown function acetylesterase [Rhodopirellula baltica
SWK14]
Length = 286
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 113/234 (48%), Gaps = 32/234 (13%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +LAGQSNMAGRG + +D QP+P +L +W A PLH
Sbjct: 51 HLFLLAGQSNMAGRGKIADD--------------DLQPHPRVLVFNKAGEWAPAIAPLH- 95
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D GVG G FA P +GL+PCA+GG+++ W+ G + Y+
Sbjct: 96 -FDKPGIAGVGLGRTFAIEYAENNPQV-TVGLIPCAVGGSSLDAWQPGGFHESTNTHPYD 153
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
++R Q A+ G ++ +LW+QGESD+ N +K Y+ + D F R++L SP +PI+
Sbjct: 154 DCMKRMQQAIV-AGELKGILWHQGESDS-NPALSKTYQSKLDELFERFRTELDSPNVPIV 211
Query: 198 RVALAS-GEGPFIE---IVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
L E P+ E +V +A L + N V + GL + D H + A
Sbjct: 212 IGQLGQFTEKPWDESRKLVDQAHRSLPDRMTNTVFVHSDGLEHKGDQTHFSAEA 265
>gi|284037442|ref|YP_003387372.1| hypothetical protein Slin_2555 [Spirosoma linguale DSM 74]
gi|283816735|gb|ADB38573.1| protein of unknown function DUF303 acetylesterase putative
[Spirosoma linguale DSM 74]
Length = 264
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 111/236 (47%), Gaps = 41/236 (17%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+L +L GQSNMAGRG + + QP+ I LT + WV A +PLH
Sbjct: 31 KLFLLIGQSNMAGRGIPEAEDK--------------QPHQRIWMLTKEQTWVPARDPLH- 75
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSL-------YE 137
D GVGPGL FA ++ IGL+PCA GG+ I W G+ Y+
Sbjct: 76 -FDKPAVIGVGPGLAFAQKLVNADKKVN-IGLIPCAQGGSGIDVWVPGAYYAATKSYPYD 133
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
I+RA+ AL G + +LW+QGESD+ E A +Y E+ + +R+DLQ+ +P
Sbjct: 134 DAIKRAKKALE-TGELAGILWHQGESDS-QTEKAAVYGEKLTALVSRIRTDLQAENVPFF 191
Query: 198 RVALASGEGPF----------IEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
L G F I + +A L +PN+ V A GL + D H T
Sbjct: 192 VGTL----GDFYVQKHPVAAQINTILEA-LPKTIPNMYAVSASGLTDKGDTTHFDT 242
>gi|158335342|ref|YP_001516514.1| hypothetical protein AM1_2187 [Acaryochloris marina MBIC11017]
gi|158305583|gb|ABW27200.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 302
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 129/256 (50%), Gaps = 38/256 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA- 84
L +LAGQSNM GRG + D ++K +P + +W LA +PL +
Sbjct: 59 LYVLAGQSNMTGRGPL--DAESSK------------THPQVFVFGNDYRWHLAKDPLDSI 104
Query: 85 DIDVN------KTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG---SSL 135
D V+ K GVGPG+ FA+A+L K VIGL+PCA GG+ I +W++ +SL
Sbjct: 105 DGQVDPVSQEGKAPGVGPGMTFASALL-KHDKDAVIGLIPCARGGSTIQEWQRNLSENSL 163
Query: 136 YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLE-------DAKLYKERSDMFFTDLRSD 188
Y ++R + A G + +L++QGE+D ++ + + + ++ + F R D
Sbjct: 164 YGSCLKRLRAA-SLMGQLEGMLFFQGEADALDQKQFSHLSLSPQQWSKKFEKFIESFRLD 222
Query: 189 LQSPLLPIIRVALASGEGPFI----EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTP 244
+ LPI+ + S + P + +V+K Q + LP+V + L LE D +H TT
Sbjct: 223 TKQENLPIVFAQIGSHDAPNLLTQWNVVKKQQENIQLPHVAMITTDDLALE-DYVHYTTK 281
Query: 245 AQGSTLNSWSNEALRV 260
+ + ++N +++
Sbjct: 282 SYRTIGQRFANAYIKL 297
>gi|87309203|ref|ZP_01091340.1| probable acetyl xylan esterase AxeA [Blastopirellula marina DSM
3645]
gi|87288194|gb|EAQ80091.1| probable acetyl xylan esterase AxeA [Blastopirellula marina DSM
3645]
Length = 270
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 115/251 (45%), Gaps = 35/251 (13%)
Query: 6 LCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPS 65
C S P K ++Q L +L GQSNMAGRG V + + NP
Sbjct: 21 FCAEPTSVTLPPKEKFQ---LFLLIGQSNMAGRGKVEAQDK--------------EINPR 63
Query: 66 ILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTN 125
+L L +WV A +P+H D GVG G F + P +GL+PCA+GGT
Sbjct: 64 VLTLNKAGQWVPAVDPIH--FDKPGIAGVGLGRTFGLEIANANPEI-TVGLIPCAVGGTP 120
Query: 126 ISQWRKG-------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
I +W G S Y+ + RA+ AL G + +LW+QGE D+ N AK+Y+++
Sbjct: 121 IDRWTPGAYDKPTKSHPYDDALPRAKQALE-SGVLCGILWHQGEGDS-NPAKAKVYEQKL 178
Query: 179 DMFFTDLRSDLQSPLLPIIRVALAS-GEGPFIEIVRKA-----QLSSDLPNVRCVDAMGL 232
D T +R +L +P +P + L E P+ + ++ ++ PN V GL
Sbjct: 179 DELVTRVRKELDAPEVPFLVGQLGVFEERPWDDAKKQVDAAQRHYAASHPNAAFVSGEGL 238
Query: 233 PLEPDGLHLTT 243
+ D +H
Sbjct: 239 THKGDKVHFNA 249
>gi|359459101|ref|ZP_09247664.1| hypothetical protein ACCM5_10248 [Acaryochloris sp. CCMEE 5410]
Length = 302
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 128/256 (50%), Gaps = 38/256 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA- 84
L +LAGQSNM GRG + D ++K +P + +W LA +PL +
Sbjct: 59 LYVLAGQSNMTGRGPL--DAESSK------------THPQVFVFGNDYRWHLAKDPLDSI 104
Query: 85 DIDVN------KTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG---SSL 135
D V+ K GVGPG+ FA+A+L K VIGL+PCA GG+ I +W++ +SL
Sbjct: 105 DGQVDPVSQEGKAPGVGPGMTFASALL-KHDKDAVIGLIPCARGGSTIQEWQRNLSENSL 163
Query: 136 YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED-------AKLYKERSDMFFTDLRSD 188
Y ++R + A G + +L++QGE+D ++ + + + ++ + F R D
Sbjct: 164 YGSCLKRLRAA-SLMGQLEGMLFFQGEADALDQKQFSHLSLSPQQWSKKFEKFIESFRLD 222
Query: 189 LQSPLLPIIRVALASGEGPFI----EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTP 244
+ LPI+ + S + P + +V+K Q + LP V + L LE D +H TT
Sbjct: 223 TKQENLPIVFAQIGSHDAPDLLTQWNVVKKQQENIQLPQVAMITTDDLALE-DYVHYTTK 281
Query: 245 AQGSTLNSWSNEALRV 260
+ + ++N +++
Sbjct: 282 SYRTIGQRFANAYIKL 297
>gi|325109293|ref|YP_004270361.1| hypothetical protein Plabr_2739 [Planctomyces brasiliensis DSM
5305]
gi|324969561|gb|ADY60339.1| protein of unknown function DUF303 acetylesterase [Planctomyces
brasiliensis DSM 5305]
Length = 265
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 93/180 (51%), Gaps = 27/180 (15%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +L GQSNMAGRG V + + +P +L LT +W A +PLH
Sbjct: 34 HLFLLIGQSNMAGRGTVEASDK--------------EAHPRVLALTKANEWDYARDPLHF 79
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D + GVG G F V P+ IGL+PCA+GG++I+ W G S Y+
Sbjct: 80 DKPIA---GVGLGRTFGLEVAKAQPDV-TIGLIPCAVGGSSITAWVPGGYHDQTKSHPYD 135
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
M++R +VAL+ GT++ +LW+QGESD+ N A YK+ + T LR L + +P
Sbjct: 136 DMLKRCEVALK-AGTLKGILWHQGESDS-NPNRAPEYKQDLEDLMTRLRKQLDAEDVPFF 193
>gi|332704970|ref|ZP_08425056.1| uncharacterized DUF303 domain protein [Moorea producens 3L]
gi|332356322|gb|EGJ35776.1| uncharacterized DUF303 domain protein [Moorea producens 3L]
Length = 303
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/240 (31%), Positives = 112/240 (46%), Gaps = 37/240 (15%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA- 84
L ILAGQSNM+G G +T P +P++ +W L EP+ +
Sbjct: 63 LFILAGQSNMSGTGKLT--------------PASSVTHPNVFVFGNDYRWHLGKEPIDSP 108
Query: 85 -----DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG---SSLY 136
+ +K+ GVGPG+ FA +L P +IGL+PCA GT I QW++ +LY
Sbjct: 109 SGQVDKVSEDKSAGVGPGMAFATELLKYNPEL-IIGLIPCAKSGTAIQQWQRSLSEDTLY 167
Query: 137 EQMIQRAQVALRGGGTIRAVLWYQGESD----TVNLEDAKLYKERSDMFFT---DLRSDL 189
++R A G I +L++QGE D + + E + +D F T D R DL
Sbjct: 168 GSCLKRVGAA-SVMGEITGILFFQGEKDAQKPSQDDEITFFPNQWADKFVTLVKDFRQDL 226
Query: 190 QSPLLPIIRVALASGEGPFI----EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
P LP++ + + P E V+ Q + LP R + L L+ D +HLTT +
Sbjct: 227 GKPELPVVFAQIGTTTDPEKLPNWETVKAQQETVQLPATRMITTDDLALQ-DYVHLTTES 285
>gi|384245750|gb|EIE19243.1| hypothetical protein COCSUDRAFT_83591 [Coccomyxa subellipsoidea
C-169]
Length = 159
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/118 (45%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWD--GIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ IL GQSNM+GRGGV K+ + P +P +L A WV A EP+H
Sbjct: 17 VYILGGQSNMSGRGGVERFPDGTKVFDEEASKYPVAVGADPRVLCFNAAGHWVEAREPMH 76
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWRKGSSLYEQMI 140
ADID K GVGPGL FA +L + + G IGLVPCA+GGT + QW G++L++QM+
Sbjct: 77 ADIDTTKVTGVGPGLIFAKELLALLRSPGQQIGLVPCAVGGTCMDQWLPGTALFQQMV 134
>gi|296330504|ref|ZP_06872983.1| hypothetical protein BSU6633_05374 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|305674711|ref|YP_003866383.1| acetylesterase [Bacillus subtilis subsp. spizizenii str. W23]
gi|296152401|gb|EFG93271.1| hypothetical protein BSU6633_05374 [Bacillus subtilis subsp.
spizizenii ATCC 6633]
gi|305412955|gb|ADM38074.1| possible acetylesterase [Bacillus subtilis subsp. spizizenii str.
W23]
Length = 282
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/230 (33%), Positives = 111/230 (48%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + + VPP ++LR +W + EPL+ D
Sbjct: 4 ILLIGQSNMAGRGFIED------------VPPIYNERINMLR---NGRWQMMAEPLNFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVGP FA A P IG++PCA GG++I +W L I A+ A
Sbjct: 49 HVS---GVGPAASFAQAWTEDHPG-ESIGVIPCAEGGSSIDEWAIDGLLTRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ + +LW+QGESD+ E K Y+++ F LR +L +P +PII L G
Sbjct: 105 METSELV-GILWHQGESDSYG-ERYKTYEDKLLSLFKHLREELNAPDIPIIIGELGHYLG 162
Query: 205 EGPF----IEIVRKAQLSSDL----PNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F +E + Q+ S + N V + GL PDG+H+ +Q
Sbjct: 163 DVGFGKSAVEYKQINQILSKVAHAEKNCYFVTSKGLTANPDGIHIDAVSQ 212
>gi|32473459|ref|NP_866453.1| acetyl xylan esterase AxeA [Rhodopirellula baltica SH 1]
gi|32398139|emb|CAD78234.1| probable acetyl xylan esterase AxeA [Rhodopirellula baltica SH 1]
Length = 298
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 113/234 (48%), Gaps = 32/234 (13%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +LAGQSNMAGRG + ++ QP+P +L +W A PLH
Sbjct: 63 HLFLLAGQSNMAGRGKIADE--------------DLQPHPRVLVFNKAGEWAPAIAPLH- 107
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D + GVG G FA P +GL+PCA+GG+++ W+ G + Y+
Sbjct: 108 -FDKPRIAGVGLGRTFAIEYAENNPQ-ATVGLIPCAVGGSSLDVWQPGGFHESTNTHPYD 165
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
++R Q A+ G ++ +LW+QGESD+ N +K Y+ + + F R++ SP +PI+
Sbjct: 166 DCMKRMQQAIV-AGELKGILWHQGESDS-NPALSKTYQSKLNELFERFRTEFGSPNVPIV 223
Query: 198 RVALAS-GEGPFIE---IVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
L E P+ E +V +A L + N V + GL + D H + A
Sbjct: 224 IGQLGQFTEKPWDESRKLVDQAHRTLPDRMTNTVFVHSDGLGHKGDQTHFSAEA 277
>gi|298246863|ref|ZP_06970668.1| protein of unknown function DUF303 acetylesterase putative
[Ktedonobacter racemifer DSM 44963]
gi|297549522|gb|EFH83388.1| protein of unknown function DUF303 acetylesterase putative
[Ktedonobacter racemifer DSM 44963]
Length = 403
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 84/256 (32%), Positives = 118/256 (46%), Gaps = 61/256 (23%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH-- 83
L +LAGQSNM G G +T D T P+P + L ++ +W +A EPLH
Sbjct: 51 LWVLAGQSNMEGVGNLT-DVET--------------PSPFVHSLQSREEWAMAEEPLHWP 95
Query: 84 -------------ADI--------DVNKTNGVGPGLPFANA--VLTKVPNFGVIGLVPCA 120
AD D +T G G GL FA + T VP IGL+P A
Sbjct: 96 NESPRIIHHKLMGADAVPHPLPSHDPMRTTGAGLGLAFAKERYIRTGVP----IGLIPAA 151
Query: 121 IGGTNISQW------RKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLY 174
GGT++ QW + +SLY +++R + GG + VLWYQGES+T +LE+ + Y
Sbjct: 152 HGGTSLEQWDPELREQGDASLYGALLKRIEGV---GGKVAGVLWYQGESETSSLENIERY 208
Query: 175 KERSDMFFTDLRSDLQSPLLPIIRVALA-----SGEGPFIEIVRKAQLSSD--LPNVRCV 227
R LR DLQ P LP V + + +R+AQ + L + V
Sbjct: 209 HRRMHALLKALRRDLQQPDLPFYYVQIGCTVSYDADAKNWNGIREAQRTWPLLLSHTAMV 268
Query: 228 DAMGLPLEPDGLHLTT 243
A+ L L+ D +H+ T
Sbjct: 269 SAIDLELD-DSIHIGT 283
>gi|422330133|ref|ZP_16411157.1| hypothetical protein HMPREF0981_04477 [Erysipelotrichaceae
bacterium 6_1_45]
gi|371655224|gb|EHO20580.1| hypothetical protein HMPREF0981_04477 [Erysipelotrichaceae
bacterium 6_1_45]
Length = 276
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + T P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLNEAT------------PIYNENIFMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVGP FA A N IGL+PCA GG++I +W K +L+ + A+ A
Sbjct: 49 SVS---GVGPAASFAQAWCNANKN-EQIGLIPCAEGGSSIDEWNKEGALFRHAVSEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L++ +P I L G
Sbjct: 105 MENSELI-AILWHQGESDS-HSGKYKNYYQKLNVLVNSFRKELEALEVPFIAGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F +++ + L N C G L PDG+H+ +Q
Sbjct: 163 KSGFGRSCVEYDLINQELLKYAEYNRNCYFVTGEKLYPNPDGIHINAESQ 212
>gi|373121931|ref|ZP_09535798.1| hypothetical protein HMPREF0982_00727 [Erysipelotrichaceae
bacterium 21_3]
gi|371664910|gb|EHO30079.1| hypothetical protein HMPREF0982_00727 [Erysipelotrichaceae
bacterium 21_3]
Length = 276
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + T P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLNEAT------------PIYNENIFMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVGP FA A N IGL+PCA GG++I +W K +L+ + A+ A
Sbjct: 49 SVS---GVGPAASFAQAWCNANKN-EQIGLIPCAEGGSSIDEWNKEGALFRHAVSEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L++ +P I L G
Sbjct: 105 MENSELI-AILWHQGESDS-HSGKYKNYYQKLNVLVNSFRKELEALEVPFIAGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F +++ + L N C G L PDG+H+ +Q
Sbjct: 163 KSGFGRSCVEYDLINQELLKYAEYNRNCYFVTGEKLYPNPDGIHINAESQ 212
>gi|384045787|ref|YP_005493804.1| acetylxylan esterase enzyme [Bacillus megaterium WSH-002]
gi|345443478|gb|AEN88495.1| Acetylxylan esterase enzyme [Bacillus megaterium WSH-002]
Length = 290
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 110/230 (47%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + + VPP + +LR +W EPL+ D
Sbjct: 12 ILLIGQSNMAGRGFIED------------VPPIYNEHIKMLR---NGRWQTMAEPLNFDR 56
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
++ GVGP FA A P IG++PCA GG++I +W L I A+ A
Sbjct: 57 HIS---GVGPAASFAQAWTEDHPG-ESIGVIPCAEGGSSIDEWTIDGLLTRHAISEAKFA 112
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ + +LW+QGESD+ E K Y+++ F LR +L +P +PII L G
Sbjct: 113 METSELV-GILWHQGESDSYG-ERYKTYEDKLLSLFKHLREELNAPDIPIIIGELGHYLG 170
Query: 205 EGPF----IEIVRKAQLSSDL----PNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F +E + Q+ S + N V + GL PDG+H+ +Q
Sbjct: 171 DVGFGKSAVEYKQINQILSKVAHTEKNCYFVTSKGLTANPDGIHIDAVSQ 220
>gi|332704971|ref|ZP_08425057.1| uncharacterized DUF303 domain protein [Moorea producens 3L]
gi|332356323|gb|EGJ35777.1| uncharacterized DUF303 domain protein [Moorea producens 3L]
Length = 303
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 114/252 (45%), Gaps = 44/252 (17%)
Query: 16 PVKCQYQQQ-QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLK 74
PV +Q L ILAGQSNM+G G +T P +P + +
Sbjct: 52 PVPANFQGNISLFILAGQSNMSGSGKLT--------------PASSITHPRVFVFGNDYR 97
Query: 75 WVLAHEPLHA------DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ 128
W L EP+ + + +K+ GV PG+ FA +L P ++GL+PCA T I Q
Sbjct: 98 WHLGKEPIDSPSGQVDHVSEDKSAGVSPGIAFATELLKYDPEL-IVGLIPCAKWDTTIQQ 156
Query: 129 WRKG---SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKER-------S 178
W+K +LY ++RA A G I+ +L++QGESD +N + Y R +
Sbjct: 157 WQKNLSEDTLYGSCLKRAYAA-SPMGEIQGLLFFQGESDALN---PQAYPSRRFFPNQWA 212
Query: 179 DMF---FTDLRSDLQSPLLPIIRVALASGEGPFI----EIVRKAQLSSDLPNVRCVDAMG 231
D F D R DL P LP++ + + P E V+ Q + LP +
Sbjct: 213 DKFVRLVKDFRQDLGKPELPVVFAQIGTTTDPEKLPNWETVKAQQETVQLPATGMITTDD 272
Query: 232 LPLEPDGLHLTT 243
L L+ D +HLTT
Sbjct: 273 LALQ-DHVHLTT 283
>gi|169349976|ref|ZP_02866914.1| hypothetical protein CLOSPI_00716 [Clostridium spiroforme DSM 1552]
gi|169293189|gb|EDS75322.1| hypothetical protein CLOSPI_00716 [Clostridium spiroforme DSM 1552]
Length = 276
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 72/230 (31%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + V P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLHE------------VTPIYNENIFMLR---NGRWQMMVEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA A IGL+PCA GG++I +W L+ I A+ A
Sbjct: 49 SVA---GIGPAASFAQA-WCNANKSEQIGLIPCAEGGSSIDEWNTDGILFRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + + K Y ++ ++ R +L++P +P I L G
Sbjct: 105 MENSELI-AILWHQGESDS-HSKRYKDYYQKLNVIVNSFRKELKAPEIPFIIGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMG--LPLEPDGLHLTTPAQ 246
+ F E+V + L N C G L + PDG+H+ +Q
Sbjct: 163 KTGFGKSCIEYELVNQELLKYAKNNKNCYFVTGEKLYVNPDGIHINAESQ 212
>gi|294500349|ref|YP_003564049.1| hypothetical protein BMQ_3602 [Bacillus megaterium QM B1551]
gi|294350286|gb|ADE70615.1| conserved hypothetical protein [Bacillus megaterium QM B1551]
Length = 282
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 111/230 (48%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + + VPP + +LR +W EPL+ D
Sbjct: 4 VLLIGQSNMAGRGFIED------------VPPIYNEHIHMLR---NGRWQTMAEPLNFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
++ GVGP FA A T+ IG++PCA GG++I +W L I A+ A
Sbjct: 49 HIS---GVGPAASFAQA-WTEDHQGESIGVIPCAEGGSSIDEWTIDGLLTRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ + +LW+QGESD+ E K Y+++ F LR +L +P +PII L G
Sbjct: 105 METSDLV-GILWHQGESDSYG-ERYKTYEDKLLSLFKHLREELNAPDIPIIIGELGHYLG 162
Query: 205 EGPF----IEIVRKAQLSSDL----PNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F +E + Q+ S + N V + GL PDG+H+ +Q
Sbjct: 163 DVGFGKSAVEYKQINQILSKVAHTEKNCYFVTSKGLTANPDGIHIDAVSQ 212
>gi|449446512|ref|XP_004141015.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 203
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 62/119 (52%), Gaps = 9/119 (7%)
Query: 48 NKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTK 107
N WD +PP P PS LR W EPLH DID KTNGVGPG+ FA+ +L K
Sbjct: 15 NICVWDKHIPPGSIPQPSTLRFALNYTWEQGREPLHWDIDPTKTNGVGPGMAFADHLLAK 74
Query: 108 VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTV 166
+ + IS+W KG+ Y +I+R +L GG ++ +W+QGESD
Sbjct: 75 ASE---------NLDCSRISEWIKGTGRYTSLIRRINASLESGGRLQGFVWFQGESDAA 124
>gi|338210317|ref|YP_004654364.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336304130|gb|AEI47232.1| protein of unknown function DUF303 acetylesterase [Runella
slithyformis DSM 19594]
Length = 266
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 117/258 (45%), Gaps = 33/258 (12%)
Query: 1 MFAWLLCLI-LVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQ 59
+F ++ L L + A + ++ L +L GQSNMAGRG VT RT
Sbjct: 12 LFIYVFLLFSLKAMAQNPDFKGKKLHLYLLVGQSNMAGRGEVTEADRT------------ 59
Query: 60 CQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
P+P I L + +WV A P+H D GVGPG FA ++ + +IGL+P
Sbjct: 60 --PHPRIWMLNKESQWVPAVAPMHFD---KPFAGVGPGFEFAK-IMAEADTTVMIGLIPA 113
Query: 120 AIGGTNISQWRKG-------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAK 172
A GG+ I W+ G S Y+ I+R + AL GT++ +LW+QGE D+ E
Sbjct: 114 AAGGSPIDVWQTGGYHDQTKSYPYDDAIRRTKAAL-PAGTLKGILWHQGEGDS-KPELVG 171
Query: 173 LYKERSDMFFTDLRSDLQSPLLPIIRVALA---SGEGPFIEIVRKA--QLSSDLPNVRCV 227
Y ++ + R +L + +P + L + P + + L + C
Sbjct: 172 SYTQKLESLIGRFRKELSARNVPFVVGTLGDFFAANNPEAKNINDQLRNLPQKVKRTACA 231
Query: 228 DAMGLPLEPDGLHLTTPA 245
+A GL + D H TP+
Sbjct: 232 EATGLTDKGDKTHFDTPS 249
>gi|407784602|ref|ZP_11131751.1| hypothetical protein B30_01125 [Celeribacter baekdonensis B30]
gi|407204304|gb|EKE74285.1| hypothetical protein B30_01125 [Celeribacter baekdonensis B30]
Length = 512
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 129/287 (44%), Gaps = 40/287 (13%)
Query: 19 CQYQQQQLIILAGQSNMAG-RGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVL 77
Q+ LI++AGQSNMAG + GV + T + +G R+ A WV
Sbjct: 2 AQHAPSDLILMAGQSNMAGHKVGVDDLAETERGLIEGA------------RIWANGAWV- 48
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANA--VLTKVPNFGVIGLVPCAIGGTNISQ-WR---K 131
PL D K G GP L FA V T P + +V A GG+ +S+ W +
Sbjct: 49 ---PLAVDAGYQK-RGFGPELSFARQWQVQTGRP----LSIVKLAKGGSYLSRGWSAEGR 100
Query: 132 GSSLYEQMIQRAQVALRGGGT-IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQ 190
G LY++++ + A+ G +R ++W QGESD ++ EDA+ Y R + F LR DL
Sbjct: 101 GGPLYQRLVAEVRAAMATGPVRLRGLIWMQGESDALDHEDAQAYGTRFEGFVARLRQDLG 160
Query: 191 SPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTL 250
P LPI+ L + G +++VR A S+D+ + V+ L +HLT +
Sbjct: 161 VPDLPIV-AGLITAPGGHVDLVRDAMASADVTAFKTVETRDLAHRSGAVHLTASGLAALG 219
Query: 251 NSWSNEALRVNLSLLVFRIL----------EGSCRISKQAVSSLPHC 287
+++ S L+ + L EG V SLPH
Sbjct: 220 QRFADALSSFEDSALIRQWLWTSDQYHAWYEGETLTPTGVVVSLPHA 266
>gi|313897635|ref|ZP_07831177.1| conserved hypothetical protein [Clostridium sp. HGF2]
gi|312957587|gb|EFR39213.1| conserved hypothetical protein [Clostridium sp. HGF2]
Length = 276
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + T P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLNEAT------------PIYNENIFMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GVGP FA A N IGL+PCA GG++I +W K +L+ + A+ A
Sbjct: 49 SVA---GVGPAASFAQAWCNANKN-EQIGLIPCAEGGSSIDEWDKEGALFRHAVSEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L++ +P I L G
Sbjct: 105 MENSELI-AILWHQGESDS-HSGKYKNYYQKLNVLVNSFRKELEALEVPFIAGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F +++ + L N C G L PDG+H+ +Q
Sbjct: 163 KSGFGRSCVEYDLINQELLKYAEYNRNCYFVTGEKLYPNPDGIHINAESQ 212
>gi|239625735|ref|ZP_04668766.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239519965|gb|EEQ59831.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 280
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 109/230 (47%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 8 FLIIGQSNMAGRGYLHE------------VKPIVNERIVMLR---NGRWQMMAEPINCDR 52
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+ FA+A + G IGL+PCA GG+ I +W G +LY+ I A A
Sbjct: 53 SVA---GISLAASFADAWCHENKE-GRIGLIPCAEGGSEIDEWDVGKALYDHAISEAHFA 108
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
++ + +LW+QGESD++ + ++Y E+ R +L + +PII L
Sbjct: 109 MK-NSQLTGILWHQGESDSMGGKH-EIYYEKLHRIMQGFRKELDASNIPIIIGGLGDFLG 166
Query: 203 -SGEG----PFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG G + I +K Q + ++ N VDA GL PDG+H+ +Q
Sbjct: 167 QSGFGKNCTEYTLINQKLKQFAFEVDNCYFVDAAGLTCNPDGIHINAVSQ 216
>gi|346313852|ref|ZP_08855379.1| hypothetical protein HMPREF9022_01036 [Erysipelotrichaceae
bacterium 2_2_44A]
gi|345907707|gb|EGX77417.1| hypothetical protein HMPREF9022_01036 [Erysipelotrichaceae
bacterium 2_2_44A]
Length = 276
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + T P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLNEAT------------PIYNENIFMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVGP FA A N IGL+PCA GG++I +W K +L+ + ++ A
Sbjct: 49 SVS---GVGPAASFAQAWCNANKN-EQIGLIPCAEGGSSIDEWDKEGALFRHAVSESKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L++ +P I L G
Sbjct: 105 MENSELI-AILWHQGESDS-HSGKYKNYYQKLNVLVNSFRKELEALEVPFIAGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F +++ + L N C G L PDG+H+ +Q
Sbjct: 163 KSGFGRSCVEYDLINQELLKYAEYNRNCYFVTGEKLYPNPDGIHINAESQ 212
>gi|326802358|ref|YP_004320177.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326553122|gb|ADZ81507.1| protein of unknown function DUF303 acetylesterase [Sphingobacterium
sp. 21]
Length = 278
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 111/234 (47%), Gaps = 31/234 (13%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL 82
+ + +L GQSNMAGRG T++ VP +LR +W + EP+
Sbjct: 2 EMKSFLLIGQSNMAGRG-FTHE-----------VPSIYNERIMMLR---NGRWQMMTEPI 46
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR 142
H D + GVG FA A + IGL+PCA GG++I +W +L+ I
Sbjct: 47 HFDRPIA---GVGLSASFAEAWCSDHEG-EKIGLIPCAEGGSSIDEWSTDGTLFRHAINE 102
Query: 143 AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII----- 197
A+ A+ + VLW+QGESD+ + + K+Y+E+ F ++R L +P +P I
Sbjct: 103 AKFAME-DSELAGVLWHQGESDSHDGKH-KVYREKISRIFDEIRRALSAPNIPFIIGALG 160
Query: 198 ----RVALASGEGPFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+VA +G + I + Q + D N V A GL PDG+H +Q
Sbjct: 161 DYLGKVAFGAGCIEYKLINEELQKYAMDNKNCYYVTAEGLTANPDGIHHDAMSQ 214
>gi|407475239|ref|YP_006789639.1| hypothetical protein Curi_c27990 [Clostridium acidurici 9a]
gi|407051747|gb|AFS79792.1| hypothetical protein DUF303 [Clostridium acidurici 9a]
Length = 281
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 110/230 (47%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + + VP C + +LR W + EP++ D
Sbjct: 5 FLMVGQSNMAGRGFLKD------------VPIICNEHIKVLRNGL---WQIMMEPINYD- 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
G+GP FA A + N IGL+PCA GG ++ W SL++ I +A++A
Sbjct: 49 --RPYAGIGPAASFAAAWCRENKN-EEIGLIPCAEGGASLDDWSVDGSLFKHAILQAKLA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ + +LW+QGESD+++ KLY E+ + R L P +PII + G
Sbjct: 106 QQ-NSKLEGILWHQGESDSMS-GLYKLYHEKFLKITEEFRKQLGEPDIPIIMGGIGDYLG 163
Query: 205 EG-------PFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
EG + EI ++ Q ++ N V A GL PDG+HL +Q
Sbjct: 164 EGFLGEYFPEYSEINQELLQFANTHKNCYFVTASGLTPNPDGIHLNAASQ 213
>gi|73663502|ref|YP_302283.1| hypothetical protein SSP2193 [Staphylococcus saprophyticus subsp.
saprophyticus ATCC 15305]
gi|72496017|dbj|BAE19338.1| hypothetical protein [Staphylococcus saprophyticus subsp.
saprophyticus ATCC 15305]
Length = 280
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 75/230 (32%), Positives = 101/230 (43%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + VPP +LR KW + EP+H+D
Sbjct: 4 ILLIGQSNMAGRGFIDE------------VPPIIDERMMMLR---NGKWQMMEEPIHSDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA L K PN IGL+PCA GGT I W L + A A
Sbjct: 49 SVA---GIGPAASFAKLWLDKHPN-ETIGLIPCADGGTTIDDWAPDQILTRHALAEATFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
I +LW+QGESD++N + + Y ++ R L P +P I L G
Sbjct: 105 QETSEII-GILWHQGESDSLN-QRYQDYDKKLKTLINYFREQLNIPEVPFIVGLLPDFLG 162
Query: 205 EGPFIE-IVRKAQLSSDLP-------NVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F + V AQ++ L N V A + PD +H+ +Q
Sbjct: 163 KAAFGQSAVEYAQINEALKRVTQLTTNSYYVTAQDITANPDAIHINANSQ 212
>gi|312129141|ref|YP_003996481.1| hypothetical protein Lbys_0350 [Leadbetterella byssophila DSM
17132]
gi|311905687|gb|ADQ16128.1| protein of unknown function DUF303 acetylesterase [Leadbetterella
byssophila DSM 17132]
Length = 247
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 115/254 (45%), Gaps = 43/254 (16%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
LCL + +A L +L GQSNMAGRG + N P+
Sbjct: 7 FLCLSITVQA------QNNLDLYLLVGQSNMAGRGTLDN---------------YLLPSD 45
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT 124
S+ L L WV A EP H D G G FA +L+K + IGL+P A+GGT
Sbjct: 46 SLWMLAKDLSWVRAKEPFHYD---KSAAGAGLAASFARIILSKDKH--PIGLIPAAVGGT 100
Query: 125 NISQWRKGSSL-------YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKER 177
+I WR G+ Y+ I+RA+VAL+ G I+A+LW+QGESDT E Y +
Sbjct: 101 SIRYWRSGAQDPATGLYPYDDAIRRAKVALK-HGKIKAILWHQGESDT---ESTASYVQE 156
Query: 178 SDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK------AQLSSDLPNVRCVDAMG 231
+L DL PL I + +GE R+ ++ + LP V+ V + G
Sbjct: 157 FISLMDNLHRDLDLPLGSIPVIIGETGEFGDRSNSRQRINAVIREIPNRLPFVKVVTSEG 216
Query: 232 LPLEPDGLHLTTPA 245
L D H TPA
Sbjct: 217 LTHNGDLTHFDTPA 230
>gi|414159960|ref|ZP_11416232.1| hypothetical protein HMPREF9310_00606 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878897|gb|EKS26762.1| hypothetical protein HMPREF9310_00606 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 277
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/226 (31%), Positives = 105/226 (46%), Gaps = 31/226 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + VP + +LR +W + EP+HAD
Sbjct: 4 ILLLGQSNMAGRGFLNE------------VPAIINEHIHVLR---NGRWQMMGEPIHAD- 47
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
GVG FA A PN IGL+PCA GG+ IS+W+ GS L + A+ A
Sbjct: 48 --RHLAGVGLASAFAQAWSIDHPNES-IGLIPCAEGGSAISEWQPGSVLMRHALSEARFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII---RVALAS 203
I +LW+QGE+D N + ++Y+ + +R +L P +P I L
Sbjct: 105 QETSEII-GILWHQGEND-CNQDLYQVYQSQLKNVIAHVRKELDLPHVPFIIGGLDHLTH 162
Query: 204 GEGPFIEIVRKAQLSSDL-------PNVRCVDAMGLPLEPDGLHLT 242
EG + + A+++ L P+ V + GL + PDG+H
Sbjct: 163 AEGFSRTLTQHAEINHILQTMPQQVPDTYFVTSKGLTMNPDGIHFN 208
>gi|449497123|ref|XP_004160319.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Cucumis
sativus]
Length = 203
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 61/119 (51%), Gaps = 9/119 (7%)
Query: 48 NKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTK 107
N WD +PP P P+ LR W EPLH DID KTNGVGPG+ FA+ +L K
Sbjct: 15 NICVWDKHIPPGSIPQPTTLRFALNYTWEQGREPLHWDIDPTKTNGVGPGMAFADHLLAK 74
Query: 108 VPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTV 166
+ + IS+W KG Y +I+R +L GG ++ +W+QGESD
Sbjct: 75 ASE---------NLDCSRISEWIKGIGRYTSLIRRINASLESGGRLQGFVWFQGESDAA 124
>gi|389575037|ref|ZP_10165087.1| acetylxylan esterase [Bacillus sp. M 2-6]
gi|388425092|gb|EIL82927.1| acetylxylan esterase [Bacillus sp. M 2-6]
Length = 276
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 109/232 (46%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
+L GQSNMAGRG + VPP +LR +W + EP+H D
Sbjct: 4 FLLIGQSNMAGRG------------FKHEVPPIYNERIMMLR---NGRWQMMTEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GVG FA K IGL+PCA GG+ I +W + +L+ I A+ A
Sbjct: 49 SVA---GVGLAASFAE-TWCKDHEGEKIGLIPCAEGGSTIDEWSRDGALFRHAINEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
R + +LW+QGESD+ + + K Y E+ F +LR++L P +P++
Sbjct: 105 -REDSELAGILWHQGESDSQDGK-YKEYDEKIRRLFHELRTELSVPNIPLVIGGLGDFLG 162
Query: 198 RVALASG--EGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ A +G E I EI++K ++ N V A L PDG+H+ +Q
Sbjct: 163 KTAFGAGCVEHQLINEILQK--YANHHENCYYVTAKSLIPNPDGIHINAMSQ 212
>gi|403047500|ref|ZP_10902968.1| hypothetical protein SOJ_25770 [Staphylococcus sp. OJ82]
gi|402763034|gb|EJX17128.1| hypothetical protein SOJ_25770 [Staphylococcus sp. OJ82]
Length = 279
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/233 (31%), Positives = 104/233 (44%), Gaps = 37/233 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + + V P +L+ +W + EP+H+D
Sbjct: 4 ILLIGQSNMAGRGFIDS------------VKPILDERIQVLK---NGRWQMMDEPIHSDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA L P+ IGL+PCA GGT I W + L I A+ A
Sbjct: 49 SVA---GIGPAASFAKLWLDDHPD-ETIGLIPCADGGTTIDDWAEDQVLTRHAISEAEFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKL-YKERSDMFFTDLRSDLQSPLLPIIRVALAS-- 203
+ I +LW+QGESD+ LE L Y+ + + R L +P LP + L
Sbjct: 105 MESSELI-GILWHQGESDS--LEGKHLDYEIKLNQVVDHFRQALNAPQLPFVMGLLGDFL 161
Query: 204 GEGPF----------IEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G+ F E+++ + D N V A GL PD +H+ +Q
Sbjct: 162 GQAAFGQSASEYTQINEVIKTVAEAKD--NCFYVTAQGLTANPDEIHIDAQSQ 212
>gi|81427760|ref|YP_394759.1| deacetylase (acetyl esterase) [Lactobacillus sakei subsp. sakei
23K]
gi|78609401|emb|CAI54447.1| Putative deacetylase (acetyl esterase) [Lactobacillus sakei subsp.
sakei 23K]
Length = 283
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/230 (31%), Positives = 106/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I+L GQSNMAGRG + + VP +LR +W + EP+H D
Sbjct: 5 ILLVGQSNMAGRGFIQD------------VPGLRHERVKMLR---NGRWQMMAEPIHFDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
+V GVGP FA A + P+ +GL+PCA GG++I +W L I A+ A
Sbjct: 50 EVA---GVGPAASFAAAWVQAHPD-EELGLIPCAEGGSSIDEWASDEMLMRHAIAEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
I VLW+QGESD++ + Y + F+ LR L LPII
Sbjct: 106 QESSELI-GVLWHQGESDSLK-GGYQTYAAKLTAVFSHLRQALGQADLPIIVGQLPDFLG 163
Query: 198 RVALASGEGPFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ + F +I R+ A + + P+ V+A L PDG+H+ +Q
Sbjct: 164 QEGFGASATEFNDINREMANVVAQDPHSYLVNAAELTANPDGIHIDAASQ 213
>gi|298247865|ref|ZP_06971670.1| protein of unknown function DUF303 acetylesterase putative
[Ktedonobacter racemifer DSM 44963]
gi|297550524|gb|EFH84390.1| protein of unknown function DUF303 acetylesterase putative
[Ktedonobacter racemifer DSM 44963]
Length = 406
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 120/280 (42%), Gaps = 85/280 (30%)
Query: 9 ILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILR 68
ILV E W +LAGQSNM G G + + P+P +
Sbjct: 46 ILVGEIW------------VLAGQSNMEGIGDLIDVE---------------SPSPFVHS 78
Query: 69 LTAKLKWVLAHEPLH-----------------------ADIDVNKTNGVGPGLPFANA-- 103
++ +W +A EPLH D KT G G GL FA
Sbjct: 79 FQSREEWAIAEEPLHWLGESPRIVHHQLWGFDKVPDEIPPRDPQKTKGAGLGLTFAKERY 138
Query: 104 VLTKVPNFGVIGLVPCAIGGTNISQW------RKGSSLYEQMIQRAQVALRGGGTIRAVL 157
+ T VP IGL+P A GGT++ QW +SLY +++R + + GG I VL
Sbjct: 139 IRTGVP----IGLIPSAHGGTSMEQWDPAKRDEGDASLYGALLKRVE---KVGGKIAGVL 191
Query: 158 WYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV----- 212
WYQGESD E + Y +R T LR+DLQ+P LP V + G FI +
Sbjct: 192 WYQGESDAYP-EATERYHQRMHTLVTALRADLQAPDLPFYYVQI----GRFIRSIADPDA 246
Query: 213 -------RKAQLS--SDLPNVRCVDAMGLPLEPDGLHLTT 243
R+AQ + LP+ V + L L+ D +H++T
Sbjct: 247 DVCWSGMREAQRTWQDILPHTAMVATIDLELD-DLIHIST 285
>gi|383110688|ref|ZP_09931507.1| hypothetical protein BSGG_1797 [Bacteroides sp. D2]
gi|313694262|gb|EFS31097.1| hypothetical protein BSGG_1797 [Bacteroides sp. D2]
Length = 265
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 115/238 (48%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFANAVL--TKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA ++ TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMVRQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNSE---AYKQKLISLVKDLREDLDMPDLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|157691912|ref|YP_001486374.1| acetylxylan esterase [Bacillus pumilus SAFR-032]
gi|157680670|gb|ABV61814.1| possible acetylxylan esterase [Bacillus pumilus SAFR-032]
Length = 276
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
+L GQSNMAGRG + VPP +LR +W + EP+H D
Sbjct: 4 FLLIGQSNMAGRG------------FKHEVPPIYNERIMMLR---NGRWQMMTEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GVG FA K IGL+PCA GG++I +W + +L+ I A A
Sbjct: 49 PVA---GVGLAASFAE-TWCKDHEGEKIGLIPCAEGGSSIDEWSRDGALFRHAISEATFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
+ + +LW+QGESD+ + + K Y E+ F ++R++L P +P++
Sbjct: 105 -KENSELAGILWHQGESDSQDGK-YKEYDEKIRRLFHEIRTELSVPNIPLVIGGLGDFLG 162
Query: 198 RVALASGEGPFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+VA +G + I + Q + N V A GL PDG+H+ +Q
Sbjct: 163 KVAFGAGCVEYQLINEELQKYAHRHENCYYVTAKGLIPNPDGIHINAMSQ 212
>gi|323694409|ref|ZP_08108580.1| hypothetical protein HMPREF9475_03444 [Clostridium symbiosum
WAL-14673]
gi|323501490|gb|EGB17381.1| hypothetical protein HMPREF9475_03444 [Clostridium symbiosum
WAL-14673]
Length = 276
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 104/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + V P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLHE------------VKPIYNENILMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA A N +GL+PCA GG++I +W +L+ I A+ A
Sbjct: 49 SVA---GIGPAASFAQAWCNANKN-EQVGLIPCAEGGSSIDEWNVEGALFRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L +P I L G
Sbjct: 105 METSDLI-AILWHQGESDS-HSGKYKDYYQKLNVMVNSFRKELGVLEVPFIVGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F E+V + L N C G L PDG+H+ +Q
Sbjct: 163 KSAFGRSCVEYELVNQELLRYAENNSNCYFVTGEKLYSNPDGIHINAESQ 212
>gi|402573421|ref|YP_006622764.1| hypothetical protein Desmer_3006 [Desulfosporosinus meridiei DSM
13257]
gi|402254618|gb|AFQ44893.1| protein of unknown function (DUF303) [Desulfosporosinus meridiei
DSM 13257]
Length = 275
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 68/230 (29%), Positives = 113/230 (49%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + + VPP +LR +W + EP++ D
Sbjct: 5 FLMIGQSNMAGRGFLND------------VPPIINERIQMLR---NGRWQMMIEPVNYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GV FA+A +K P IGL+PCA GG+++ W L++ + A+ A
Sbjct: 50 PVS---GVSLAASFADAWCSKYPE-DRIGLIPCAEGGSSLDDWSVDGELFQHAVSEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
++ T+ +LW+QGESD+ + + K+Y ++ + LR L P +P+I L
Sbjct: 106 MK-HSTLTGILWHQGESDSSDGK-YKVYYDKLSVIVQTLRDILNVPEVPLIIGGLGDYLG 163
Query: 203 -SGEGPF-IEIVR----KAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G + +E R + + + + V A GL PDG+H+ + +Q
Sbjct: 164 KTGFGQYCVEYARINDCLQKFAFEQAHCYFVSAQGLTANPDGIHVNSLSQ 213
>gi|323487477|ref|ZP_08092771.1| hypothetical protein HMPREF9474_04522 [Clostridium symbiosum
WAL-14163]
gi|323399159|gb|EGA91563.1| hypothetical protein HMPREF9474_04522 [Clostridium symbiosum
WAL-14163]
Length = 276
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 72/230 (31%), Positives = 104/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + V P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLHE------------VKPIYNENILMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA A N V GL+PCA GG++I +W +L+ I A+ A
Sbjct: 49 SVA---GIGPAASFAQAWCNANKNEQV-GLIPCAEGGSSIDEWNVEGALFRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y ++ ++ R +L +P I L G
Sbjct: 105 METSDLI-AILWHQGESDS-HSGKYKDYYQKLNVMVNSFRKELGVLEVPFIVGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F E+V + L N C G L PDG+H+ +Q
Sbjct: 163 KSAFGRSCVEYELVNQELLRYAENNSNCYFVTGEKLYSNPDGIHINAESQ 212
>gi|355628552|ref|ZP_09049834.1| hypothetical protein HMPREF1020_03913 [Clostridium sp. 7_3_54FAA]
gi|354819801|gb|EHF04239.1| hypothetical protein HMPREF1020_03913 [Clostridium sp. 7_3_54FAA]
Length = 276
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 72/230 (31%), Positives = 103/230 (44%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG + V P N +LR +W + EP+H D
Sbjct: 4 VLLIGQSNMAGRGFLHE------------VKPIYNENILMLR---NGRWQMMAEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V G+GP FA A N V GL+PCA GG++I +W +L+ I A+ A
Sbjct: 49 SVA---GIGPAASFAQAWCNANKNEQV-GLIPCAEGGSSIDEWNVEGALFRHAISEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I A+LW+QGESD+ + K Y + ++ R +L +P I L G
Sbjct: 105 METSDLI-AILWHQGESDS-HSGKYKDYYHKLNVMVNSFRKELSVLDVPFIVGGLGDYLG 162
Query: 205 EGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPAQ 246
+ F E+V + L N C G L PDG+H+ +Q
Sbjct: 163 KSAFGRSCVEYELVNQELLRYAENNSNCYFVTGEKLYSNPDGIHINAESQ 212
>gi|392393279|ref|YP_006429881.1| hypothetical protein Desde_1685 [Desulfitobacterium dehalogenans
ATCC 51507]
gi|390524357|gb|AFM00088.1| protein of unknown function (DUF303) [Desulfitobacterium
dehalogenans ATCC 51507]
Length = 276
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 108/232 (46%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++L GQSNMAGRG T++ VPP +LR +W + EP+H D
Sbjct: 4 LLLIGQSNMAGRG-FTHE-----------VPPIYNEKIMMLR---NGRWQMMTEPIHFDR 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GVG FA A K IGL+PCA GG+ I +W +L+ + A+ A
Sbjct: 49 PVA---GVGLAASFAEA-WCKDNEGEKIGLIPCAEGGSAIDEWSLDGTLFRHAMNEAKFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKL--YKERSDMFFTDLRSDLQSPLLPII------- 197
+ + +LW+QGESD+ +D K Y E+ F ++R +L P +P I
Sbjct: 105 MEDSELV-GILWHQGESDS---QDGKYKEYYEKILRIFNEIRRELSVPNIPFIIGGLGDY 160
Query: 198 --RVALASGEGPFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+VA +G + I + Q + N V A GL PDG+H+ +Q
Sbjct: 161 LGKVAFGAGCVEYQLINEELQKYAQGNENCYYVTAKGLTSNPDGIHINAMSQ 212
>gi|423215177|ref|ZP_17201705.1| hypothetical protein HMPREF1074_03237 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692440|gb|EIY85678.1| hypothetical protein HMPREF1074_03237 [Bacteroides xylanisolvens
CL03T12C04]
Length = 265
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 78/238 (32%), Positives = 114/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP+I
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLGMPNLPVIV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|299147477|ref|ZP_07040542.1| acetyl xylan esterase A [Bacteroides sp. 3_1_23]
gi|298514755|gb|EFI38639.1| acetyl xylan esterase A [Bacteroides sp. 3_1_23]
Length = 265
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 113/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKPISLVKDLREDLDMPDLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPYSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|237719610|ref|ZP_04550091.1| acetyl xylan esterase A [Bacteroides sp. 2_2_4]
gi|336406558|ref|ZP_08587209.1| hypothetical protein HMPREF0127_04522 [Bacteroides sp. 1_1_30]
gi|229450879|gb|EEO56670.1| acetyl xylan esterase A [Bacteroides sp. 2_2_4]
gi|335934460|gb|EGM96456.1| hypothetical protein HMPREF0127_04522 [Bacteroides sp. 1_1_30]
Length = 265
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 113/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLDMPDLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPYSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|294643573|ref|ZP_06721377.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294808672|ref|ZP_06767406.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|292641068|gb|EFF59282.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294444111|gb|EFG12844.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 265
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 114/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLRNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLGMPNLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|295088156|emb|CBK69679.1| Domain of unknown function (DUF303). [Bacteroides xylanisolvens
XB1A]
Length = 265
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 113/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRTTLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLDMPDLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPYSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|262405083|ref|ZP_06081633.1| acetyl xylan esterase A [Bacteroides sp. 2_1_22]
gi|345508216|ref|ZP_08787850.1| acetyl xylan esterase A [Bacteroides sp. D1]
gi|229444548|gb|EEO50339.1| acetyl xylan esterase A [Bacteroides sp. D1]
gi|262355958|gb|EEZ05048.1| acetyl xylan esterase A [Bacteroides sp. 2_1_22]
Length = 265
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 114/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR ++P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRAT--------------LIPEVMDTLRNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLGMPNLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|160885616|ref|ZP_02066619.1| hypothetical protein BACOVA_03618 [Bacteroides ovatus ATCC 8483]
gi|423290221|ref|ZP_17269070.1| hypothetical protein HMPREF1069_04113 [Bacteroides ovatus
CL02T12C04]
gi|423294483|ref|ZP_17272610.1| hypothetical protein HMPREF1070_01275 [Bacteroides ovatus
CL03T12C18]
gi|156109238|gb|EDO10983.1| hypothetical protein BACOVA_03618 [Bacteroides ovatus ATCC 8483]
gi|392665608|gb|EIY59131.1| hypothetical protein HMPREF1069_04113 [Bacteroides ovatus
CL02T12C04]
gi|392675674|gb|EIY69115.1| hypothetical protein HMPREF1070_01275 [Bacteroides ovatus
CL03T12C18]
Length = 265
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 113/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLGMPNLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPYSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|257869906|ref|ZP_05649559.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
gi|357051092|ref|ZP_09112288.1| hypothetical protein HMPREF9478_02271 [Enterococcus saccharolyticus
30_1]
gi|257804070|gb|EEV32892.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
gi|355380717|gb|EHG27853.1| hypothetical protein HMPREF9478_02271 [Enterococcus saccharolyticus
30_1]
Length = 282
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 108/232 (46%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + + VPP +LR +W + EP++ D
Sbjct: 5 FLMIGQSNMAGRGFIQD------------VPPIYNEKIKMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A + P IGL+PCA GG+ + +W +L+ I A+ A
Sbjct: 50 PVS---GISLAGSFADAWCHENPE-ETIGLIPCAEGGSTLDEWHVDQALFRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--- 203
+ + +LW+QGESD++N + K+Y ++ R +L +P +PII L
Sbjct: 106 ME-NSELTGILWHQGESDSMNGK-YKVYYQKLLSIMKAFREELNAPNIPIIIGGLGDFLG 163
Query: 204 --------GEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
E FI + ++K D N V A GL PDG+H+ +Q
Sbjct: 164 KEGFGKNCTEYNFINQELQKFAFEQD--NCYFVTAEGLTSNPDGIHIDAISQ 213
>gi|440781309|ref|ZP_20959651.1| hypothetical protein F502_05772 [Clostridium pasteurianum DSM 525]
gi|440220914|gb|ELP60120.1| hypothetical protein F502_05772 [Clostridium pasteurianum DSM 525]
Length = 282
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VPP +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFIHE------------VPPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A + +IGL+PCA GG+++ +W L+ I A+ A
Sbjct: 50 PVS---GISLAGSFADAWCRQNQE-DIIGLIPCAEGGSSLDEWAVDEVLFRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--- 203
++ + +LW+QGESD+VN + K+Y ++ + LR +L +P +PII L
Sbjct: 106 MQ-SSELTGILWHQGESDSVN-GNYKVYYKKLLLIIEALRKELNAPDIPIIIGGLGDFLG 163
Query: 204 GEGPFIEIVRKAQLSSDLP-------NVRCVDAMGLPLEPDGLHLTTPAQ 246
EG ++ DL N V A GL PDG+H+ +Q
Sbjct: 164 KEGFGKSCTEYNFINQDLEKFAFEQDNCYFVTASGLTSNPDGIHINAISQ 213
>gi|255533730|ref|YP_003094102.1| hypothetical protein Phep_3849 [Pedobacter heparinus DSM 2366]
gi|255346714|gb|ACU06040.1| protein of unknown function DUF303 acetylesterase putative
[Pedobacter heparinus DSM 2366]
Length = 276
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 117/242 (48%), Gaps = 26/242 (10%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ +L GQSNMAGRG + + P++L ++ KW++A PLH
Sbjct: 46 EIYLLLGQSNMAGRGPLLAEY-------------TAMEQPNVLVWDSEGKWIIARHPLH- 91
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYE 137
D K GVGPGL F A+ PN IGLVPCA+GGTNI W+ G + ++
Sbjct: 92 -YDKPKVAGVGPGLSFGFAMARSKPNV-RIGLVPCAVGGTNIDVWKPGAMDKATNTHPFD 149
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
R + A++ G ++ ++W+QGE+++ ++ Y ++ + T +R + + LP++
Sbjct: 150 DAEMRIREAMK-YGVVKGMIWHQGEANS-GAQNMIGYLDKLNELITRIRKMVGNEKLPVV 207
Query: 198 RVALASGEGPFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNE 256
L + + + + A +PN+ + L + D H +P+ + ++ +
Sbjct: 208 VGELGRYKTNYQQFNKMLAGAPQMIPNLALATSESLVDKGDLTHFDSPSATAYGKRYAEK 267
Query: 257 AL 258
L
Sbjct: 268 ML 269
>gi|398311538|ref|ZP_10515012.1| hypothetical protein BmojR_19587 [Bacillus mojavensis RO-H-1]
Length = 280
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKVLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGVLFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +S+ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFASEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|293371648|ref|ZP_06618059.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633345|gb|EFF51915.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 265
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 112/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG +I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGPSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL P LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLDMPDLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPYSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|417303585|ref|ZP_12090635.1| protein of unknown function acetylesterase [Rhodopirellula baltica
WH47]
gi|327540124|gb|EGF26718.1| protein of unknown function acetylesterase [Rhodopirellula baltica
WH47]
Length = 226
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 105/224 (46%), Gaps = 32/224 (14%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRG +++D QP+P +L +W A PLH D GV
Sbjct: 1 MAGRGKISDD--------------DLQPHPRVLVFNKAGEWAPAIAPLH--FDKPGIAGV 44
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYEQMIQRAQVAL 147
G G FA P +GL+PCA+GG+++ W+ G + Y+ ++R Q A+
Sbjct: 45 GLGRTFAIEYAENNPQV-TVGLIPCAVGGSSLDAWQPGGFHESTNTHPYDDCMKRMQHAI 103
Query: 148 RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS-GEG 206
G ++ +LW+QGESD+ N +K Y+ + D F R++ SP +PI+ L E
Sbjct: 104 V-AGELKGILWHQGESDS-NPALSKTYQSKLDQLFERFRTEFDSPNVPIMIGQLGQFTEK 161
Query: 207 PFIE---IVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
P+ E +V +A L + N V + GL + D H + A
Sbjct: 162 PWDESRTLVDQAHRTLPDRMTNTVFVHSDGLGHKGDQTHFSAEA 205
>gi|394992023|ref|ZP_10384816.1| hypothetical protein BB65665_06276 [Bacillus sp. 916]
gi|393807039|gb|EJD68365.1| hypothetical protein BB65665_06276 [Bacillus sp. 916]
Length = 280
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ LP+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELELDELPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|163789766|ref|ZP_02184203.1| hypothetical protein CAT7_06026 [Carnobacterium sp. AT7]
gi|159874988|gb|EDP69055.1| hypothetical protein CAT7_06026 [Carnobacterium sp. AT7]
Length = 280
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 103/230 (44%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLHE------------VDPIYNEKIKMLR---NGQWQMMTEPVNYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GV FA+A PN+ IGL+PCA GG+ ++ W +L++ + A+ A
Sbjct: 50 PVA---GVSLAASFADAWSKAHPNY-EIGLIPCAEGGSTLNDWHPQGTLFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
L I +LW+QGESD+ N + Y E+ LR +L+ +P+I
Sbjct: 106 LE-SSEICGILWHQGESDSNN-SLHETYYEKLSFIIETLRKELKLEDVPLIIGGLGEFLG 163
Query: 198 RVALASGEGPFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F EI + ++ + + N V A GL PDG+H +Q
Sbjct: 164 KTGFGKYSTEFQEINEQLSKFAHEQQNCYFVSAEGLTANPDGIHFNAVSQ 213
>gi|430756324|ref|YP_007208863.1| Carbohydrate esterase [Bacillus subtilis subsp. subtilis str. BSP1]
gi|430020844|gb|AGA21450.1| Carbohydrate esterase [Bacillus subtilis subsp. subtilis str. BSP1]
Length = 280
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGLVPCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLVPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKFTLIIETLRNELELDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|381199433|ref|ZP_09906582.1| hypothetical protein SyanX_03096 [Sphingobium yanoikuyae XLDN2-5]
Length = 271
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 74/238 (31%), Positives = 105/238 (44%), Gaps = 33/238 (13%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ +LAGQSNM+GRG +T+ + P+ P P ++ L A EP+ +
Sbjct: 29 IYVLAGQSNMSGRGALTD-----------LTEPERAPVPGVMMLGNDGIVRPAMEPIDSA 77
Query: 86 ------IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG---SSLY 136
+ ++ VGPGL FA A++ + I L+PCA GG+ I++WR G ++LY
Sbjct: 78 QGQQDMVSADRLAAVGPGLFFARALIARQRR--PILLIPCAKGGSAIARWRPGGDRTTLY 135
Query: 137 EQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPI 196
+ R + G + +LWYQGESDT A Y R DL LP
Sbjct: 136 GSCLARVRSVR---GRLAGILWYQGESDTEKDTAATGYGAALADLVGHFRRDLGRADLPF 192
Query: 197 IRVALAS--------GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
I +A P +V+ AQ L V GL + D LHL T AQ
Sbjct: 193 IFAQIADRPAAPEHVARYPGWAMVQAAQRDIALRCAYMVPTGGLERQADELHLVTDAQ 250
>gi|298482485|ref|ZP_07000671.1| acetyl xylan esterase A [Bacteroides sp. D22]
gi|298271464|gb|EFI13039.1| acetyl xylan esterase A [Bacteroides sp. D22]
Length = 265
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 111/236 (47%), Gaps = 36/236 (15%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQMI 140
V K +GP FA + K +GLV A GG++I+ W KGS YE+ +
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMTRKTKR--PLGLVVNARGGSSINSWLKGSKDGYYEEAL 136
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
R +VA++ GG ++A+LW+QGE+D N E YK++ DLR DL LP+I
Sbjct: 137 SRIRVAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLNMLDLPVIVGQ 193
Query: 201 LA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 194 ISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|336415192|ref|ZP_08595533.1| hypothetical protein HMPREF1017_02641 [Bacteroides ovatus
3_8_47FAA]
gi|335941225|gb|EGN03083.1| hypothetical protein HMPREF1017_02641 [Bacteroides ovatus
3_8_47FAA]
Length = 265
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 113/238 (47%), Gaps = 40/238 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGR +T P ++ L K + A PL+
Sbjct: 33 LYVCIGQSNMAGRATLT--------------PEVMDTLQNVYLLNDKGNFEPAVNPLNRY 78
Query: 86 IDVNKT---NGVGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--LYEQ 138
V K +GP FA A TK P +GLV A GG++I+ W KGS YE+
Sbjct: 79 STVRKDLSMQRLGPAYGFAKEMARQTKRP----VGLVVNARGGSSINSWLKGSKDGYYEE 134
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+ R ++A++ GG ++A+LW+QGE+D N E YK++ DLR DL LP++
Sbjct: 135 ALSRVRIAMKQGGVLKAILWHQGEADCSNPE---AYKQKLISLVKDLREDLGMSNLPVVV 191
Query: 199 VALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL--HLTTPAQ 246
++ +G PF ++++K +SS +P+ V + GL D H T AQ
Sbjct: 192 GQISQWNWTKREAGTVPFNQMIKK--VSSFIPHSDWVSSKGLGWYKDEKDPHFNTEAQ 247
>gi|384176231|ref|YP_005557616.1| hypothetical protein I33_2694 [Bacillus subtilis subsp. subtilis
str. RO-NN-1]
gi|349595455|gb|AEP91642.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis
str. RO-NN-1]
Length = 280
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 111/231 (48%), Gaps = 33/231 (14%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTV-NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA--- 202
LR I +LW+QGESD+ +L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYRSLHET--YYEKLTLIIETLRNELELDEVPLIIGGLGDFL 162
Query: 203 --SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL + +Q
Sbjct: 163 GKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDSASQ 213
>gi|421613723|ref|ZP_16054795.1| protein of unknown function acetylesterase [Rhodopirellula baltica
SH28]
gi|408495494|gb|EKK00081.1| protein of unknown function acetylesterase [Rhodopirellula baltica
SH28]
Length = 226
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 105/224 (46%), Gaps = 32/224 (14%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRG + ++ QP+P +L + +W A PLH D GV
Sbjct: 1 MAGRGKIADE--------------DLQPHPRVLVVNKAGEWAPAIAPLH--FDKPGIAGV 44
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG-------SSLYEQMIQRAQVAL 147
G G FA P +GL+PCA+GG+++ W+ G + Y+ ++R Q A+
Sbjct: 45 GLGRTFAIEYAENNPQV-TVGLIPCAVGGSSLDAWQPGGFHESTNTHPYDDCMKRMQQAI 103
Query: 148 RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS-GEG 206
G ++ +LW+QGESD+ N +K Y+ + D F R++ SP +PI+ L E
Sbjct: 104 V-AGELKGILWHQGESDS-NPALSKTYQSKLDQLFERFRTEFDSPSVPIVIGQLGQFTEK 161
Query: 207 PFIE---IVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
P+ E +V +A L + N V + GL + D H + A
Sbjct: 162 PWDESRKLVDQAHRTLPDRMTNTVFVHSDGLDHKGDQTHFSAEA 205
>gi|427410773|ref|ZP_18900975.1| hypothetical protein HMPREF9718_03449 [Sphingobium yanoikuyae ATCC
51230]
gi|425710761|gb|EKU73781.1| hypothetical protein HMPREF9718_03449 [Sphingobium yanoikuyae ATCC
51230]
Length = 271
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 105/240 (43%), Gaps = 37/240 (15%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNK--------LTWDGIVPPQCQPNPSILRLTAKLKWVL 77
+ +LAGQSNM+GRG + + T + L DGI+ P +P +
Sbjct: 29 IYVLAGQSNMSGRGALADLTEPERAPVPGVMMLGNDGIIRPAVEP-------------ID 75
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG---SS 134
+ + + ++ VGPGL FA A++ + I L+PCA GG+ I++WR G ++
Sbjct: 76 SAQGQQDMVSADRLAAVGPGLFFARALIARQRR--PILLIPCAKGGSAIARWRPGGDRTT 133
Query: 135 LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLL 194
LY + R + G + +LWYQGESDT N A Y R DL L
Sbjct: 134 LYGSCLARVRSVR---GRLAGILWYQGESDTENETAATGYGAALADLVGHFRRDLGRAEL 190
Query: 195 PIIRVALAS--------GEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
P + +A P +V+ AQ L V GL + D LHL T AQ
Sbjct: 191 PFLFAQIADRPAAPEHVARYPGWAMVQAAQRDIALRCAYMVPTGGLARQADELHLVTDAQ 250
>gi|386759194|ref|YP_006232410.1| hypothetical protein MY9_2621 [Bacillus sp. JS]
gi|384932476|gb|AFI29154.1| hypothetical protein MY9_2621 [Bacillus sp. JS]
Length = 280
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLSLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|384160228|ref|YP_005542301.1| hypothetical protein BAMTA208_13235 [Bacillus amyloliquefaciens
TA208]
gi|384165156|ref|YP_005546535.1| carbohydrate esterase family 6 protein [Bacillus amyloliquefaciens
LL3]
gi|384169298|ref|YP_005550676.1| carbohydrate esterase family 6 protein [Bacillus amyloliquefaciens
XH7]
gi|328554316|gb|AEB24808.1| hypothetical protein BAMTA208_13235 [Bacillus amyloliquefaciens
TA208]
gi|328912711|gb|AEB64307.1| Putative carbohydrate esterase family 6 protein [Bacillus
amyloliquefaciens LL3]
gi|341828577|gb|AEK89828.1| carbohydrate esterase family 6 protein [Bacillus amyloliquefaciens
XH7]
Length = 280
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLHFANEQQNCYFVTATGLTANPDGIHLDAASQ 213
>gi|29349588|ref|NP_813091.1| acetyl xylan esterase A [Bacteroides thetaiotaomicron VPI-5482]
gi|298383849|ref|ZP_06993410.1| acetyl xylan esterase AxeA [Bacteroides sp. 1_1_14]
gi|29341498|gb|AAO79285.1| acetyl xylan esterase A [Bacteroides thetaiotaomicron VPI-5482]
gi|298263453|gb|EFI06316.1| acetyl xylan esterase AxeA [Bacteroides sp. 1_1_14]
Length = 267
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 124/257 (48%), Gaps = 49/257 (19%)
Query: 4 WLLCLIL--VSEAWPVKCQYQQQQLIILAGQSNMAGRGGVT---NDTRTNK--LTWDGIV 56
+LLC+++ SEA K + L + GQSNMAGRG ++ DT N L D
Sbjct: 10 FLLCVLVWGRSEAHAEK-PLKTLDLYLCIGQSNMAGRGKLSPEVMDTLQNVYLLNADDQF 68
Query: 57 PPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGL 116
P P + L W VGP FA + TK +GL
Sbjct: 69 EPAVNPLNRYSTIGKGLSW----------------QQVGPAYGFAKTMATKK---HPVGL 109
Query: 117 VPCAIGGTNISQWRKGSS----LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAK 172
+ A GG++I W K + Y++ I+RA+ A++ G T++A++W+QGE+D + E
Sbjct: 110 IVNARGGSSIRSWVKNAKQSGGYYDEAIRRAKEAMKYG-TLKAIIWHQGEADCHHPE--- 165
Query: 173 LYKERSDMFFTDLRSDLQSPLLPII-----------RVALASGEGPFIEIVRKAQLSSDL 221
YKE+ TDLR+DL P LP++ + + G PF ++++ ++S+ L
Sbjct: 166 AYKEKIIQLMTDLRNDLGMPDLPVVVGQIAQWNWTKKPYIPEGTKPFNDMIK--EISTFL 223
Query: 222 PNVRCVDAMGL-PLEPD 237
P+ CV + GL PL+ +
Sbjct: 224 PHSACVSSEGLTPLKDE 240
>gi|380693922|ref|ZP_09858781.1| acetyl xylan esterase A [Bacteroides faecis MAJ27]
Length = 527
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 116/231 (50%), Gaps = 42/231 (18%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGRG ++ P ++ L A+ K+ A PL+
Sbjct: 293 LYLCVGQSNMAGRGKLS--------------PEVMDTLRNVYLLNAEDKFEPAVNPLNRY 338
Query: 86 IDVNKTNG---VGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS----LYEQ 138
+ K G +GP FA + TK IGL+ A GG++I W K + Y++
Sbjct: 339 STIGKGFGWQQLGPAYGFAKEMATKKH---PIGLIVNARGGSSIRSWVKNAKQSGGYYDE 395
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
++R + A++ G T++A++W+QGE+D + E Y+E+ TDLR+DL P LP++
Sbjct: 396 AVRRTKEAMKYG-TLKAIIWHQGEADCHHSE---AYREKITQLMTDLRNDLGMPDLPVVV 451
Query: 199 VALA-----------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGL-PLEPD 237
+A G PF ++++ ++S+ LP+ CV + GL PL+ +
Sbjct: 452 GQIAQWNWTRKPHIPEGTKPFNDMIK--EISAFLPHSACVSSEGLTPLKDE 500
>gi|308174391|ref|YP_003921096.1| hypothetical protein BAMF_2500 [Bacillus amyloliquefaciens DSM 7]
gi|307607255|emb|CBI43626.1| RBAM024050 [Bacillus amyloliquefaciens DSM 7]
Length = 280
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 111/231 (48%), Gaps = 33/231 (14%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A +K + IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADA-WSKAHSDEEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTV-NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA--- 202
LR I +LW+QGESD+ +L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYRSLHET--YYEKLTLIIETLRNELKLDEVPLIIGGLGDFL 162
Query: 203 --SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 163 GKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|350266797|ref|YP_004878104.1| hypothetical protein GYO_2864 [Bacillus subtilis subsp. spizizenii
TU-B-10]
gi|349599684|gb|AEP87472.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii
TU-B-10]
Length = 280
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|399887973|ref|ZP_10773850.1| hypothetical protein CarbS_05480 [Clostridium arbusti SL206]
Length = 282
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 108/232 (46%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VPP +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFIHE------------VPPIYNERIQMLR---NGRWQMMAEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A + +IGL+PCA GG+++ +W L+ I A+ A
Sbjct: 50 PVS---GISLAGSFADAWCRQNQE-DIIGLIPCAEGGSSLDEWAVDEVLFRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--- 203
++ + +LW+QGE D+VN + K+Y ++ + LR L +P +PII L
Sbjct: 106 MQ-SSELTGILWHQGECDSVN-GNYKVYYKKLLLIIEALRKGLNAPDIPIIIGGLGDFLG 163
Query: 204 --------GEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
E FI + + K D N V A+GL PDG+H+ +Q
Sbjct: 164 KEGFGKSCTEYNFINQELEKFAFEQD--NCYFVTALGLTSNPDGIHIDAISQ 213
>gi|424765938|ref|ZP_18193300.1| hypothetical protein HMPREF1345_02190 [Enterococcus faecium
TX1337RF]
gi|402412945|gb|EJV45296.1| hypothetical protein HMPREF1345_02190 [Enterococcus faecium
TX1337RF]
Length = 285
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 105/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + + VPP +LR W + EP++ D
Sbjct: 8 FLMIGQSNMAGRGFIND------------VPPIYNERIKMLRNGG---WQMMTEPINYDR 52
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GV FA+A V IGL+PCA GG+ + +W +L+ I A+ A
Sbjct: 53 PVS---GVSLAASFADA-WCNVNREETIGLIPCAEGGSTLDEWHVDQTLFRHAITEAKFA 108
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
+ I +LW+QGESD++N + K+Y ++ LR +L +P +PII L G
Sbjct: 109 MENSELI-GILWHQGESDSMNGK-YKVYYQKLLAIMKALRKELSAPNIPIIIGGLGDFLG 166
Query: 205 EGPF------IEIVRKAQLSSDLPNVRC--VDAMGLPLEPDGLHLTTPAQ 246
+ F ++ + C V A GL PDG+H+ +Q
Sbjct: 167 KEGFGKNCTEYNLINQELQKFAFEQDHCYFVTAEGLTSNPDGIHIDAISQ 216
>gi|406884852|gb|EKD32179.1| putative acetyl xylan esterase AxeA [uncultured bacterium]
Length = 273
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 23/177 (12%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
+ ILAGQSNMAGRG P P+ IL + K + ++A EPLH
Sbjct: 46 VFILAGQSNMAGRGFFE--------------PQDTIPSERILTINNKGEVIVAKEPLHY- 90
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYE--QMIQ-- 141
+ ++T G+ GL F +++ +P I L+P AIGG+++SQW G S Y Q++
Sbjct: 91 YEPSRT-GLDCGLSFGRELVSHIPENITILLIPAAIGGSSVSQWL-GDSTYRNVQLLTNF 148
Query: 142 RAQVAL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
R +VAL + G I+ +LW+QGE+D LYK R F R+ + LPI+
Sbjct: 149 REKVALGKKYGQIKGILWHQGETDATQ-NRIPLYKNRLSQLFEKFRAIADNEKLPIL 204
>gi|375363108|ref|YP_005131147.1| putative carbohydrate esterase [Bacillus amyloliquefaciens subsp.
plantarum CAU B946]
gi|371569102|emb|CCF05952.1| putative carbohydrate esterase [Bacillus amyloliquefaciens subsp.
plantarum CAU B946]
Length = 280
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 109/230 (47%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
L+ I +LW+QGESD+ L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LQ-SSQICGILWHQGESDSYRLLH-ETYYEKLTLIIETLRNELKLDDVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|421730905|ref|ZP_16170031.1| hypothetical protein WYY_07449 [Bacillus amyloliquefaciens subsp.
plantarum M27]
gi|407075059|gb|EKE48046.1| hypothetical protein WYY_07449 [Bacillus amyloliquefaciens subsp.
plantarum M27]
Length = 280
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDDVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|423683126|ref|ZP_17657965.1| carbohydrate esterase family 6 protein [Bacillus licheniformis
WX-02]
gi|383439900|gb|EID47675.1| carbohydrate esterase family 6 protein [Bacillus licheniformis
WX-02]
Length = 280
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVAAAGLTANPDGIHLDAASQ 213
>gi|15893819|ref|NP_347168.1| acetylxylan esterase-like protein [Clostridium acetobutylicum ATCC
824]
gi|337735745|ref|YP_004635192.1| acetylxylan esterase-like protein [Clostridium acetobutylicum DSM
1731]
gi|384457256|ref|YP_005669676.1| Acetylxylan esterase related enzyme [Clostridium acetobutylicum EA
2018]
gi|15023393|gb|AAK78508.1|AE007568_2 Acetylxylan esterase related enzyme [Clostridium acetobutylicum
ATCC 824]
gi|325507945|gb|ADZ19581.1| Acetylxylan esterase related enzyme [Clostridium acetobutylicum EA
2018]
gi|336290157|gb|AEI31291.1| acetylxylan esterase-like protein [Clostridium acetobutylicum DSM
1731]
Length = 282
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 107/232 (46%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VP +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFINE------------VPMIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A K +IGL+PCA GG++I +W L+ + A+ A
Sbjct: 50 PVS---GISLAGSFADAWSQKNQE-DIIGLIPCAEGGSSIDEWALDGVLFRHALTEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
+ + +LW+QGESD++N + K+Y ++ + LR +L P +PII
Sbjct: 106 ME-SSELTGILWHQGESDSLN-GNYKVYYKKLLLIIEALRKELNVPDIPIIIGGLGDFLG 163
Query: 198 --RVALASGEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
R E FI + ++K D N V A GL PDG+H+ +Q
Sbjct: 164 KERFGKGCTEYNFINKELQKFAFEQD--NCYFVTASGLTCNPDGIHIDAISQ 213
>gi|150018418|ref|YP_001310672.1| hypothetical protein Cbei_3596 [Clostridium beijerinckii NCIMB
8052]
gi|149904883|gb|ABR35716.1| protein of unknown function DUF303, acetylesterase putative
[Clostridium beijerinckii NCIMB 8052]
Length = 282
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 105/232 (45%), Gaps = 35/232 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VP +LR +W + EP++ D
Sbjct: 5 FLMVGQSNMAGRGFIHE------------VPQIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A ++ IGL+PCA GG+ + +W L+ + A+ A
Sbjct: 50 HVS---GISLAGSFADA-WSRQNQEDTIGLIPCAEGGSTLDEWAVDGVLFRHAVTEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--- 203
+ + +LW+QGESD+VN + K+Y + + R +L +P +PII L
Sbjct: 106 ME-SSELTGILWHQGESDSVN-GNYKVYYNKLLLIIEAFRKELNAPDIPIIIGGLGEFLG 163
Query: 204 --------GEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
E FI E ++K D N V A GL PDG+H+ +Q
Sbjct: 164 KEGFGKSCTEYKFINEELQKFAFEQD--NCFFVTASGLTSNPDGIHIDAISQ 213
>gi|52081153|ref|YP_079944.1| carbohydrate esterase family 6 protein [Bacillus licheniformis DSM
13 = ATCC 14580]
gi|319644879|ref|ZP_07999112.1| hypothetical protein HMPREF1012_00145 [Bacillus sp. BT1B_CT2]
gi|442564237|ref|YP_006714140.2| acetylesterase [Bacillus licheniformis DSM 13 = ATCC 14580]
gi|52004364|gb|AAU24306.1| putative carbohydrate esterase family 6 protein [Bacillus
licheniformis DSM 13 = ATCC 14580]
gi|317392688|gb|EFV73482.1| hypothetical protein HMPREF1012_00145 [Bacillus sp. BT1B_CT2]
gi|440611551|gb|AAU41672.3| putative acetylesterase [Bacillus licheniformis DSM 13 = ATCC
14580]
Length = 280
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALAEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVAAAGLTANPDGIHLDAASQ 213
>gi|150391619|ref|YP_001321668.1| hypothetical protein Amet_3913 [Alkaliphilus metalliredigens QYMF]
gi|149951481|gb|ABR50009.1| protein of unknown function DUF303, acetylesterase putative
[Alkaliphilus metalliredigens QYMF]
Length = 282
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 108/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
+ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FFMLGQSNMAGRGFIHE------------VTPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A + IGL+PCA GG+++ +W +L++ I A+ A
Sbjct: 50 PVS---GISLAASFADAWCLQNQE-DTIGLIPCAEGGSSLDEWAVDQALFKHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--- 203
++ + +LW+QGESD++N + K+Y ++ + LR +L +P +P+I L
Sbjct: 106 IQ-SSELTGILWHQGESDSMN-GNYKVYYKKLFLIIEALRKELNAPDIPLIIGGLGDFLG 163
Query: 204 GEGPFIEIVRKAQLSSDL-------PNVRCVDAMGLPLEPDGLHLTTPAQ 246
EG I ++ +L N V A GL PDG+H+ +Q
Sbjct: 164 KEGFGISCTEYNFINQELQKFSFEQENCYFVTASGLTSNPDGIHIDAISQ 213
>gi|410725854|ref|ZP_11364156.1| hypothetical protein A370_02233 [Clostridium sp. Maddingley
MBC34-26]
gi|410601640|gb|EKQ56146.1| hypothetical protein A370_02233 [Clostridium sp. Maddingley
MBC34-26]
Length = 285
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VPP +LR +W + EP++ D
Sbjct: 5 FLMIGQSNMAGRGFIHE------------VPPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ FA+A + IGL+PCA GG+ + +W L+ I A+ A
Sbjct: 50 PVS---GISLAGSFADAWCRQNQE-DTIGLIPCAEGGSTLDEWAVEGVLFRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEG 206
++ + +LW+QGESD+ N + K+Y ++ + LR +L +P +PII L G
Sbjct: 106 MQ-NSKLTGILWHQGESDSAN-GNYKVYYKKLLLIIETLRKELSAPDIPIIIGGLGDFLG 163
Query: 207 ---------PFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ I ++ Q + + N V A GL PDG+H+ +Q
Sbjct: 164 KEGFGKSCTEYTLINQELQKFAFEQDNCYFVTASGLTSNPDGIHIDAISQ 213
>gi|256422794|ref|YP_003123447.1| hypothetical protein Cpin_3784 [Chitinophaga pinensis DSM 2588]
gi|256037702|gb|ACU61246.1| protein of unknown function DUF303 acetylesterase putative
[Chitinophaga pinensis DSM 2588]
Length = 280
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 102/231 (44%), Gaps = 39/231 (16%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
+L GQSNMAGRG + VP +LR +W L EP+H D
Sbjct: 5 FLLIGQSNMAGRG------------YSQEVPAIINEGIKVLR---NGRWQLMSEPIHND- 48
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
+ G+G F A P+ IG +PCA GGT++ W G L++ + +A++A
Sbjct: 49 --RSSAGIGLAGSFGAAWRMDHPDV-EIGFIPCADGGTSLDDWSVGGPLFDHALSQAKLA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEG 206
R T+ +LW+QGESD E A Y+ + + LR +L++ +P+I G G
Sbjct: 106 QR-SSTLAGILWHQGESDCFP-EKAAEYERKLKVIIDTLRQELRAADVPLI----VGGLG 159
Query: 207 PFIE------------IVRKAQL--SSDLPNVRCVDAMGLPLEPDGLHLTT 243
F+ +V +A L + P A GL PDGLH
Sbjct: 160 DFLTSGMYGKYFGAYPLVNEALLHYTQTAPLSYFATAEGLTSNPDGLHFNA 210
>gi|385265575|ref|ZP_10043662.1| hypothetical protein MY7_2341 [Bacillus sp. 5B6]
gi|385150071|gb|EIF14008.1| hypothetical protein MY7_2341 [Bacillus sp. 5B6]
Length = 280
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 110/231 (47%), Gaps = 33/231 (14%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPAGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTV-NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA--- 202
L+ I +LW+QGESD+ +L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LQ-SSQICGILWHQGESDSYRSLHET--YYEKITLVIETLRNELKLDEVPLIIGGLGDFL 162
Query: 203 --SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 163 GKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|359412446|ref|ZP_09204911.1| protein of unknown function DUF303 acetylesterase [Clostridium sp.
DL-VIII]
gi|357171330|gb|EHI99504.1| protein of unknown function DUF303 acetylesterase [Clostridium sp.
DL-VIII]
Length = 282
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 43/236 (18%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + VPP +LR +W + EP++ D
Sbjct: 5 FLMVGQSNMAGRGFIHE------------VPPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ G+ F++A + IGL+PCA GG+ + +W L+ I A+ A
Sbjct: 50 PVS---GISLAGSFSDA-WCRQNGEDTIGLIPCAEGGSTLDEWAVDEVLFRHAITEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEG 206
++ + +LW+QGESD++N + K+Y ++ + R +L +P +PII G G
Sbjct: 106 MQ-SSELTGILWHQGESDSLN-GNYKVYYKKLLLIIEAFRKELNAPDIPII----IGGLG 159
Query: 207 PFI----------------EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
F+ E ++K D N V A GL PDG+H+ +Q
Sbjct: 160 DFLGKEGFGKSCTEYKLINEELQKFAFEQD--NCYFVTASGLTSNPDGIHINAISQ 213
>gi|423082593|ref|ZP_17071182.1| hypothetical protein HMPREF1122_02170 [Clostridium difficile
002-P50-2011]
gi|423087112|ref|ZP_17075502.1| hypothetical protein HMPREF1123_02655 [Clostridium difficile
050-P50-2011]
gi|357545361|gb|EHJ27336.1| hypothetical protein HMPREF1123_02655 [Clostridium difficile
050-P50-2011]
gi|357547711|gb|EHJ29586.1| hypothetical protein HMPREF1122_02170 [Clostridium difficile
002-P50-2011]
Length = 282
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG ++ V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFISE------------VTPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GV FA+A + IGL+PCA GG+++ +W L++ I A+ A
Sbjct: 50 PVS---GVSLAASFADAWCCENQE-DRIGLIPCAEGGSSLDEWNIDGILFKHAISEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
++ + +LW+QGE+D+ N + K Y ++ LR +L P +PII
Sbjct: 106 IQ-SSELTGILWHQGENDSNN-SNYKFYYKKLLSIIEALRKELNVPDIPIIIGGLGDFLG 163
Query: 198 RVALASGEGPFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+V ++ I ++ Q + + N V A GL PDG+H+ +Q
Sbjct: 164 KVGFGKSCTEYVFINQELQKFAFEQDNCYFVTATGLTSNPDGIHIDAISQ 213
>gi|299144956|ref|ZP_07038024.1| acetyl xylan esterase A [Bacteroides sp. 3_1_23]
gi|336412834|ref|ZP_08593187.1| hypothetical protein HMPREF1017_00295 [Bacteroides ovatus
3_8_47FAA]
gi|298515447|gb|EFI39328.1| acetyl xylan esterase A [Bacteroides sp. 3_1_23]
gi|335942880|gb|EGN04722.1| hypothetical protein HMPREF1017_00295 [Bacteroides ovatus
3_8_47FAA]
Length = 266
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/251 (31%), Positives = 120/251 (47%), Gaps = 47/251 (18%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
LL + +V PVK L + GQSNMAGRG ++ P
Sbjct: 14 LLGIPMVYAGKPVK----NMDLYLCIGQSNMAGRGKLS--------------PAVMDTMQ 55
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAI 121
++ L A+ ++ LA PL+ + + +GP FA A+ +K +GL+ A
Sbjct: 56 NVYLLNAEDQFELAVNPLNRYSTIGRGLTGEYLGPVYSFAKAMASKK---HPVGLIVNAR 112
Query: 122 GGTNISQWRK-----GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKE 176
GGT+I W K G Y + ++R + A++ G ++A++W+QGE+D E YK+
Sbjct: 113 GGTSIRSWLKSTEKTGGLYYNEALRRTKEAMKYG-KLKAIIWHQGEADCQYPEG---YKK 168
Query: 177 RSDMFFTDLRSDLQSPLLPIIRVALA-----------SGEGPFIEIVRKAQLSSDLPNVR 225
+ TDLR+DL P LP+I LA G PF ++++ +SS LPN
Sbjct: 169 KIIKLMTDLRNDLGIPDLPVIVGQLAEWNWTKKPYIPEGTKPFNDMIK--DISSFLPNSA 226
Query: 226 CVDAMGL-PLE 235
CV + GL PL+
Sbjct: 227 CVSSEGLKPLK 237
>gi|386814829|ref|ZP_10102047.1| protein of unknown function DUF303 acetylesterase [Thiothrix nivea
DSM 5205]
gi|386419405|gb|EIJ33240.1| protein of unknown function DUF303 acetylesterase [Thiothrix nivea
DSM 5205]
Length = 247
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 103/233 (44%), Gaps = 33/233 (14%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIV-PPQCQPNPSILRLTAKLKWVLAHEP 81
+ +LIILAGQSNM GRG V + T K T + Q + +P AK W
Sbjct: 23 KDRLIILAGQSNMMGRGKVNDLPATYKTTPANVTFFYQGREHP-----LAKFAW------ 71
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQ 141
GP + FA+ V PN +I LV A G+ I QW+ G LY+ +++
Sbjct: 72 ------------FGPEVSFAHDVARAFPNDHII-LVKQAASGSLIQQWQPGQGLYKALLR 118
Query: 142 RAQVALRG--GGTIRAVLWYQGESDTVNLED-AKLYKERSDMFFTDLRSDLQSP--LLPI 196
+ A G + A+LW QGESD + D A Y R + LR DLQSP L
Sbjct: 119 QVGFATDAEENGKVDAILWMQGESDARSAPDVANQYGSRFATLVSSLRKDLQSPDSLFIY 178
Query: 197 IRVALASGE-GPFIEIVRKAQLS--SDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+V+L E IE VR Q S S L N + L DG+H Q
Sbjct: 179 GQVSLEHPEHNDTIESVRSQQKSAQSQLANALMIPTDNLGKLDDGIHFNAAGQ 231
>gi|154686835|ref|YP_001421996.1| hypothetical protein RBAM_024050 [Bacillus amyloliquefaciens FZB42]
gi|154352686|gb|ABS74765.1| hypothetical protein RBAM_024050 [Bacillus amyloliquefaciens FZB42]
Length = 280
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 68/230 (29%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + + A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSETRFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
LR I +LW+QGESD+ + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIGTLRNELKLDEVPLIIGGLGDFLG 163
Query: 203 -SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 164 KTGFGQHATEFRQVNEQLLRFANEQQNCYFVTATGLTANPDGIHLDAASQ 213
>gi|384266187|ref|YP_005421894.1| putative carbohydrate esterase [Bacillus amyloliquefaciens subsp.
plantarum YAU B9601-Y2]
gi|380499540|emb|CCG50578.1| putative carbohydrate esterase [Bacillus amyloliquefaciens subsp.
plantarum YAU B9601-Y2]
Length = 280
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 110/231 (47%), Gaps = 33/231 (14%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTV-NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA--- 202
L+ I +LW+QGESD+ +L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LQ-SSQICGILWHQGESDSYRSLHET--YYEKLTLIIETLRNELKLDEVPLIIGGLGDFL 162
Query: 203 --SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A GL PDG+HL +Q
Sbjct: 163 GKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDAASQ 213
>gi|374295860|ref|YP_005046051.1| CBM6-containing protein,glycosyl hydrolase family 11,dockerin-like
protein [Clostridium clariflavum DSM 19732]
gi|359825354|gb|AEV68127.1| CBM6-containing protein,glycosyl hydrolase family 11,dockerin-like
protein [Clostridium clariflavum DSM 19732]
Length = 697
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 101/225 (44%), Gaps = 37/225 (16%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTA-KLKWVLAHEPLHAD 85
+L GQSNMAG W PNP IL L +W +A PLH
Sbjct: 478 FLLLGQSNMAG--------------WARAQDSDKIPNPRILALGYDNNQWGVAVPPLHEA 523
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-GSSLYEQMIQRAQ 144
+GPG FA ++ ++P IGL+PCAI G I + K G S Y ++ RA+
Sbjct: 524 FQ----GAIGPGDWFAKTIIERLPENDTIGLIPCAISGEKIETFMKNGGSKYNWIVSRAR 579
Query: 145 VALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL--- 201
+A + GG I +L++QGES+ + + + +DL+ DL +P++ L
Sbjct: 580 MAQQRGGVIEGILFHQGESNNGQQD----WPNKVSTLISDLKKDLGLGDIPVLVGELLYT 635
Query: 202 --ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPD---GLHL 241
+G + +L S +PN + A GL +P GLH
Sbjct: 636 GSCAGHNTLVN-----RLPSMIPNCYVISAQGLSGDPADFWGLHF 675
>gi|451346218|ref|YP_007444849.1| hypothetical protein KSO_007355 [Bacillus amyloliquefaciens IT-45]
gi|449849976|gb|AGF26968.1| hypothetical protein KSO_007355 [Bacillus amyloliquefaciens IT-45]
Length = 280
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 110/231 (47%), Gaps = 33/231 (14%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLNE------------VDPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GVG FA+A P+ IGL+PCA GG++++ W+ L++ + A+ A
Sbjct: 50 PVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWQPEGILFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTV-NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA--- 202
LR I +LW+QGESD+ +L + Y E+ + LR++L+ +P+I L
Sbjct: 106 LR-SSQICGILWHQGESDSYRSLHET--YYEKLTLIIETLRNELKLDDVPLIIGGLGDFL 162
Query: 203 --SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+G G R+ + +++ N V A L PDG+HL +Q
Sbjct: 163 GKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAADLTANPDGIHLDAASQ 213
>gi|410458184|ref|ZP_11311946.1| hypothetical protein BAZO_03390 [Bacillus azotoformans LMG 9581]
gi|409931689|gb|EKN68667.1| hypothetical protein BAZO_03390 [Bacillus azotoformans LMG 9581]
Length = 280
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 103/230 (44%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFLHE------------VEPIYNEKIKMLR---NGQWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V GV FA A P+ IGL+PCA GG++++ W +L++ + A+ A
Sbjct: 50 PVA---GVSLAASFAEAWSKAQPD-EEIGLIPCAEGGSSLNDWHPQGTLFQHALSEARFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA---- 202
L I +LW+QGESD+ N + Y E+ LR +L +P+I L
Sbjct: 106 LE-TSEICGILWHQGESDSNN-SLHETYYEKLSFIIETLRKELNLQNVPLIIGELGDFLG 163
Query: 203 -SGEG----PFIEIVRK-AQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
SG G F EI + Q + + N V A GL PDG+H +Q
Sbjct: 164 KSGFGKYSTEFQEINEQLRQFAHEQQNCYFVSAEGLTANPDGIHFNAISQ 213
>gi|332668480|ref|YP_004451496.1| hypothetical protein Halhy_6810 [Haliscomenobacter hydrossis DSM
1100]
gi|332337525|gb|AEE54623.1| protein of unknown function DUF303 acetylesterase
[Haliscomenobacter hydrossis DSM 1100]
Length = 271
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 108/231 (46%), Gaps = 29/231 (12%)
Query: 26 LIILAGQSNMAGRGGV-TNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+ +LAGQSNMAGRG V DT ++ P I + A+ + ++A EPLH
Sbjct: 43 VFLLAGQSNMAGRGLVEAQDTVSD---------------PRIFSINAQAEVIVAKEPLH- 86
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQR-- 142
G+ GL F A+L VP I L+P A+GG+ + QW S+ E +
Sbjct: 87 -FYEPGRAGLDCGLSFGKALLKGVPKKVSILLLPTAVGGSAMRQWLGDSTYREVKLWSNF 145
Query: 143 -AQVAL-RGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
+VAL + G I+AVLW+QGESD N ++ LY E + R + SP LP++
Sbjct: 146 LEKVALGKKHGRIKAVLWHQGESDA-NDKNIPLYPENLARLLQNFRRAVGSPQLPVLMGE 204
Query: 201 L-ASGEGP----FIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
L A + P I + A + D P + L + D +H + Q
Sbjct: 205 LGAFSQNPQQWQKINQLINAHAAKD-PFTTVISTQDLQHKGDKIHFNSAGQ 254
>gi|376260261|ref|YP_005146981.1| putative glycosylase [Clostridium sp. BNL1100]
gi|373944255|gb|AEY65176.1| putative glycosylase [Clostridium sp. BNL1100]
Length = 776
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 104/219 (47%), Gaps = 26/219 (11%)
Query: 25 QLIILAGQSNMAGRGGV-----TNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+L GQSNM G D R L +D N ++ R+T + W +A
Sbjct: 546 HCFLLLGQSNMVGYAASQASDKVEDPRVLVLGFDN--------NAALGRVTDQ--WDVAC 595
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-GSSLYEQ 138
PLHA + + +GPG F ++ KVP+ IGL+PCAI G I + K G + Y
Sbjct: 596 PPLHA----SWLDAIGPGDWFGKTMIQKVPSGDTIGLIPCAISGEKIETFMKSGGTKYSW 651
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+I RA++A + GG I ++++QGES++ + + + DLR+DL +P I
Sbjct: 652 IINRAKLAQQKGGVIEGIIFHQGESNSGDTS----WPGKVKTLVNDLRTDLNLGNVPFIA 707
Query: 199 VALASGEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEP 236
L GP R QL S + N V A GL ++P
Sbjct: 708 GELLY-SGPCAGHNTRVNQLPSLITNSYVVSADGLVVDP 745
>gi|383120522|ref|ZP_09941250.1| hypothetical protein BSIG_2470 [Bacteroides sp. 1_1_6]
gi|251840427|gb|EES68509.1| hypothetical protein BSIG_2470 [Bacteroides sp. 1_1_6]
Length = 236
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 48/251 (19%)
Query: 4 WLLCLIL--VSEAWPVKCQYQQQQLIILAGQSNMAGRGGVT---NDTRTNK--LTWDGIV 56
+LLC+++ SEA K + L + GQSNMAGRG ++ DT N L D
Sbjct: 10 FLLCVLVWGRSEAHAEK-PLKTLDLYLCIGQSNMAGRGKLSPEVMDTLQNVYLLNADDQF 68
Query: 57 PPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGL 116
P P + L W VGP FA + TK +GL
Sbjct: 69 EPAVNPLNRYSTIGKGLSW----------------QQVGPAYGFAKTMATKK---HPVGL 109
Query: 117 VPCAIGGTNISQWRKGSS----LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAK 172
+ A GG++I W K + Y++ I+RA+ A++ G T++A++W+QGE+D + E
Sbjct: 110 IVNARGGSSIRSWVKNAKQSGGYYDEAIRRAKEAMKYG-TLKAIIWHQGEADCHHPE--- 165
Query: 173 LYKERSDMFFTDLRSDLQSPLLPII-----------RVALASGEGPFIEIVRKAQLSSDL 221
YKE+ TDLR+DL P LP++ + + G PF ++++ ++S+ L
Sbjct: 166 AYKEKIIQLMTDLRNDLGMPDLPVVVGQIAQWNWTKKPYIPEGTKPFNDMIK--EISTFL 223
Query: 222 PNVRCVDAMGL 232
P+ CV L
Sbjct: 224 PHSACVSPKDL 234
>gi|220928667|ref|YP_002505576.1| family 6 carbohydrate binding protein [Clostridium cellulolyticum
H10]
gi|219998995|gb|ACL75596.1| Carbohydrate binding family 6 [Clostridium cellulolyticum H10]
Length = 780
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 103/223 (46%), Gaps = 34/223 (15%)
Query: 25 QLIILAGQSNMAGRGGV-----TNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+L GQSNMAG D R L +D N ++ R+T K W +A
Sbjct: 550 HCFLLLGQSNMAGYAAAQASDKVEDPRVLVLGYDN--------NAALGRVTDK--WDVAC 599
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-GSSLYEQ 138
PLHA + + VGPG F ++ KVP+ IGL+PCAI G I + K G + Y
Sbjct: 600 PPLHA----SWLDAVGPGDWFGKTMIQKVPSGDTIGLIPCAISGEKIETFMKSGGTKYNW 655
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
+I RA++A GG I ++++QGES++ + + + DLR DL +P I
Sbjct: 656 IINRAKLAQEKGGVIDGIIFHQGESNSGDPS----WPGKVKTLVEDLRKDLNLGNVPFIA 711
Query: 199 VAL-----ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEP 236
L +G + QL S + N V A GL ++P
Sbjct: 712 GELLYSGPCAGHNTLVN-----QLPSLITNSYVVSADGLVVDP 749
>gi|376261580|ref|YP_005148300.1| dockerin-like protein [Clostridium sp. BNL1100]
gi|373945574|gb|AEY66495.1| dockerin-like protein [Clostridium sp. BNL1100]
Length = 330
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 104/229 (45%), Gaps = 25/229 (10%)
Query: 15 WPVKCQYQQQ-QLIILAGQSNMAGR-----GGVTNDTRTNKLTWDGIVPPQCQPNPSILR 68
+PV Q + +L GQSNM G D R L +D NP++ R
Sbjct: 89 FPVDSVTQPKFHCFLLLGQSNMEGYPKALASDKVEDPRVLVLGYDN--------NPALGR 140
Query: 69 LTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ 128
+T + W +A PLH+ +GPG FA ++ K+P IGL+PCAI G I
Sbjct: 141 VTDQ--WDIACPPLHS----TYQGAIGPGDWFAKTIVEKIPAGDTIGLIPCAINGERIET 194
Query: 129 WRK-GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRS 187
+ K G S Y ++ RA++A + GG I +L++QGES+ + + + + DL+
Sbjct: 195 FLKSGGSKYNWIVNRAKLAQQKGGVIEGILFHQGESNNGDTT----WPGKVNTLVEDLKK 250
Query: 188 DLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEP 236
DL +P I L +L S + N V A GL ++P
Sbjct: 251 DLNLGDIPFIAGELLYSGSCAGHNTLVNKLPSIVKNCSVVSASGLVVDP 299
>gi|167745721|ref|ZP_02417848.1| hypothetical protein ANACAC_00414 [Anaerostipes caccae DSM 14662]
gi|167654752|gb|EDR98881.1| hypothetical protein ANACAC_00414 [Anaerostipes caccae DSM 14662]
Length = 255
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 90/194 (46%), Gaps = 17/194 (8%)
Query: 63 NPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIG 122
N +IL L +W + EP+H D V GVGP FA A IGL+PCA G
Sbjct: 5 NENILMLRNG-RWQMMSEPIHFDRSVA---GVGPAASFAQA-WCNANESEQIGLIPCAEG 59
Query: 123 GTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G++I +W +L+ I A+ A++ I A+LW+QGESD+ + E K Y + D+
Sbjct: 60 GSSIDEWNAEETLFCHAISEAKFAMKTSELI-AILWHQGESDS-HSEKYKDYYRKLDVLV 117
Query: 183 TDLRSDLQSPLLPIIRVALAS--GEGPF------IEIVRKAQLSSDLPNVRCVDAMGLPL 234
R +L +P I L G+ F +++ + L N C G L
Sbjct: 118 NSFRKELGVTEVPFIVGGLGDYLGKSGFGRSCVEYDLINQELLRYAENNRNCYFVTGERL 177
Query: 235 --EPDGLHLTTPAQ 246
PDG+H+ +Q
Sbjct: 178 YSNPDGIHINAESQ 191
>gi|427385159|ref|ZP_18881664.1| hypothetical protein HMPREF9447_02697 [Bacteroides oleiciplenus YIT
12058]
gi|425727327|gb|EKU90187.1| hypothetical protein HMPREF9447_02697 [Bacteroides oleiciplenus YIT
12058]
Length = 752
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 29/182 (15%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNK-----LTWDGIVPPQCQPNPSILRLTAKLKWVL 77
Q L + GQSNMAGRG +T++ + + LT +G + P P +
Sbjct: 520 QLDLFLFIGQSNMAGRGYITDNYKGSIKDVYLLTPNGDMEPARNP-------------LN 566
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SL 135
+ + ID+ GVGP FA A+ K + +GLV A GG++I+ W KG+
Sbjct: 567 KYSTIRKQIDLQ---GVGPAYSFAKAIADKTKH--KLGLVVNARGGSSINSWLKGAKDDY 621
Query: 136 YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLP 195
Y + + R + A++ G T++A++W+QGE+D+ N E Y + DLR DL LP
Sbjct: 622 YGEALSRIRQAMKYG-TLKAIIWHQGEADSRNPE---AYMAKLQKLVADLREDLGDTKLP 677
Query: 196 II 197
+I
Sbjct: 678 VI 679
>gi|189465102|ref|ZP_03013887.1| hypothetical protein BACINT_01446 [Bacteroides intestinalis DSM
17393]
gi|189437376|gb|EDV06361.1| hypothetical protein BACINT_01446 [Bacteroides intestinalis DSM
17393]
Length = 752
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 92/182 (50%), Gaps = 29/182 (15%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNK-----LTWDGIVPPQCQPNPSILRLTAKLKWVL 77
Q L + GQSNMAGRG +T++ + + LT +G + P P +
Sbjct: 520 QLDLFLFIGQSNMAGRGYITDNYKGSIKDVYLLTPNGDMEPARNP-------------LN 566
Query: 78 AHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS--L 135
+ + ID+ GVGP FA A+ K + +GLV A GG++I+ W KG+
Sbjct: 567 KYSTIRKQIDLQ---GVGPAYSFAKAIADKTKH--KLGLVVNARGGSSINSWLKGAKDDY 621
Query: 136 YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLP 195
Y + + R + A++ G T++A++W+QGE+D+ N E Y + DLR DL LP
Sbjct: 622 YGEALSRIRQAMKYG-TLKAIIWHQGEADSRNPE---AYMAKLQKLVADLREDLGDTKLP 677
Query: 196 II 197
+I
Sbjct: 678 VI 679
>gi|427383536|ref|ZP_18880256.1| hypothetical protein HMPREF9447_01289 [Bacteroides oleiciplenus YIT
12058]
gi|425728720|gb|EKU91575.1| hypothetical protein HMPREF9447_01289 [Bacteroides oleiciplenus YIT
12058]
Length = 261
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 105/220 (47%), Gaps = 36/220 (16%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNK-----LTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+ + GQSNMAGRG +T++ + + LT G + P P + +
Sbjct: 29 DIFLFIGQSNMAGRGYITDNYKDSIDNVYLLTPTGDMEPASNP-------------LNKY 75
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SLYE 137
+ D+ K GVGP F+ + K + +GLV A GGT+I W KG+ + Y
Sbjct: 76 STIRKDL---KMQGVGPAYSFSKTIAKKTGH--KLGLVVNARGGTSIHSWLKGAEANYYG 130
Query: 138 QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
+ + R + A++ G T++A++W+QGESD+ + E Y + TDLR DL + LP I
Sbjct: 131 EALSRIRQAMKYG-TLKAIIWHQGESDSRHPE---TYMAKLQKLVTDLRKDLGNEDLPFI 186
Query: 198 RVALA-----SGEGPFIEIVRKAQLSSDLPNVRCVDAMGL 232
+A F +++R + +PN CV + L
Sbjct: 187 VGEIAEWSTDDSSEAFNKMLR--TVPQHIPNSYCVSSKEL 224
>gi|218131674|ref|ZP_03460478.1| hypothetical protein BACEGG_03295 [Bacteroides eggerthii DSM 20697]
gi|217985977|gb|EEC52316.1| hypothetical protein BACEGG_03295 [Bacteroides eggerthii DSM 20697]
Length = 752
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 88/177 (49%), Gaps = 25/177 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGRG +T++ +++ + LT A PL+
Sbjct: 523 LFLFIGQSNMAGRGYITDNYKSSI--------------KDVYLLTPTGTMEQARNPLNKY 568
Query: 86 IDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SLYEQMI 140
+ K GVGP FA A+ K + +GLV A GG++I+ W KG+ Y + +
Sbjct: 569 STIRKQLDLQGVGPAYSFAKAITEKTGH--QLGLVVNARGGSSINSWLKGARDDYYGEAL 626
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
R + A++ G ++A++W+QGESD+ + LY E+ DLR DL LP+I
Sbjct: 627 SRIRQAMK-YGKVKAIIWHQGESDS---REPGLYMEKLKKLVADLRQDLGDEKLPVI 679
>gi|384500310|gb|EIE90801.1| hypothetical protein RO3G_15512 [Rhizopus delemar RA 99-880]
Length = 427
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 82/188 (43%), Gaps = 39/188 (20%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH---- 83
++AGQSNM G G + N L P +I + KW+ A EP H
Sbjct: 55 VMAGQSNMRGHGFLRNPFDNQSLV--------ISPVNNICLYASNEKWMEASEPTHNLFA 106
Query: 84 --------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQW 129
A+ D+ K G GL FA ++ N +GLV CA GGT++ W
Sbjct: 107 SPRAVHHTLPDPTVANPDICKFRGASLGLAFAKE-YQRLNNGIPVGLVACAHGGTSLEDW 165
Query: 130 RK---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDM 180
++ ++LY MI + G + +LWYQGESD V LE +K Y ER
Sbjct: 166 QRPEEINKNTAQTTLYGAMIDKIHAI---GNHVAGILWYQGESDAVKLETSKTYYERFQH 222
Query: 181 FFTDLRSD 188
+ LR+D
Sbjct: 223 WLDLLRAD 230
>gi|317474704|ref|ZP_07933978.1| hypothetical protein HMPREF1016_00957 [Bacteroides eggerthii
1_2_48FAA]
gi|316909385|gb|EFV31065.1| hypothetical protein HMPREF1016_00957 [Bacteroides eggerthii
1_2_48FAA]
Length = 752
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 88/177 (49%), Gaps = 25/177 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGRG +T++ +++ + LT A PL+
Sbjct: 523 LFLFIGQSNMAGRGYITDNYKSSI--------------KDVYLLTPTGTMEQARNPLNKY 568
Query: 86 IDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SLYEQMI 140
+ K GVGP FA A+ K + +GLV A GG++I+ W KG+ Y + +
Sbjct: 569 STIRKQLDLQGVGPAYSFAKAITEKTGH--QLGLVVNARGGSSINSWLKGARDDYYGEAL 626
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
R + A++ G ++A++W+QGESD+ + LY E+ DLR DL LP+I
Sbjct: 627 SRIRQAMK-YGKVKAIIWHQGESDS---REPGLYMEKLKKLVADLRQDLGDEKLPVI 679
>gi|323453542|gb|EGB09413.1| hypothetical protein AURANDRAFT_62998 [Aureococcus anophagefferens]
Length = 309
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 76/149 (51%), Gaps = 20/149 (13%)
Query: 22 QQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVL-AHE 80
Q + +LAGQSNMAGRG + +D T + P + + A W AH
Sbjct: 24 QPVHVFLLAGQSNMAGRGVLADDATTREA-------PALDDRIFVWKDGA---WAAPAHH 73
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFG-VIGLVPCAIGGTNISQWR-KGSSLYEQ 138
PLH+D D T GVGPGL FA ++ +P +GLVPCA+GGT I++W G L+
Sbjct: 74 PLHSDKD---TAGVGPGLSFAREIIQALPAAERCVGLVPCAVGGTAIARWEPDGGDLFAA 130
Query: 139 MIQRAQVALRGGGTIRA----VLWYQGES 163
A+ ++ A VLW+QGES
Sbjct: 131 AADAAKASVEASAAADARLSGVLWHQGES 159
>gi|345858243|ref|ZP_08810645.1| acetylxylan esterase related enzyme [Desulfosporosinus sp. OT]
gi|344328653|gb|EGW40029.1| acetylxylan esterase related enzyme [Desulfosporosinus sp. OT]
Length = 236
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 89/177 (50%), Gaps = 16/177 (9%)
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQM 139
EP++ D V+ G G FA+A K P IGL+PCA GG+++ W S L++
Sbjct: 4 EPVNFDRPVS---GAGLAASFADAWCLKYPE-DTIGLIPCAEGGSSLDDWSVDSELFQHA 59
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
+ + A++ T+ +LW+QGESD+ + + K+Y E+ + LR L +P +P I
Sbjct: 60 VSETKFAMK-NSTLTGILWHQGESDSSDGK-YKVYYEKLSIIVQALRDILNAPEIPFIIG 117
Query: 200 ALAS--GEGPFIEIVRKAQLSSD------LPNVRC--VDAMGLPLEPDGLHLTTPAQ 246
L G+ F + + + +D L C V A GL PDG+HL + +Q
Sbjct: 118 GLGDFLGKTGFGQYCVEYERINDCLQKFALEQAHCYFVSAQGLAANPDGIHLNSLSQ 174
>gi|408672452|ref|YP_006872200.1| protein of unknown function DUF303 acetylesterase [Emticicia
oligotrophica DSM 17448]
gi|387854076|gb|AFK02173.1| protein of unknown function DUF303 acetylesterase [Emticicia
oligotrophica DSM 17448]
Length = 275
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 91/184 (49%), Gaps = 25/184 (13%)
Query: 26 LIILAGQSNMAGRGGVT-NDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+ ++AGQSNMAGRG V NDT TN IL + + + A EPLH
Sbjct: 47 VFVMAGQSNMAGRGQVEPNDTITN---------------SRILTINKQGDLIYAKEPLHF 91
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS-----LYEQM 139
+ +T G+ GL FAN +L +P+ I L+P A+GG+ I QW S+ L
Sbjct: 92 -YEPTRT-GLDCGLSFANNLLKNIPHDVSILLIPTAVGGSAIGQWLGDSTYRDVKLLTNF 149
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRV 199
++ + ++ G +R +LW+QGESD A +++E F R + + LPII
Sbjct: 150 KEKVAIGMK-YGIVRGILWHQGESDASPKRIA-VHEENLKSLFGTFRKTVGNSKLPIILG 207
Query: 200 ALAS 203
L S
Sbjct: 208 ELGS 211
>gi|418577045|ref|ZP_13141177.1| hypothetical protein SSME_22330 [Staphylococcus saprophyticus
subsp. saprophyticus KACC 16562]
gi|379324710|gb|EHY91856.1| hypothetical protein SSME_22330 [Staphylococcus saprophyticus
subsp. saprophyticus KACC 16562]
Length = 267
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 93/225 (41%), Gaps = 37/225 (16%)
Query: 35 MAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGV 94
MAGRG + VPP +LR KW + EP+H+D V G+
Sbjct: 1 MAGRGFIDE------------VPPIIDERMMMLR---NGKWQMMEEPIHSDRSVA---GI 42
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIR 154
GP FA L + PN IGL+PCA GGT I W L + A A I
Sbjct: 43 GPAASFAKLWLDEHPN-ETIGLIPCADGGTTIDDWAPDQILTRHALSEATFAQETSEII- 100
Query: 155 AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII---------RVALASGE 205
+LW+QGESD++N + K Y ++ R L +P I + A
Sbjct: 101 GILWHQGESDSLN-QRYKDYDKKLKTLINYFREQLNIHEVPFIVGLLPDFLGKAAFGQSA 159
Query: 206 GPFIEI----VRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+++I R QL++ N V A + PD +H+ +Q
Sbjct: 160 VEYLQINEALKRVTQLTT---NCYYVTAQDITANPDAIHINANSQ 201
>gi|67464405|pdb|1ZMB|A Chain A, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
gi|67464406|pdb|1ZMB|B Chain B, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
gi|67464407|pdb|1ZMB|C Chain C, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
gi|67464408|pdb|1ZMB|D Chain D, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
gi|67464409|pdb|1ZMB|E Chain E, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
gi|67464410|pdb|1ZMB|F Chain F, Crystal Structure Of The Putative Acetylxylan Esterase
From Clostridium Acetobutylicum, Northeast Structural
Genomics Target Car6
Length = 290
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 101/228 (44%), Gaps = 35/228 (15%)
Query: 31 GQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNK 90
GQSN AGRG + VP LR +W EP++ D V+
Sbjct: 9 GQSNXAGRGFINE------------VPXIYNERIQXLR---NGRWQXXTEPINYDRPVS- 52
Query: 91 TNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGG 150
G+ FA+A K +IGL+PCA GG++I +W L+ + A+ A
Sbjct: 53 --GISLAGSFADAWSQKNQE-DIIGLIPCAEGGSSIDEWALDGVLFRHALTEAKFAXE-S 108
Query: 151 GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII-----------RV 199
+ +LW+QGESD++N + K+Y ++ + LR +L P +PII R
Sbjct: 109 SELTGILWHQGESDSLN-GNYKVYYKKLLLIIEALRKELNVPDIPIIIGGLGDFLGKERF 167
Query: 200 ALASGEGPFI-EIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
E FI + ++K D N V A GL PDG+H+ +Q
Sbjct: 168 GKGCTEYNFINKELQKFAFEQD--NCYFVTASGLTCNPDGIHIDAISQ 213
>gi|329956438|ref|ZP_08297035.1| hypothetical protein HMPREF9445_01896 [Bacteroides clarus YIT
12056]
gi|328524335|gb|EGF51405.1| hypothetical protein HMPREF9445_01896 [Bacteroides clarus YIT
12056]
Length = 752
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 89/177 (50%), Gaps = 25/177 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGRG +T++ +++ + LT A PL+
Sbjct: 523 LFLFIGQSNMAGRGYITDNYKSSI--------------KDVYLLTPTGTMEQARNPLNKY 568
Query: 86 IDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SLYEQMI 140
+ K GVGP FA A+ K + +GLV A GG++I+ W KG+ Y + +
Sbjct: 569 STIRKQLDLQGVGPAYSFAKAITEKTGH--QLGLVVNARGGSSINSWLKGARDDYYGEAL 626
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
R + A++ G ++A++W+QGESD+ + LY E+ DLR D+ + LP+I
Sbjct: 627 SRIRQAMK-YGKLKAIIWHQGESDS---REPGLYMEKLKKLVADLRQDVGNENLPVI 679
>gi|449095084|ref|YP_007427575.1| hypothetical protein C663_2478 [Bacillus subtilis XF-1]
gi|449028999|gb|AGE64238.1| hypothetical protein C663_2478 [Bacillus subtilis XF-1]
Length = 268
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 91/183 (49%), Gaps = 16/183 (8%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+W + EP++ D V+ GVG FA+A P+ IGL+PCA GG++++ W
Sbjct: 25 QWQMMTEPINYDRPVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEG 80
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L++ + A+ ALR I +LW+QGESD+ + Y E+ + LR++L+
Sbjct: 81 ILFQHALSEARFALR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELELDE 138
Query: 194 LPIIRVALA-----SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+P+I L +G G R+ + +++ N V A GL PDG+HL
Sbjct: 139 VPLIIGGLGDFLGKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDA 198
Query: 244 PAQ 246
+Q
Sbjct: 199 ASQ 201
>gi|429505986|ref|YP_007187170.1| hypothetical protein B938_12430 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
gi|429487576|gb|AFZ91500.1| hypothetical protein B938_12430 [Bacillus amyloliquefaciens subsp.
plantarum AS43.3]
Length = 254
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 91/183 (49%), Gaps = 16/183 (8%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+W + EP++ D V+ GVG FA+A P+ IGL+PCA GG++++ W
Sbjct: 11 QWQMMTEPINYDRPVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEG 66
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L++ + A+ ALR I +LW+QGESD+ + Y E+ + LR++L+
Sbjct: 67 ILFQHALSEARFALR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIGTLRNELELDE 124
Query: 194 LPIIRVALA-----SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+P+I L +G G R+ + +++ N V A GL PDG+HL
Sbjct: 125 VPLIIGGLGDFLGKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDA 184
Query: 244 PAQ 246
+Q
Sbjct: 185 ASQ 187
>gi|443631913|ref|ZP_21116093.1| hypothetical protein BSI_11640 [Bacillus subtilis subsp.
inaquosorum KCTC 13429]
gi|443348028|gb|ELS62085.1| hypothetical protein BSI_11640 [Bacillus subtilis subsp.
inaquosorum KCTC 13429]
Length = 268
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 91/183 (49%), Gaps = 16/183 (8%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+W + EP++ D V+ GVG FA+A P+ IGL+PCA GG++++ W
Sbjct: 25 QWQMMTEPINYDRPVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEG 80
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L++ + A+ ALR I +LW+QGESD+ + Y E+ + LR++L+
Sbjct: 81 ILFQHALSEARFALR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDE 138
Query: 194 LPIIRVALA-----SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+P+I L +G G R+ + +++ N V A GL PDG+HL
Sbjct: 139 VPLIIGGLGDFLGKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDA 198
Query: 244 PAQ 246
+Q
Sbjct: 199 ASQ 201
>gi|89896499|ref|YP_519986.1| hypothetical protein DSY3753 [Desulfitobacterium hafniense Y51]
gi|219667646|ref|YP_002458081.1| hypothetical protein Dhaf_1596 [Desulfitobacterium hafniense DCB-2]
gi|89335947|dbj|BAE85542.1| hypothetical protein [Desulfitobacterium hafniense Y51]
gi|219537906|gb|ACL19645.1| protein of unknown function DUF303 acetylesterase putative
[Desulfitobacterium hafniense DCB-2]
Length = 281
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 104/233 (44%), Gaps = 37/233 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG ND VPP +LR + EP++ D
Sbjct: 5 FLMIGQSNMAGRG-FLND-----------VPPIYNERIKMLRNGL---FQFMEEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
+ GVG FA A K IGL+PCA GG+++ W +L+ I + ++A
Sbjct: 50 SIA---GVGLAASFA-AAWCKKNKRDEIGLIPCAEGGSSLDDWSVDDALFANAIAQTKLA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT---DLRSDLQSPLLPIIRVALAS 203
R T+ ++W+QGE+++ + Y++ D FF LR L P +P+I L
Sbjct: 106 QR-ISTLDGIIWHQGEAES----HSGKYRDYYDKFFVIIERLRQVLDVPEIPLIIGGLGD 160
Query: 204 GEGPFI---EIVRKAQLSSDLP-------NVRCVDAMGLPLEPDGLHLTTPAQ 246
G I +Q++ +L N V A GL PDG+HL +Q
Sbjct: 161 YLGHGIMGGYFNEYSQVNEELKRFAHSHNNCYYVTAEGLTCNPDGIHLNAVSQ 213
>gi|326201459|ref|ZP_08191330.1| Carbohydrate binding family 6 [Clostridium papyrosolvens DSM 2782]
gi|325988059|gb|EGD48884.1| Carbohydrate binding family 6 [Clostridium papyrosolvens DSM 2782]
Length = 780
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 102/222 (45%), Gaps = 34/222 (15%)
Query: 25 QLIILAGQSNMAGRGGV-----TNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+L GQSNMAG D R L +D N + R+T + W +A
Sbjct: 550 HCFLLLGQSNMAGYAASQASDKVEDPRVLVLGFDN--------NSKLGRVTDQ--WDVAC 599
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-GSSLYEQ 138
PLHA + + +GPG F ++ KVP+ IGL+PCAI G I + K G S Y
Sbjct: 600 PPLHA----SWLDAIGPGDWFGKTMIQKVPSGDTIGLIPCAISGEKIETFMKSGGSKYNW 655
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
++ RA++A + GG I ++++QGES++ + + + DLR DL +P +
Sbjct: 656 IVNRAKLAQQKGGVIEGIIFHQGESNSGDTS----WPGKVKTLVEDLRKDLSLGDVPFLA 711
Query: 199 VAL-----ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLE 235
L +G + QL S + N V A GL ++
Sbjct: 712 GELLYSGPCAGHNKLVN-----QLPSLISNSYVVSADGLVVD 748
>gi|452856346|ref|YP_007498029.1| Putative acetylesterase [Bacillus amyloliquefaciens subsp.
plantarum UCMB5036]
gi|452080606|emb|CCP22370.1| Putative acetylesterase [Bacillus amyloliquefaciens subsp.
plantarum UCMB5036]
Length = 268
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 90/183 (49%), Gaps = 16/183 (8%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+W + EP++ D V+ GVG FA+A P+ IGL+PCA GG++++ W
Sbjct: 25 QWQMMTEPINYDRPVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEG 80
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L++ + A+ ALR I +LW+QGESD+ + Y E+ + LR++L+
Sbjct: 81 ILFQHALSEARFALR-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIGTLRNELELDE 138
Query: 194 LPIIRVALA-----SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+P+I L +G G R+ + + + N V A GL PDG+HL
Sbjct: 139 VPLIIGGLGDFLGKTGFGQHATEFRQVNEQLLRFADEQQNCYFVTAAGLTANPDGIHLDA 198
Query: 244 PAQ 246
+Q
Sbjct: 199 ASQ 201
>gi|365121330|ref|ZP_09338321.1| hypothetical protein HMPREF1033_01667 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645953|gb|EHL85206.1| hypothetical protein HMPREF1033_01667 [Tannerella sp.
6_1_58FAA_CT1]
Length = 260
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 121/278 (43%), Gaps = 41/278 (14%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVT---NDTRTNKLTWD--GI 55
F +L+C + + + + + GQSNMAGR +T DT N ++
Sbjct: 4 FFTYLICSLTFTMMIARSEASGKFDIYLCIGQSNMAGRATLTPAVMDTLVNVYLFNDRNF 63
Query: 56 VPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIG 115
P P + + + +I + K +GP FA V K IG
Sbjct: 64 FEPAVNP-------------LNRYSTIRKEIGMQK---LGPAYSFARKVSEKSD--CKIG 105
Query: 116 LVPCAIGGTNISQWRKGSS--LYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
LV A GG++I W KG+S Y +M+ R + AL+ G ++AVLW+QGE+D E K+
Sbjct: 106 LVVNARGGSSIKSWEKGASDNYYGEMLSRIREALKYG-RLKAVLWHQGEADCRYPESYKI 164
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALA--------SGEGPFIEIVRKAQLSSDLPNVR 225
Y + LR+DL P L + ++ G PF +++R L +P +
Sbjct: 165 YICK---LVEQLRADLNMPDLLFVAGEISRWNWTGHTEGTIPFNKMLR--SLEDSIPRFK 219
Query: 226 CVDAMGLP--LEPDGLHLTTPAQGSTLNSWSNEALRVN 261
V + GL ++ + H T +Q ++ + LR N
Sbjct: 220 VVSSEGLKPLIDENDPHFDTDSQIILGERYAEKVLRYN 257
>gi|423215429|ref|ZP_17201956.1| hypothetical protein HMPREF1074_03488 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691997|gb|EIY85237.1| hypothetical protein HMPREF1074_03488 [Bacteroides xylanisolvens
CL03T12C04]
Length = 752
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 111/233 (47%), Gaps = 34/233 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L + GQSNMAGRG +T++ + N N +L ++ A PL+
Sbjct: 523 LFLFIGQSNMAGRGYITDNYKGN------------IKNTYLLTPVGGME--SARNPLNKY 568
Query: 86 IDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS--SLYEQMI 140
+ K GVGP FA A+ K +GLV A GG++I+ W KG+ + Y++ +
Sbjct: 569 STIRKRLDLQGVGPAYSFAKAITNKTGR--PLGLVVNARGGSSINSWMKGAKDNYYDEAL 626
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
R + A++ G T++A++W+QGESD+ E Y + +LR DL + LP I
Sbjct: 627 SRIRQAMKFG-TLKAIIWHQGESDSNAPE---TYILKLQELVANLRKDLNNARLPFIVGE 682
Query: 201 LAS-----GEGPFIEIVRKAQLSSDLPNVRCVDAMGL-PL-EPDGLHLTTPAQ 246
LA F E++R + +P CV + L PL + + H + +Q
Sbjct: 683 LAEWRINGTSETFNEMLR--TVPQHIPYSYCVSSKELVPLIDENDPHFSADSQ 733
>gi|423072845|ref|ZP_17061594.1| hypothetical protein HMPREF0322_01005 [Desulfitobacterium hafniense
DP7]
gi|361856460|gb|EHL08363.1| hypothetical protein HMPREF0322_01005 [Desulfitobacterium hafniense
DP7]
Length = 275
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 103/231 (44%), Gaps = 37/231 (16%)
Query: 29 LAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDV 88
+ GQSNMAGRG ND VPP +LR + EP++ D +
Sbjct: 1 MIGQSNMAGRG-FLND-----------VPPIYNERIKMLRNGL---FQFMEEPINYDRSI 45
Query: 89 NKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALR 148
GVG FA A K IGL+PCA GG+++ W +L+ I + ++A R
Sbjct: 46 A---GVGLAASFA-AAWCKKNKRDEIGLIPCAEGGSSLDDWSVDDALFANAIAQTKLAQR 101
Query: 149 GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT---DLRSDLQSPLLPIIRVALASGE 205
T+ ++W+QGE+++ + Y++ D FF LR L P +P+I L
Sbjct: 102 -ISTLDGIIWHQGEAES----HSGKYRDYYDKFFVIIERLRQVLDVPEIPLIIGGLGDYL 156
Query: 206 GPFI---EIVRKAQLSSDLP-------NVRCVDAMGLPLEPDGLHLTTPAQ 246
G I +Q++ +L N V A GL PDG+HL +Q
Sbjct: 157 GHGIMGGYFNEYSQVNEELKRFAHSHNNCYYVTAEGLTCNPDGIHLNAVSQ 207
>gi|366163542|ref|ZP_09463297.1| carbohydrate-binding family 6 protein [Acetivibrio cellulolyticus
CD2]
Length = 1203
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 99/221 (44%), Gaps = 34/221 (15%)
Query: 27 IILAGQSNMAGRG-----GVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
+L GQSNMAG D R L +D NP++ R+ K +W +A P
Sbjct: 975 FLLLGQSNMAGYALAQTSDKVEDPRVLVLGYDN--------NPALGRV--KDQWDVACPP 1024
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-GSSLYEQMI 140
LH + +GPG F ++ KVP+ IGL+PCAI G I + K G S Y +
Sbjct: 1025 LHPSW----LDAIGPGDWFGKTMIQKVPSGDTIGLIPCAISGEKIETFMKSGGSKYSWIT 1080
Query: 141 QRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVA 200
RA++A + GG I ++++QGES+ + + + DLR DL P I
Sbjct: 1081 DRAKLAQQKGGVIEGIIFHQGESNNGD----PAWPGKVKTLVDDLRKDLNIENAPFIAGE 1136
Query: 201 L-----ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEP 236
L +G + QL S + N V A L ++P
Sbjct: 1137 LLYSGPCAGHNKLVN-----QLPSLINNCYVVSASDLVVDP 1172
>gi|387899210|ref|YP_006329506.1| iduronate-2-sulfatase [Bacillus amyloliquefaciens Y2]
gi|387173320|gb|AFJ62781.1| iduronate-2-sulfatase [Bacillus amyloliquefaciens Y2]
Length = 268
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 91/183 (49%), Gaps = 16/183 (8%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+W + EP++ D V+ GVG FA+A P+ IGL+PCA GG++++ W
Sbjct: 25 QWQMMTEPINYDRPVS---GVGLAASFADAWSKAHPD-EEIGLIPCAEGGSSLNDWHPEG 80
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L++ + A+ AL+ I +LW+QGESD+ + Y E+ + LR++L+
Sbjct: 81 ILFQHALSEARFALQ-SSQICGILWHQGESDSYR-SLHETYYEKLTLIIETLRNELKLDE 138
Query: 194 LPIIRVALA-----SGEGPFIEIVRKA-----QLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
+P+I L +G G R+ + +++ N V A GL PDG+HL
Sbjct: 139 VPLIIGGLGDFLGKTGFGQHATEFRQVNEQLLRFANEQQNCYFVTAAGLTANPDGIHLDA 198
Query: 244 PAQ 246
+Q
Sbjct: 199 ASQ 201
>gi|374580433|ref|ZP_09653527.1| protein of unknown function (DUF303) [Desulfosporosinus youngiae
DSM 17734]
gi|374416515|gb|EHQ88950.1| protein of unknown function (DUF303) [Desulfosporosinus youngiae
DSM 17734]
Length = 281
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/233 (28%), Positives = 102/233 (43%), Gaps = 37/233 (15%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG + + VPP +LR + EP++ D
Sbjct: 5 FLMIGQSNMAGRGFLND------------VPPIYNERIKMLRNGL---FQFMEEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
+ GVG FA A K IGL+PCA GG+++ W +L+ I + ++A
Sbjct: 50 SIA---GVGLAASFAAAWCKKNKQ-NEIGLIPCAEGGSSLDDWSVDDALFANAIAQTKLA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF---TDLRSDLQSPLLPIIRVALAS 203
R T+ ++W+QGE+++ + Y++ D FF LR L P +P+I L
Sbjct: 106 QR-ISTLDGIIWHQGEAES----HSGKYRDYQDKFFIIIERLRQVLNVPEIPLIIGGLGD 160
Query: 204 GE------GPFIEIVRKAQ----LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
G F E + + + N V A GL PDG+HL +Q
Sbjct: 161 YLGDGIMGGYFNEYTQVNEELKRFAHSHNNCYYVTAEGLTCNPDGIHLNAVSQ 213
>gi|154505119|ref|ZP_02041857.1| hypothetical protein RUMGNA_02632 [Ruminococcus gnavus ATCC 29149]
gi|153794598|gb|EDN77018.1| hypothetical protein RUMGNA_02632 [Ruminococcus gnavus ATCC 29149]
Length = 287
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 104/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I++ GQSNMAGRG + VP C +LR W + EP++ D
Sbjct: 4 ILMIGQSNMAGRGFINE------------VPMICNERILMLRNAG---WQMMAEPINYD- 47
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
G+G FA A+ IGL+PCA GG+++ W +L++ + +A A
Sbjct: 48 --RPNAGIGLAGSFA-AMWCMEHEGEQIGLIPCAEGGSSLDDWAVDKNLFKNAVIQAGFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
++ I +LW+QGESD+ YK + + LR +L + +P+I L G
Sbjct: 105 MQDSELI-GILWHQGESDSYGGGYQTYYK-KLQVIIESLRKELNAFEVPLIIGGLGDFLG 162
Query: 205 EGPF------IEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F E+V + + + + N V A GL PDG+H+ +Q
Sbjct: 163 KNGFGLNCTEYELVNEQLLKFAREQENSCFVTAEGLTPNPDGIHMDAVSQ 212
>gi|336432884|ref|ZP_08612715.1| hypothetical protein HMPREF0991_01834 [Lachnospiraceae bacterium
2_1_58FAA]
gi|336018166|gb|EGN47919.1| hypothetical protein HMPREF0991_01834 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 287
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 104/230 (45%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
I++ GQSNMAGRG + VP C +LR W + EP++ D
Sbjct: 4 ILMIGQSNMAGRGFINE------------VPMICNERILMLRNAG---WQMMAEPINYD- 47
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
G+G FA A+ IGL+PCA GG+++ W +L++ + +A A
Sbjct: 48 --RPNAGIGLAGSFA-AMWCMEHEGEQIGLIPCAEGGSSLDDWAVDKNLFKNAVIQAGFA 104
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS--G 204
++ I +LW+QGESD+ YK + + LR +L + +P+I L G
Sbjct: 105 MQDSELI-GILWHQGESDSYGGGYQTYYK-KLQVIIESLRKELNAFEVPLIIGGLGDFLG 162
Query: 205 EGPF------IEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+ F E+V + + + + N V A GL PDG+H+ +Q
Sbjct: 163 KNGFGLNCTEYELVNEQLLRFAREQENSCFVTAEGLTPNPDGIHMDAVSQ 212
>gi|126699496|ref|YP_001088393.1| acetylesterase [Clostridium difficile 630]
gi|423089316|ref|ZP_17077678.1| hypothetical protein HMPREF9945_00859 [Clostridium difficile
70-100-2010]
gi|115250933|emb|CAJ68761.1| putative acetylesterase [Clostridium difficile 630]
gi|357558452|gb|EHJ39946.1| hypothetical protein HMPREF9945_00859 [Clostridium difficile
70-100-2010]
Length = 282
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 107/230 (46%), Gaps = 31/230 (13%)
Query: 27 IILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADI 86
++ GQSNMAGRG ++ V P +LR +W + EP++ D
Sbjct: 5 FLMLGQSNMAGRGFISE------------VTPIYNERIQMLR---NGRWQMMTEPINYDR 49
Query: 87 DVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVA 146
V+ GV FA+A + IGL+PCA GG+++ +W L++ I A+ A
Sbjct: 50 PVS---GVSLAASFADAWCCENQE-DRIGLIPCAEGGSSLDEWNIDGILFKHAISEAKFA 105
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--------- 197
++ + +LW+QGE+D+ N + K Y ++ LR +L P +PII
Sbjct: 106 IQ-SSELTGILWHQGENDSNN-GNYKFYYKKLLSIIETLRKELNIPDIPIIIGGLGDFLG 163
Query: 198 RVALASGEGPFIEIVRKAQ-LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+V ++ I ++ Q + + N V A GL PDG+H+ +Q
Sbjct: 164 KVGFGKSCTEYVFINQELQKFAFEQDNCYFVTATGLTSNPDGIHIDAISQ 213
>gi|373854811|ref|ZP_09597608.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
gi|372471593|gb|EHP31606.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
Length = 474
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/255 (29%), Positives = 110/255 (43%), Gaps = 60/255 (23%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
+LAGQSNM G G + + +P+P I + +W LA +PLH +
Sbjct: 104 LLAGQSNMEGCGLLAASS--------------ARPHPLIRVFSLAREWRLAADPLHVPWE 149
Query: 88 -------------------VNKTN--GVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGT 124
KT+ G G G+ FA +L + VP GL+ A G T
Sbjct: 150 SPEPALNDGKPFTREQAEAYRKTSRVGAGVGVHFAREMLARSGVPQ----GLICAARGAT 205
Query: 125 NISQW------RKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
+ QW GS LY M++ + G + VLW+QGE DT E A Y R
Sbjct: 206 RMEQWLPTRARDGGSGLYGAMLRSVRAT---GQPVAGVLWHQGEGDTPG-ERAAFYSRRM 261
Query: 179 DMFFTDLRSDLQSPLLPII--RVALASGEGP-----FIEIVRKAQLSSDLPNVRCVDAMG 231
+R DL+ P LP I ++A GE P F++ ++ L+ +P+ V +
Sbjct: 262 RRLVAAVRRDLELPRLPWIFAQIARVYGERPDCAWNFVQEQQRV-LAERIPDAALVATVD 320
Query: 232 LPLEPDGLHLTTPAQ 246
LPL+ D +HL+ A
Sbjct: 321 LPLD-DFIHLSAEAH 334
>gi|225156164|ref|ZP_03724645.1| hypothetical protein ObacDRAFT_8692 [Diplosphaera colitermitum
TAV2]
gi|224803142|gb|EEG21384.1| hypothetical protein ObacDRAFT_8692 [Diplosphaera colitermitum
TAV2]
Length = 646
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/255 (32%), Positives = 112/255 (43%), Gaps = 61/255 (23%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC-QPNPSILRLTAKLKWVLAHEPLH--- 83
+LAGQSNM G G + + P C +P+P I T +W A +PLH
Sbjct: 113 LLAGQSNMEGCGFMDS--------------PHCARPHPLIRAFTMAREWRQAADPLHIRW 158
Query: 84 ----------ADIDVNKTN--------GVGPGLPFANAVLTK--VPNFGVIGLVPCAIGG 123
A D + G G GLPFA+ +L + VP LV A GG
Sbjct: 159 ESPDSCHNDGATWDRTRAEQHRRTALRGAGVGLPFAHEMLARSGVPQ----ALVCTAHGG 214
Query: 124 TNISQWRK------GSSLYEQMIQRAQVALRGGGT-IRAVLWYQGESDTVNLEDAKLYKE 176
T++ QW SLY M+ +++R G VLWYQGESDT A +Y +
Sbjct: 215 TSMEQWNPLHKKLGDGSLYGSML----LSMRATGQPCAGVLWYQGESDTAA-PLAAIYTD 269
Query: 177 RSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEI----VRKAQ--LSSDLPNVRCVDAM 230
R R DL+ P LP I V LA G E V++ Q L + N+ V A+
Sbjct: 270 RMKKLVAATRRDLRQPDLPWIIVQLARVLGIRPETGWNSVQEQQRLLPKKIQNLDTVVAI 329
Query: 231 GLPLEPDGLHLTTPA 245
L L+ D +H++T A
Sbjct: 330 DLTLD-DRIHISTDA 343
>gi|194699526|gb|ACF83847.1| unknown [Zea mays]
Length = 87
Score = 72.4 bits (176), Expect = 2e-10, Method: Composition-based stats.
Identities = 36/63 (57%), Positives = 45/63 (71%)
Query: 184 DLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
D+R DL P L +I+V LA+G+G F++IVR+AQ L NVR VDA GLP+ D HLTT
Sbjct: 7 DVRRDLGMPDLLVIQVGLATGQGRFVDIVREAQRRVSLRNVRYVDAKGLPVANDYTHLTT 66
Query: 244 PAQ 246
PAQ
Sbjct: 67 PAQ 69
>gi|343085782|ref|YP_004775077.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342354316|gb|AEL26846.1| protein of unknown function DUF303 acetylesterase [Cyclobacterium
marinum DSM 745]
Length = 530
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 117/263 (44%), Gaps = 46/263 (17%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQL----IILA-GQSNMAGRGGVTNDTRTNKLTWDGI 55
MF + +L+ P Q Q++ I LA GQSNMAGR + D
Sbjct: 1 MFVLIKKFLLLVLLLPTTFFLQAQEIDSLDIYLAIGQSNMAGRADILADLEA-------- 52
Query: 56 VPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKT---NGVGPGLPFANAVLTKVPNFG 112
P S+ T K +W+ A PL+ V K + P FA K+ N+
Sbjct: 53 ------PVESVYLFTGK-EWLPAANPLNLYSTVRKVVSMQRLSPAYGFAR----KMQNYN 101
Query: 113 ---VIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLE 169
IGLV A GG+ I +W G+ + ++I RA++A G I+ ++W+QGE D ++
Sbjct: 102 QDRKIGLVVNAKGGSVIDEWLPGTLFFSEIIDRARLAAE-SGKIKGIIWHQGEGD---VK 157
Query: 170 DAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNV-RCVD 228
+A Y + T LR LQ P LP + L++ + RKA L+ L N+ + V
Sbjct: 158 EADQYLGKISHLITALRDSLQLPGLPFVAGQLSNDKSN-----RKA-LNDTLLNLPKVVP 211
Query: 229 AMGLPLE-----PDGLHLTTPAQ 246
GL L D H +P+Q
Sbjct: 212 YTGLALSFGTTTFDSTHFDSPSQ 234
>gi|391230125|ref|ZP_10266331.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
gi|391219786|gb|EIP98206.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
Length = 495
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 109/255 (42%), Gaps = 60/255 (23%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
+LAGQSNM G G + + + +P I + +W LA +PLH +
Sbjct: 125 LLAGQSNMEGCGLLAASS--------------ARSHPLIRAFSLAREWRLAADPLHVPWE 170
Query: 88 -------------------VNKTN--GVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGT 124
KT+ G G G+ FA +L + VP GL+ A G T
Sbjct: 171 SPEPALNDGKPFTREQAEAYRKTSRVGAGVGVHFAREMLARSGVPQ----GLICAARGAT 226
Query: 125 NISQW------RKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
+ QW GS LY M++ + G + VLW+QGE DT E A Y R
Sbjct: 227 RMEQWLPTRARDGGSGLYGAMLRSVRTT---GQPVAGVLWHQGEGDTPG-ERAAFYSRRM 282
Query: 179 DMFFTDLRSDLQSPLLPII--RVALASGEGP-----FIEIVRKAQLSSDLPNVRCVDAMG 231
+R DL+ P LP I ++A GE P F++ ++ L+ +P+ V +
Sbjct: 283 RRLVAAVRRDLELPRLPWIFAQIARVYGERPDCAWNFVQEQQRV-LAERIPDAALVATVD 341
Query: 232 LPLEPDGLHLTTPAQ 246
LPL+ D +HL+ A
Sbjct: 342 LPLD-DFIHLSAEAH 355
>gi|189466558|ref|ZP_03015343.1| hypothetical protein BACINT_02933 [Bacteroides intestinalis DSM
17393]
gi|189434822|gb|EDV03807.1| hypothetical protein BACINT_02933 [Bacteroides intestinalis DSM
17393]
Length = 829
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 117/269 (43%), Gaps = 33/269 (12%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
+F +LL LV+ ++L IL GQSNM+GR + + D V P
Sbjct: 6 IFLFLLITTLVASQASA-----HKRLFILLGQSNMSGRAPIED--------ADMAVCPMV 52
Query: 61 QPNPSILRLTAKLKWVLAHEPLHADIDVNKT---NGVGPGLPFANAVLTKVPNFGVIGLV 117
+ L A + + PL+ ++ K +GPG FA + ++ + I V
Sbjct: 53 K------LLNADGHFEVLRNPLNRFSNIRKDIAMQKLGPGYTFAETLSEQLQD--TIFFV 104
Query: 118 PCAIGGTNISQWRKGSS--LYEQMIQRAQVALRGGGTIR--AVLWYQGESDTVNLEDAKL 173
A GGT + ++ K + YE+ + R + ALR ++ ++W+QGES N +D +
Sbjct: 105 VNARGGTALERFMKNDTAGYYEKTLFRIKQALRERPDLKPATIIWHQGES---NRDDYQS 161
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSD-LPNVRCVDAMGL 232
Y + DLRSDL P LP I + + IV K L D +P V + GL
Sbjct: 162 YLNHLNTLVADLRSDLGIPDLPFIAGEIGRWNPDYSHIVEKIALIPDSIPYAGLVSSEGL 221
Query: 233 PLEPDGLHLTTPAQGSTLNSWSNEALRVN 261
D H T +Q ++ + L ++
Sbjct: 222 T-NIDEFHFDTRSQRELGKRYAKKYLELS 249
>gi|302852779|ref|XP_002957908.1| hypothetical protein VOLCADRAFT_99022 [Volvox carteri f.
nagariensis]
gi|300256785|gb|EFJ41044.1| hypothetical protein VOLCADRAFT_99022 [Volvox carteri f.
nagariensis]
Length = 622
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 34/244 (13%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGI-VPPQCQPNPS-ILRLTAKLKWVLAHEPLHAD 85
I+AGQSN G + DG VP +P P +L W A +HA
Sbjct: 209 IIAGQSNAVG-----------DNSADGTPVPAASKPLPGLVLSYDCTGTWRDATPNIHAG 257
Query: 86 ID-VNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWR--KGSSLYEQMIQ 141
I + GP + F L + G +GLVP A G TN+ W+ G LY MI
Sbjct: 258 IQGYTREPSCGPAISFGR-TLVSLGLSGRVGLVPAAKGATNLFHDWKPTGGGELYGTMIA 316
Query: 142 RAQVALR----GGGT--IRAVLWYQGESDT---VNLEDAKLYKERSDMFFTDLRSDLQS- 191
R + AL GGGT +R ++W QGE+D V ++ Y F +R DL S
Sbjct: 317 RTKAALMSTPPGGGTCRLRGLIWIQGEADAEERVGPGPSEAYGANFTAFVQAVRRDLASY 376
Query: 192 -PLLPIIRVALASGEG---PFIEIVRKAQLSSDLPNVRCVDAMGLPL--EPDGLHLTTPA 245
LPI+ +A + P++ VR+AQ S LP + +D G E G H+
Sbjct: 377 HAQLPIVMGVMALRKRECFPYLATVRRAQQSVPLPGLLRIDLAGYEFFEEYGGYHVHLTK 436
Query: 246 QGST 249
G T
Sbjct: 437 DGVT 440
>gi|224105609|ref|XP_002313871.1| predicted protein [Populus trichocarpa]
gi|222850279|gb|EEE87826.1| predicted protein [Populus trichocarpa]
Length = 188
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 53/90 (58%), Gaps = 14/90 (15%)
Query: 157 LWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQ 216
LWYQGE T +++DA++Y+ + D VA+ SG+G ++E VR+A+
Sbjct: 4 LWYQGERGTSHIQDAEVYQRNMEKLIED--------------VAIISGDGKYVEKVREAR 49
Query: 217 LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
+LPN+ CVDA GL L+ D L LTT +Q
Sbjct: 50 PGINLPNMVCVDAKGLHLKEDHLQLTTESQ 79
>gi|391228432|ref|ZP_10264638.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
gi|391218093|gb|EIP96513.1| protein of unknown function (DUF303) [Opitutaceae bacterium TAV1]
Length = 657
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/254 (30%), Positives = 111/254 (43%), Gaps = 59/254 (23%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
+LAGQSNM G G + + +P+P I + + +W A +PLH ++
Sbjct: 131 LLAGQSNMEGCGRMDDGG-------------AARPHPLIRAFSMRREWRQAADPLHLRME 177
Query: 88 VNKT---------------------NGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGT 124
+ GVG G+ FA +L + VP GLV A GGT
Sbjct: 178 SPDSCHNDGAQHTREQAENARRTAQRGVGAGVFFAREMLARSGVPQ----GLVCTAHGGT 233
Query: 125 NISQWRK------GSSLYEQMIQRAQVALRGGGT-IRAVLWYQGESDTVNLEDAKLYKER 177
++ QW G+S Y M+ ++LR G VLWYQGESDT A +Y +R
Sbjct: 234 SMEQWNPVHKKSGGASQYGSML----LSLRATGQPCAGVLWYQGESDTA-APLAAVYTDR 288
Query: 178 SDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEI----VRKAQ--LSSDLPNVRCVDAMG 231
R DL P LP I V LA G E V++ Q L + + N+ V A+
Sbjct: 289 MKKLVAATRRDLHQPDLPWIIVQLARVFGHRSETGWNSVQEQQRLLPAKIRNLATVAAID 348
Query: 232 LPLEPDGLHLTTPA 245
L L+ D +H++ A
Sbjct: 349 LALD-DPIHISATA 361
>gi|224540313|ref|ZP_03680852.1| hypothetical protein BACCELL_05226 [Bacteroides cellulosilyticus
DSM 14838]
gi|224518066|gb|EEF87171.1| hypothetical protein BACCELL_05226 [Bacteroides cellulosilyticus
DSM 14838]
Length = 829
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/202 (28%), Positives = 95/202 (47%), Gaps = 26/202 (12%)
Query: 20 QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
Q ++L IL GQSNM+GR + N P + L A ++ +A
Sbjct: 21 QDTHKRLFILLGQSNMSGRAPIEN--------------ADTAALPLVKLLDADGRFEVAR 66
Query: 80 EPLHADIDVNK---TNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG--SS 134
PL+ ++ K +GPG FA + ++ + I LV A GGT + ++ K +
Sbjct: 67 NPLNRFSNIRKGITMQKLGPGYHFAKTLSEQLQD--TIYLVVNARGGTALERFMKKDPAG 124
Query: 135 LYEQMIQRAQVALRGGGTIR--AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSP 192
Y++ + R + ALR ++ A++W+QGES N +D + Y + TDLR+DL P
Sbjct: 125 YYKKTLSRIKQALRAYPDMKPEAIIWHQGES---NRDDYQNYLNHLNKLVTDLRTDLGIP 181
Query: 193 LLPIIRVALASGEGPFIEIVRK 214
LP I + + IV++
Sbjct: 182 DLPFIAGEIGKWNPDYSHIVKR 203
>gi|298707684|emb|CBJ26001.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 279
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 87/189 (46%), Gaps = 25/189 (13%)
Query: 11 VSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPN-PSILRL 69
VS + K + +I+L GQSNM+GRG + DG PN P I +
Sbjct: 22 VSADYTAKRKVAGSDVILLMGQSNMSGRG------QGYDANIDG-------PNDPRIQQW 68
Query: 70 TAKLKWVLAHEPL-HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ 128
+ + A E L HAD + + VG G F A + +P + LV GGT +
Sbjct: 69 SRANTVITASEHLQHADFAIVEETRVGMGTAFGRAYVETLPAKRNVLLVSTGYGGTRLVN 128
Query: 129 --WRKGSSLYEQMIQRAQVALRGGGT----IRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
W G L+E ++R + AL G + AVLW+QGESD + D + Y+ +
Sbjct: 129 GPWSPGGRLFEDAVRRTEAALASNGATGNCVAAVLWHQGESDAIAGVDQETYQ----FTW 184
Query: 183 TDLRSDLQS 191
TD+ + L+S
Sbjct: 185 TDMINTLRS 193
>gi|149175675|ref|ZP_01854294.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
gi|148845394|gb|EDL59738.1| iduronate-2-sulfatase [Planctomyces maris DSM 8797]
Length = 667
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 25/219 (11%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+L +LAGQSNM +G + +P Q Q P+ + + W+ P H
Sbjct: 24 KLFLLAGQSNMVSQGTLAE------------LPEQLQQPPTNVYFWSNGTWI----PYHN 67
Query: 85 DID-VNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
+ V GP L A+ + P+ IGL+ A GGT I W+ L + Q+
Sbjct: 68 KVAYVKPGKEFGPELAIAHELSRAFPD-EKIGLIKHAKGGTAIRLWQPRMPLVRGLFQKL 126
Query: 144 QVALR-GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--RVA 200
A + GGG + A+ W QGE D E A Y ++ +R P LP++ R++
Sbjct: 127 DDAQKAGGGEVAALFWMQGERDARFHEPA--YAKKFQNLIQAVRQKSDQPELPVVFGRIS 184
Query: 201 LASGEGPFIEIVR--KAQLSSDLPNVRCVDAMGLPLEPD 237
E + + +R + Q++ +L NV +D L +P+
Sbjct: 185 RIIPEREYTDQIRQIQQQVADELANVVMIDTDALERKPE 223
>gi|373850372|ref|ZP_09593173.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
gi|372476537|gb|EHP36546.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
Length = 627
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/254 (30%), Positives = 110/254 (43%), Gaps = 60/254 (23%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
+LAGQSNM G G + +P+P + + + +W A +PLH ++
Sbjct: 102 LLAGQSNMEGCGRMDGGA--------------ARPHPLVRAFSMRREWRQAADPLHLRME 147
Query: 88 VNKT---------------------NGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGT 124
+ GVG G+ FA +L + VP GLV A GGT
Sbjct: 148 SPDSCHNDGAQHTREQAENARRTAQRGVGAGVFFAREMLARSGVPQ----GLVCIAHGGT 203
Query: 125 NISQWRK------GSSLYEQMIQRAQVALRGGGT-IRAVLWYQGESDTVNLEDAKLYKER 177
++ QW G+S Y M+ ++LR G VLWYQGESDT A +Y +R
Sbjct: 204 SMEQWNPVHKKSGGASQYGSML----LSLRATGQPCAGVLWYQGESDTA-APLAAVYTDR 258
Query: 178 SDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEI----VRKAQ--LSSDLPNVRCVDAMG 231
R DL P LP I V LA G E V++ Q L + + N+ V A+
Sbjct: 259 MKKLVAATRRDLHQPDLPWIIVQLARVFGHRSETGWNSVQEQQRLLPAKIRNLATVAAID 318
Query: 232 LPLEPDGLHLTTPA 245
L L+ D +H++ A
Sbjct: 319 LALD-DPIHISATA 331
>gi|340619470|ref|YP_004737923.1| carbohydrate esterase [Zobellia galactanivorans]
gi|339734267|emb|CAZ97644.1| Carbohydrate esterase, family CE6 [Zobellia galactanivorans]
Length = 269
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/240 (26%), Positives = 107/240 (44%), Gaps = 36/240 (15%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRL--TAKLKWVLAHEPL 82
++ IL GQSNM G G + +P + + +P + + K KWV L
Sbjct: 30 KVFILGGQSNMDGTGKSED------------LPEKYRSHPDEVMIWDNKKEKWV----SL 73
Query: 83 HAD-IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKGSSLYEQMI 140
D + GP + F++ + K PN I +V + GGT + W G +Y + +
Sbjct: 74 GTDSFSERRKFKFGPEIAFSHLMAKKFPNH-TIAIVKTSGGGTKLWKHWLPGQPMYTRFL 132
Query: 141 QRAQVAL---RGGGT---IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLL 194
+ AL +G G + +LW QGESD LE A Y+E + + D+R + L
Sbjct: 133 KNMDNALQNLKGQGVAYEVSGMLWMQGESDAETLEWANAYEENLKVLYKDVRKETGKKNL 192
Query: 195 PIIRVALASG---EGPF----IEIVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
PI+ ++ G + P+ E+V+ AQ ++++ NV ++ L D H + +
Sbjct: 193 PIVMGRISIGLLRKTPWNFDHTEVVQAAQDKVAAEDKNVFIINTDKLETLNDNTHFNSES 252
>gi|428163885|gb|EKX32934.1| hypothetical protein GUITHDRAFT_148284 [Guillardia theta CCMP2712]
Length = 248
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 95/217 (43%), Gaps = 40/217 (18%)
Query: 69 LTAKLKWVLAHEPLHADID-----------VNKTNGVGPGLPFANAV--LTKVPNFGVIG 115
L ++W +A EPLH ++D + G GPGL FA+ + L + N
Sbjct: 33 LDGNVRWAMAEEPLHREVDDIPLREAANSPSKRACGTGPGLFFAHELTRLMRANNQKETA 92
Query: 116 LVPCAIGGTNISQWRKGSSLYEQMIQRAQVAL----RGGGT---IRAVLWYQGESDTVNL 168
I +W G L+E M++R + L R G+ I +L+YQGESD +
Sbjct: 93 ------EPLRIDRWLPGEVLFESMVKRTEEVLAVTERAQGSRPPISGILFYQGESDALEE 146
Query: 169 EDAKLYKERSDMFF-------TDLRSDLQSPLLPIIRVALASGEG--PFIEIVRKAQ--L 217
A+ Y+ + F + Q+ +P+I + E P IVR+AQ +
Sbjct: 147 TAARAYQHKLVRFIDGARRALGGGGAGGQADTIPVILCKIWGDESRVPHKLIVREAQENV 206
Query: 218 SSDLPNVRCVDAMGLPLEPDGLHLTTPAQGST-LNSW 253
+ V +D LP + DGLHL A+G+ NSW
Sbjct: 207 CKQVELVDSIDVEDLPFQSDGLHLR--AEGAEPSNSW 241
>gi|87240754|gb|ABD32612.1| hypothetical protein MtrDRAFT_AC150207g2v2 [Medicago truncatula]
gi|87241432|gb|ABD33290.1| hypothetical protein MtrDRAFT_AC158501g27v2 [Medicago truncatula]
Length = 75
Score = 67.8 bits (164), Expect = 5e-09, Method: Composition-based stats.
Identities = 33/50 (66%), Positives = 39/50 (78%)
Query: 197 IRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
++VALASGEG FIE VR AQL LPNV+CVDA GL L+ D LHLTT ++
Sbjct: 1 MQVALASGEGKFIEKVRHAQLGIKLPNVKCVDAKGLHLKTDKLHLTTMSE 50
>gi|373851350|ref|ZP_09594150.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
gi|372473579|gb|EHP33589.1| protein of unknown function DUF303 acetylesterase [Opitutaceae
bacterium TAV5]
Length = 262
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 105/249 (42%), Gaps = 34/249 (13%)
Query: 22 QQQQLIILAGQSNMAG-RGGVTNDTRTNKLTWDGIVPPQCQ-PNPSILRLTAKLKWVLAH 79
Q ++ +LAGQSNM G R +T +P + NP IL + W
Sbjct: 30 QPLKVFVLAGQSNMVGVRSEIT------------ALPENLKTENPDILFFDGQ-TWA--- 73
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLT--KVPNFGVIGLVPCAIGGTNI-SQWRKGSS-- 134
P+ + G GP + FA + K P +G++ + GG+ + S W S+
Sbjct: 74 -PMKPG--NTEAKGFGPEISFARKIHDAWKEP----VGIIKHSKGGSMLASNWSPRSTKE 126
Query: 135 -LYEQMIQRAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSP 192
L +++ R + A I VLW QGESD VN + A LY D+ RS+ +P
Sbjct: 127 NLLAELLARVKAAQAAREIEIVGVLWMQGESDAVNEKRAALYANNLDLLIERFRSEFNNP 186
Query: 193 LLPII--RVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTL 250
L + RV P IVRKAQ + R +D L D LH T
Sbjct: 187 ALLFLCARVNPPEDRYPTAAIVRKAQEECTYAHYRLIDCDDLEKVGDNLHYNTRGIIELG 246
Query: 251 NSWSNEALR 259
N +++ AL+
Sbjct: 247 NRFADAALK 255
>gi|363582077|ref|ZP_09314887.1| hypothetical protein FbacHQ_11544 [Flavobacteriaceae bacterium
HQM9]
Length = 263
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 96/210 (45%), Gaps = 36/210 (17%)
Query: 1 MF-AWLLCLILVSEAWPVKCQYQQQQ-----LIILAGQSNMAGRGGVTNDTRTNKLTWDG 54
MF ++L +L+S + C+++ + I+AGQSN G+ DT+ +
Sbjct: 1 MFKSYLQKFLLIS---LISCEFKTFDDTGFDIFIIAGQSNTLAGSGL--DTKID------ 49
Query: 55 IVPPQCQPNPSILRL--TAKLKWVL--AHEPLHADIDVNKTNGVGPGLPFANAVLTKVPN 110
P+ I +L + +++ A+EPL + N +G GL FA
Sbjct: 50 ------TPDKDIFQLGRFSIFDFMISQANEPLQHH--TARKNKIGFGLTFAKLYKNHKKK 101
Query: 111 FGVIGLVPCAIGGTNIS-QWRKGSSLYEQMIQRAQVALRG--GGTIRAVLWYQGESDTVN 167
I L+PC GG ++ +W+ LYE +I+R + ++A+LW+QGESDT
Sbjct: 102 AKPILLIPCGFGGASLKKEWKISEFLYEDLIERVNFVKQKHPKSIVKAILWHQGESDT-G 160
Query: 168 LEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
L + Y D F +R DL S LP I
Sbjct: 161 LTN---YDILLDKFINSIRKDLNSERLPFI 187
>gi|257053456|ref|YP_003131289.1| Carbohydrate-binding family V/XII [Halorhabdus utahensis DSM 12940]
gi|256692219|gb|ACV12556.1| Carbohydrate-binding family V/XII [Halorhabdus utahensis DSM 12940]
Length = 523
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 110/239 (46%), Gaps = 35/239 (14%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
L +L GQSNM G+G + R + C P++ R + W LA PL
Sbjct: 70 DLYLLFGQSNMEGQGPIEAQDRETHPRIHVLADKTC---PNLDREYGE--WYLAEPPL-- 122
Query: 85 DIDVNKTNG-VGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSL-------- 135
N+ G +GPG FA +++ ++P+ IGLVP A+ G +I+ + KG+ +
Sbjct: 123 ----NRCYGKLGPGDYFAKSMIEEMPDDRSIGLVPAAVSGADIALFEKGAPIGRNDRDIP 178
Query: 136 ------YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
YE M+ A+ A + GT R +L++QGE++T + + + ++ DLR+DL
Sbjct: 179 SQFDGGYEWMVDLAETAQQ-VGTFRGILFHQGETNTND----QQWTDQVQGIVEDLRADL 233
Query: 190 QSPLLPIIRVAL---ASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
+P + + ++G +L + N V A GL + D H T+ A
Sbjct: 234 GIGNVPFLAGEMLYDSAGGCCGSHNTEVNELPDVIENAHVVSAEGLAGQ-DYAHFTSEA 291
>gi|298709128|emb|CBJ31074.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 374
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 59/117 (50%), Gaps = 10/117 (8%)
Query: 139 MIQRAQVALRG---GGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLP 195
M R AL+ G + +LWYQGE+D + A+ Y +R D+R L P L
Sbjct: 1 MSARVDEALKAAPEGSHLGGMLWYQGETDAAKEDRAETYGDRFQTLIEDVRG-LGYPDLN 59
Query: 196 IIRVALASGEG--PFIEIVRKAQL----SSDLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
I VA+ P+++ VR AQL S+ + V D GLP+ PDGLHL T AQ
Sbjct: 60 IFTVAVTGTTARLPYLQQVRDAQLFAGSSTGIAGVWVTDTFGLPMFPDGLHLVTKAQ 116
>gi|365133291|ref|ZP_09342675.1| hypothetical protein HMPREF1032_00471 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363616101|gb|EHL67555.1| hypothetical protein HMPREF1032_00471 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 470
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 99/228 (43%), Gaps = 41/228 (17%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL--- 82
+ ++AGQSN AGR N + D P + L +W LA PL
Sbjct: 126 VFVIAGQSNAAGRA-------KNPVADD--------PELGVHVLRTSARWELATHPLGET 170
Query: 83 ----HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQW--RKGSSLY 136
H N G P L FA + ++ IGLVPCA GG + W + +L+
Sbjct: 171 TNALHVGHYENHNPGHSPWLHFAKRLKRELGY--PIGLVPCAYGGAPLRWWNPEENGALF 228
Query: 137 EQMIQR-AQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLP 195
M++ A + RAVLWYQGE++ + A+ Y ER +F R+ L P LP
Sbjct: 229 TNMLEMLADYDIH----PRAVLWYQGEAEGYE-DSAQTYLERFAVFVRHTRAALGQPELP 283
Query: 196 IIRVALASG-EGPFIEI------VRKAQLSS--DLPNVRCVDAMGLPL 234
+ V L EGP ++ VR+AQ + L +V V A L L
Sbjct: 284 FLTVQLNRCMEGPSEKLDRQWGMVREAQRQAWHTLEHVTVVPAADLAL 331
>gi|448410563|ref|ZP_21575268.1| Carbohydrate-binding family V/XII [Halosimplex carlsbadense 2-9-1]
gi|445671599|gb|ELZ24186.1| Carbohydrate-binding family V/XII [Halosimplex carlsbadense 2-9-1]
Length = 665
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 110/250 (44%), Gaps = 33/250 (13%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L +L GQSNM G+G + R + C P++ R + W LA PL+
Sbjct: 79 LYLLFGQSNMEGQGTIGAQDRETNERIHLLADLDC---PTLEREYGE--WYLAEPPLN-- 131
Query: 86 IDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSL---------- 135
+ G+GPG FA ++ + P+ +GLVP A+ G +I+ ++KG+ +
Sbjct: 132 ---RCSQGLGPGTSFAKTMIEETPDDRGVGLVPAAVSGADIALFQKGAPIGRNDRNIPSQ 188
Query: 136 ----YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQS 191
Y+ ++ A+ A GTI+ +L++QGE++T E + +LRSDL
Sbjct: 189 FDGGYQWLLDLAEQAQE-VGTIKGILFHQGETNTGQQE----WTSEVQGIVENLRSDLGI 243
Query: 192 PLLPIIRVA-LASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGS 248
+P + L EG +L + N V A GL + D H TT A
Sbjct: 244 GTVPFLAGEMLYDSEGGCCASHNSEVNELPDVIENAHVVSAEGLAGQ-DYAHFTTEAYRE 302
Query: 249 TLNSWSNEAL 258
++NE L
Sbjct: 303 LGRRYANEML 312
>gi|372208478|ref|ZP_09496280.1| hypothetical protein FbacS_00060 [Flavobacteriaceae bacterium S85]
Length = 264
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 80/180 (44%), Gaps = 29/180 (16%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKW----VLAHEP 81
+ ++AGQSN G+ QP+ +IL+L + + A EP
Sbjct: 29 IFVIAGQSNTNSGKGLNYKID--------------QPDANILQLGRNYPYDYLIIPAKEP 74
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKV-PNFGVIGLVPCAIGGTNISQ-WRKGSSLYEQM 139
L + N +G GL FA P+ I ++PC GGT++ + W LY M
Sbjct: 75 LQHH--TSNKNQIGFGLTFAKLYNKHTNPSKKTILIIPCGYGGTSLQKDWTFDGYLYNDM 132
Query: 140 IQRAQVALRG--GGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII 197
I+R Q L G ++A+LW+QGESD + Y + D F R DL+ LP+I
Sbjct: 133 IERIQKTLEKYPGSQLKALLWHQGESDV----NHPKYDQLLDQFIHQTRKDLKVN-LPVI 187
>gi|229816892|ref|ZP_04447174.1| hypothetical protein BIFANG_02140 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
gi|229785637|gb|EEP21751.1| hypothetical protein BIFANG_02140 [Bifidobacterium angulatum DSM
20098 = JCM 7096]
Length = 464
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 67/140 (47%), Gaps = 8/140 (5%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
FA + T PN IG++ A GGT IS+ KG +Y+ I Q G + VLWY
Sbjct: 177 FAQELRTTSPNIP-IGIIQTAWGGTAISRHIKGGDIYKNHIAPLQ-----GFHVAGVLWY 230
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLS 218
QG +D N A Y+ + R LP + V LA G + +IVR+AQLS
Sbjct: 231 QGCNDAANNATALAYESQFTALINQYRKVFDDASLPFLYVQLARWPGYQYTQIVRQAQLS 290
Query: 219 S-DLPNVRCVDAMGLPLEPD 237
+ D PN+ +G+ + D
Sbjct: 291 ALDNPNLNSTGNVGMTVSID 310
>gi|298707683|emb|CBJ26000.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 273
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 81/186 (43%), Gaps = 23/186 (12%)
Query: 11 VSEAWPVKCQYQQQQLIILAGQSNMAGRG-GVTNDTRTNKLTWDGIVPPQCQPNPSILRL 69
VS K +++L GQSNM+G G G D DG +P I +
Sbjct: 16 VSADCAAKRNVAGSDVVLLMGQSNMSGWGEGYDADI-------DG------PDDPRIQQW 62
Query: 70 TAKLKWVLAHEPL-HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-- 126
+ + A E L HAD D VG G F A + +P + LVP A G T +
Sbjct: 63 SRANTVITASERLQHADFDRIDQTRVGMGTAFGRAYVKTLPANRNVLLVPTAFGATRLVN 122
Query: 127 SQWRKGSSLYEQMIQRAQVALRGGGTI----RAVLWYQGESDTVNLEDAKLYKER-SDMF 181
W G +L+E + R + AL G + AVLW+QGE D D + Y+ +DM
Sbjct: 123 GPWSPGGNLFEDAVTRMEAALASNGAVGNCVAAVLWHQGEGDAAGRIDQETYQSTWTDMI 182
Query: 182 FTDLRS 187
T LRS
Sbjct: 183 NT-LRS 187
>gi|225165070|ref|ZP_03727256.1| hypothetical protein ObacDRAFT_5385 [Diplosphaera colitermitum
TAV2]
gi|224800332|gb|EEG18728.1| hypothetical protein ObacDRAFT_5385 [Diplosphaera colitermitum
TAV2]
Length = 520
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 103/254 (40%), Gaps = 58/254 (22%)
Query: 28 ILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA--- 84
+LAGQSNM G GG+ + +P+P I + W A +PLH
Sbjct: 141 LLAGQSNMEG-GGL-------------LAASVARPHPFIRAFSLARVWRQAADPLHVPWE 186
Query: 85 ------------------DIDVNKTNGVGPGLPFANAVLTK--VPNFGVIGLVPCAIGGT 124
D G G GL F +L + VP GL+ A G T
Sbjct: 187 SQEAALNDGKPFTREQAEDYRRTSRVGAGVGLHFGREMLLRSGVPQ----GLICAARGAT 242
Query: 125 NISQW------RKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
+ QW G+ LY M++ + G + VLW+QGE D+ E A LY +R
Sbjct: 243 RMEQWLPARGRDGGAGLYGAMLRSVRAT---GQPVAGVLWHQGEGDSPR-ERAALYSQRM 298
Query: 179 DMFFTDLRSDLQSPLLPIIRVALAS--GEGPFI--EIVRKAQ--LSSDLPNVRCVDAMGL 232
+R DL P LP I LA GE P V++ Q L+ + +V V + L
Sbjct: 299 RKLIAAVRRDLGLPRLPWIFAQLARVYGERPDCAWNSVQEQQRALADRIHDVALVATVDL 358
Query: 233 PLEPDGLHLTTPAQ 246
L+ D +HL+ A
Sbjct: 359 SLD-DFIHLSAEAH 371
>gi|298707681|emb|CBJ25998.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 287
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 108/257 (42%), Gaps = 36/257 (14%)
Query: 7 CLILVSEAWPVKCQYQQQQLIILAGQSNMAGRG-GVTNDTRTNKLTWDGIVPPQCQPN-P 64
C VS K +++L GQSNM+G G G D DG PN P
Sbjct: 22 CSTSVSADCTAKRDVAGSDVVLLMGQSNMSGWGEGYDADI-------DG-------PNDP 67
Query: 65 SILRLTAKLKWVLAHEPL-HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGG 123
I + + + A E L HAD VG G F A + +P + LVP A G
Sbjct: 68 RIQQWSRDNTVITASERLQHADHGRAGKRRVGMGTAFGRAFVKTLPANRNVLLVPTAFGA 127
Query: 124 TNISQ--WRKGSSLYEQMIQRAQVALRGGGT----IRAVLWYQGESDTVNLEDAKLYKER 177
T + W G +L+E + R + AL G + A+LW+QGESD + D + Y+
Sbjct: 128 TRLVNGPWSPGGNLFEDAVTRMEAALASNGAAGNCVAAILWHQGESDAGDGIDQETYQS- 186
Query: 178 SDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPD 237
+T++ + L+S + + GE + R LS P + + A+ PD
Sbjct: 187 ---IWTNMINTLRSRIPAAAEAPVILGEFTPHMLARNRALSE--PIIAAIRAI-----PD 236
Query: 238 GLHLT--TPAQGSTLNS 252
+ T P+ G + NS
Sbjct: 237 SVPFTAVAPSDGLSTNS 253
>gi|374295921|ref|YP_005046112.1| dockerin-like protein [Clostridium clariflavum DSM 19732]
gi|359825415|gb|AEV68188.1| dockerin-like protein [Clostridium clariflavum DSM 19732]
Length = 353
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 101/244 (41%), Gaps = 43/244 (17%)
Query: 23 QQQLIILAGQSNMAGRG-------GVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKW 75
+ ++ ILAGQSNMAG G + K+ +G V + S L+
Sbjct: 36 KHKVFILAGQSNMAGCGMNHELSAEYLGEQERVKIYAEGTVEASLKGTWSTLKPGFGSGS 95
Query: 76 VLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKGSS 134
P L F + P+ ++ L+ C GT++ WR S+
Sbjct: 96 GCFG----------------PELTFGREISKAYPDCEIL-LIKCGWSGTSLQGDWRPPSA 138
Query: 135 ------LYEQMIQRAQVALRG-----GGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
LY+ +I+ A+ + W QGESD N+ A+ Y+E F
Sbjct: 139 GGATGPLYKNLIETVNKAIGALDKSIDYEFAGMCWMQGESDACNIYPAREYEENLTAFIN 198
Query: 184 DLRSDLQSPLLPIIRVALASGEGPFIE--IVRKAQL--SSDLPNVRCVDAMGLPLEPDGL 239
D+R +L +P +P + +A+ ++E IVR+AQ+ ++ +P V D + DG+
Sbjct: 199 DVRKELNAPTMPFV-IAMIDDSDAWVENAIVRQAQINVANKVPYVYIFDTK--DYDTDGM 255
Query: 240 HLTT 243
H T
Sbjct: 256 HYKT 259
>gi|404404604|ref|ZP_10996188.1| acetyl xylan esterase A [Alistipes sp. JC136]
Length = 294
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/229 (30%), Positives = 103/229 (44%), Gaps = 54/229 (23%)
Query: 25 QLIILAGQSNMAGRG----GVTNDTRTNKL-TWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
+L++ GQSNMAGRG G + R+ L DG + A
Sbjct: 68 RLVLCIGQSNMAGRGLMDAGAADTLRSVYLFNGDG--------------------FERAA 107
Query: 80 EPLHADIDVNKTNG---VGPGLPFAN--AVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS 134
EP++ V K G VGP FA A +T P +G+V A GG++I +W GS
Sbjct: 108 EPMNRYSTVRKELGMQRVGPVGSFAARYAEVTGAP----VGVVVNARGGSSIDEWLPGSE 163
Query: 135 LYEQMIQRAQVALRGGGT---IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQS 191
+ +A +R G + AVLW+QGE+D+ + E Y+ + LR++L +
Sbjct: 164 T--DYLAKAVERIRAAGDWGDVAAVLWHQGEADSAHPER---YEAKLRRLVGILRTELGN 218
Query: 192 PLLPIIRVALA--------SGEGPFIEIVRKAQLSSDLPNVRCVDAMGL 232
P LP++ +A G PF ++R S +P+ CV A GL
Sbjct: 219 PSLPVVFGEIAHWNWTNRVEGTAPFNAMLR----SLRIPHTACVSAEGL 263
>gi|110638437|ref|YP_678646.1| multifunctional acetylxylan
esterase/b-xylosidase/a-L-arabinofuranosidaseand
carbohydrate esterase family 6 protein [Cytophaga
hutchinsonii ATCC 33406]
gi|110281118|gb|ABG59304.1| CHU large protein; candidate polyfunctional acetylxylan
esterase/b-xylosidase/a-L-arabinofuranosidase, CBM9
module, Glycoside Hydrolase Family 43 protein and
Carbohydrate Esterase Family 6 protein [Cytophaga
hutchinsonii ATCC 33406]
Length = 1585
Score = 61.2 bits (147), Expect = 4e-07, Method: Composition-based stats.
Identities = 66/247 (26%), Positives = 107/247 (43%), Gaps = 42/247 (17%)
Query: 31 GQSNMAGRGGVTNDTRT---NKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADID 87
GQSNM G G + +T ++ G V N + + KW A P+
Sbjct: 36 GQSNMEGNGVIEAQDQTAVNSRFQVMGAV------NCTGTKSYTTGKWTTATAPI----- 84
Query: 88 VNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK---------------- 131
V G+GP F +++ +P +G+VP AIGG +I+ + K
Sbjct: 85 VRCNTGLGPLDYFGRTMVSNLPANIKVGVVPVAIGGCDIALFDKVNYGSYVATAPSWMIG 144
Query: 132 -----GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLR 186
G + Y ++++ A++A + G I+ +L++QGE++ + K D DL
Sbjct: 145 TINQYGGNPYARLVEVAKLAQK-DGVIKGILFHQGETNNGQQDWPAKVKAIYDNLIKDLG 203
Query: 187 SD-LQSPLLP--IIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT 243
D ++P L ++ A G I+ A+L + +PN V A GLP + D LH T
Sbjct: 204 LDPAKTPFLAGELVTTAQGGACGGHNSII--AKLPNVIPNAHVVSAAGLPHKGDNLHF-T 260
Query: 244 PAQGSTL 250
PA T
Sbjct: 261 PASYRTF 267
>gi|417301292|ref|ZP_12088453.1| iduronate-2-sulfatase [Rhodopirellula baltica WH47]
gi|327542407|gb|EGF28890.1| iduronate-2-sulfatase [Rhodopirellula baltica WH47]
Length = 745
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 23/191 (12%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKL-TWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
+ +LAGQSNM GRG +++ ++ K T D I+ + P S T + + P
Sbjct: 43 HHDVYLLAGQSNMDGRGQISDLSKEQKQSTSDAIIFYRSVPRESDGWQTLAPGFSV---P 99
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKG-------- 132
D+ + GP + FA ++L PN + L+ + GGT++ + W+ G
Sbjct: 100 PKYKGDL-PSPTFGPEIGFARSMLNANPN-QKLALIKGSKGGTSLRADWKPGVKGDPKSQ 157
Query: 133 SSLYEQMIQ-----RAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLR 186
Y I+ Q++ RG TIR +LW+QGESD+ + D LY+ R + +R
Sbjct: 158 GPRYCDFIETIRMATKQLSDRGDQFTIRGLLWHQGESDSKSSTD--LYQRRLEELIVRIR 215
Query: 187 SDLQSPLLPII 197
D+ P LP++
Sbjct: 216 EDVGVPDLPVV 226
>gi|108712201|gb|ABF99996.1| expressed protein [Oryza sativa Japonica Group]
gi|215692856|dbj|BAG88276.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 102
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 180 MFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGL 239
+FF+ + Q + V LASG G + E+VR+AQ L NVR VDA GLPLE L
Sbjct: 18 IFFSSFKPPRQCKT-KLFVVGLASGLGQYTEVVREAQKGIKLRNVRFVDAKGLPLEDGHL 76
Query: 240 HLTTPAQ 246
HL+T AQ
Sbjct: 77 HLSTQAQ 83
>gi|384099722|ref|ZP_10000802.1| acetyl xylan esterase A [Imtechella halotolerans K1]
gi|383832171|gb|EID71649.1| acetyl xylan esterase A [Imtechella halotolerans K1]
Length = 369
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 78/195 (40%), Gaps = 35/195 (17%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+ +L GQSNM G I P ++ T K +W A +
Sbjct: 43 DVYLLIGQSNMQGVAP--------------IEPLDTISLRNVFLFTDKNEWEFAKN--YP 86
Query: 85 DIDVNKTNGV--------GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS-- 134
D +N+ + V GP F + IG+V A G T I W+KG +
Sbjct: 87 DNGMNRYSTVKKKPITLFGPAYTFGREIAQYSNR--TIGIVSNARGATRIDWWQKGYTGD 144
Query: 135 ----LYEQMIQRAQVALRG--GGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSD 188
LYE+ ++R ++AL G T++ +LW+QGE++ Y + TDLR D
Sbjct: 145 NDYDLYEEAVKRTKIALESTPGATLKGILWHQGEANNGGGRHVN-YMSKLQSLVTDLRKD 203
Query: 189 LQSPLLPIIRVALAS 203
+P I + +
Sbjct: 204 FGDMNIPFIAAEVGT 218
>gi|409196591|ref|ZP_11225254.1| hypothetical protein MsalJ2_06097 [Marinilabilia salmonicolor JCM
21150]
Length = 278
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 100/248 (40%), Gaps = 47/248 (18%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
Q+ + GQSNMAG T D + S L K KW A PL A
Sbjct: 30 QIYLCFGQSNMAGAA----KTEAQDSIVDSRFVMMSTMDCSDLN-REKGKWYPATPPL-A 83
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------- 131
D + G+ P F ++ +P +G++ A+GG I + K
Sbjct: 84 DCNA----GLSPVDYFGRTMVENLPKKIKVGVINVAVGGCKIELFDKDNYQAYADSAPDW 139
Query: 132 --------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
G + YE++++ A+VA + G I+ +L +QGES+ + L+ + +
Sbjct: 140 MQGWIANYGGNPYERLVEMAKVAQKDG-VIKGILLHQGESNP----NDTLWTGKVKAIYD 194
Query: 184 DLRSDLQSPLLPIIRVALASGEGPFIEIVRKA--------QLSSDLPNVRCVDAMGLPLE 235
+L DL L V L +GE E K QL LPN + + G P +
Sbjct: 195 NLMVDLN---LNPGEVPLLAGETLSAEYDGKCAAFNQFINQLPEVLPNSYVISSQGCPGQ 251
Query: 236 PDGLHLTT 243
PDGLH T
Sbjct: 252 PDGLHFTA 259
>gi|440717772|ref|ZP_20898249.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
gi|436437074|gb|ELP30748.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
Length = 747
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 88/199 (44%), Gaps = 39/199 (19%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKL-TWDGIVPPQCQPNPSI--------LRLTAKL 73
+ +LAGQSNM GRG V++ + K T D I+ + P S + K
Sbjct: 43 HHDVYLLAGQSNMDGRGQVSDLSEEQKQSTGDAIIFYRSVPRESDGWQTLAPGFSVPPKY 102
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKG 132
K L + GP + FA ++ PN + L+ + GGT++ + W+ G
Sbjct: 103 KGGLP------------SPTFGPEIGFARSMSNANPN-QKLALIKGSKGGTSLRADWKPG 149
Query: 133 --------SSLYEQMIQ-----RAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERS 178
Y I+ Q++ RG TIR +LW+QGESD+ + +LY+ R
Sbjct: 150 VKGDPKSQGPRYRDFIETIRMATKQLSDRGDQFTIRGLLWHQGESDSKS--STELYRRRL 207
Query: 179 DMFFTDLRSDLQSPLLPII 197
+ +R D+ P LP++
Sbjct: 208 EELIVRIREDVGVPDLPVV 226
>gi|26249465|ref|NP_755505.1| hypothetical protein c3630 [Escherichia coli CFT073]
gi|218692471|ref|YP_002400683.1| hypothetical protein ECED1_4919 [Escherichia coli ED1a]
gi|218706529|ref|YP_002414048.1| hypothetical protein ECUMN_3377 [Escherichia coli UMN026]
gi|227884851|ref|ZP_04002656.1| YjhS like protein [Escherichia coli 83972]
gi|300895623|ref|ZP_07114228.1| conserved domain protein [Escherichia coli MS 198-1]
gi|300972323|ref|ZP_07171899.1| conserved domain protein [Escherichia coli MS 45-1]
gi|301019316|ref|ZP_07183505.1| conserved domain protein [Escherichia coli MS 69-1]
gi|386630765|ref|YP_006150485.1| hypothetical protein i02_3325 [Escherichia coli str. 'clone D i2']
gi|386635685|ref|YP_006155404.1| hypothetical protein i14_3325 [Escherichia coli str. 'clone D i14']
gi|422361943|ref|ZP_16442530.1| conserved domain protein [Escherichia coli MS 153-1]
gi|26109873|gb|AAN82078.1|AE016766_166 Hypothetical protein yjhS precursor [Escherichia coli CFT073]
gi|47600662|emb|CAE55784.1| hypothetical protein YjhS precusor [Escherichia coli Nissle 1917]
gi|218430035|emb|CAR11022.2| conserved hypothetical protein [Escherichia coli ED1a]
gi|218433626|emb|CAR14537.1| conserved hypothetical protein [Escherichia coli UMN026]
gi|227838168|gb|EEJ48634.1| YjhS like protein [Escherichia coli 83972]
gi|300360439|gb|EFJ76309.1| conserved domain protein [Escherichia coli MS 198-1]
gi|300399311|gb|EFJ82849.1| conserved domain protein [Escherichia coli MS 69-1]
gi|300410977|gb|EFJ94515.1| conserved domain protein [Escherichia coli MS 45-1]
gi|315295302|gb|EFU54632.1| conserved domain protein [Escherichia coli MS 153-1]
gi|355421664|gb|AER85861.1| hypothetical protein i02_3325 [Escherichia coli str. 'clone D i2']
gi|355426584|gb|AER90780.1| hypothetical protein i14_3325 [Escherichia coli str. 'clone D i14']
Length = 329
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 87/201 (43%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPN---PSILRLTA 71
+I LAGQSN MA G+ ++R +L + P +C+ N P+ L
Sbjct: 16 VIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHCLHD 75
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS---- 127
H P AD+ + VG GL A +L +P I LVPC GG +
Sbjct: 76 VQDMSGYHHPA-ADLHKGEYGCVGQGLHIAKKLLPYIPEQAGILLVPCCRGGAAFTVGAE 134
Query: 128 --------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+W G++LYE ++ R +VAL + +V W QGE D ++ +
Sbjct: 135 GMYVPDTGATADAMRWGTGTALYEDLVARVKVALEYNRKNKLLSVCWMQGEFDLMSPD-- 192
Query: 172 KLYKERSDMFF---TDLRSDL 189
Y++ D+F+ T RS+L
Sbjct: 193 --YEKHPDLFYQMVTSFRSEL 211
>gi|386640503|ref|YP_006107301.1| hypothetical protein ECABU_c33010 [Escherichia coli ABU 83972]
gi|404376309|ref|ZP_10981472.1| hypothetical protein ESCG_00400 [Escherichia sp. 1_1_43]
gi|419159626|ref|ZP_13704134.1| hypothetical protein ECDEC6D_2440 [Escherichia coli DEC6D]
gi|419934827|ref|ZP_14451925.1| hypothetical protein EC5761_13814 [Escherichia coli 576-1]
gi|432354948|ref|ZP_19598217.1| hypothetical protein WCA_03941 [Escherichia coli KTE2]
gi|432393494|ref|ZP_19636320.1| hypothetical protein WE9_03817 [Escherichia coli KTE21]
gi|432403297|ref|ZP_19646045.1| hypothetical protein WEK_03503 [Escherichia coli KTE26]
gi|432413132|ref|ZP_19655789.1| hypothetical protein WG9_03628 [Escherichia coli KTE39]
gi|432427579|ref|ZP_19670067.1| hypothetical protein A139_02978 [Escherichia coli KTE181]
gi|432452446|ref|ZP_19694696.1| hypothetical protein A13W_03436 [Escherichia coli KTE193]
gi|432458031|ref|ZP_19700209.1| hypothetical protein A15C_03836 [Escherichia coli KTE201]
gi|432462031|ref|ZP_19704172.1| hypothetical protein A15I_02906 [Escherichia coli KTE204]
gi|432467207|ref|ZP_19709290.1| hypothetical protein A15K_03166 [Escherichia coli KTE205]
gi|432497023|ref|ZP_19738818.1| hypothetical protein A173_04201 [Escherichia coli KTE214]
gi|432501462|ref|ZP_19743215.1| hypothetical protein A177_03572 [Escherichia coli KTE216]
gi|432505789|ref|ZP_19747510.1| hypothetical protein A17E_02862 [Escherichia coli KTE220]
gi|432539297|ref|ZP_19776193.1| hypothetical protein A195_02928 [Escherichia coli KTE235]
gi|432544652|ref|ZP_19781489.1| hypothetical protein A197_03244 [Escherichia coli KTE236]
gi|432550141|ref|ZP_19786903.1| hypothetical protein A199_03619 [Escherichia coli KTE237]
gi|432560193|ref|ZP_19796852.1| hypothetical protein A1S7_03847 [Escherichia coli KTE49]
gi|432581906|ref|ZP_19818320.1| hypothetical protein A1SM_01110 [Escherichia coli KTE57]
gi|432632796|ref|ZP_19868717.1| hypothetical protein A1UW_03183 [Escherichia coli KTE80]
gi|432642509|ref|ZP_19878337.1| hypothetical protein A1W1_03387 [Escherichia coli KTE83]
gi|432652467|ref|ZP_19888217.1| hypothetical protein A1W7_03491 [Escherichia coli KTE87]
gi|432667502|ref|ZP_19903077.1| hypothetical protein A1Y3_04118 [Escherichia coli KTE116]
gi|432695774|ref|ZP_19930968.1| hypothetical protein A31I_03258 [Escherichia coli KTE162]
gi|432707251|ref|ZP_19942329.1| hypothetical protein WCG_00522 [Escherichia coli KTE6]
gi|432784879|ref|ZP_20019057.1| hypothetical protein A1SY_03742 [Escherichia coli KTE63]
gi|432921926|ref|ZP_20124890.1| hypothetical protein A133_03830 [Escherichia coli KTE173]
gi|432928725|ref|ZP_20129826.1| hypothetical protein A135_03895 [Escherichia coli KTE175]
gi|432963371|ref|ZP_20152790.1| hypothetical protein A15E_03729 [Escherichia coli KTE202]
gi|432975113|ref|ZP_20163948.1| hypothetical protein A15S_00976 [Escherichia coli KTE209]
gi|432982357|ref|ZP_20171129.1| hypothetical protein A15W_03501 [Escherichia coli KTE211]
gi|432996672|ref|ZP_20185255.1| hypothetical protein A17A_03750 [Escherichia coli KTE218]
gi|433001269|ref|ZP_20189789.1| hypothetical protein A17K_03617 [Escherichia coli KTE223]
gi|433036098|ref|ZP_20223775.1| hypothetical protein WIC_04663 [Escherichia coli KTE112]
gi|433053501|ref|ZP_20240692.1| hypothetical protein WIK_02314 [Escherichia coli KTE122]
gi|433057736|ref|ZP_20244808.1| hypothetical protein WIM_01516 [Escherichia coli KTE124]
gi|433063416|ref|ZP_20250348.1| hypothetical protein WIO_02243 [Escherichia coli KTE125]
gi|433070852|ref|ZP_20257590.1| hypothetical protein WIQ_04727 [Escherichia coli KTE128]
gi|433072467|ref|ZP_20259149.1| hypothetical protein WIS_01437 [Escherichia coli KTE129]
gi|433087017|ref|ZP_20273404.1| hypothetical protein WIY_01466 [Escherichia coli KTE137]
gi|433096279|ref|ZP_20282482.1| hypothetical protein WK3_01485 [Escherichia coli KTE139]
gi|433105588|ref|ZP_20291591.1| hypothetical protein WK7_01460 [Escherichia coli KTE148]
gi|433115289|ref|ZP_20301098.1| hypothetical protein WKA_01481 [Escherichia coli KTE153]
gi|433128141|ref|ZP_20313647.1| hypothetical protein WKE_04624 [Escherichia coli KTE160]
gi|433142365|ref|ZP_20327568.1| hypothetical protein WKM_04633 [Escherichia coli KTE167]
gi|433150543|ref|ZP_20335552.1| hypothetical protein WKQ_03197 [Escherichia coli KTE174]
gi|433178667|ref|ZP_20363074.1| hypothetical protein WGM_02313 [Escherichia coli KTE82]
gi|433182895|ref|ZP_20367179.1| hypothetical protein WGO_01349 [Escherichia coli KTE85]
gi|442594458|ref|ZP_21012360.1| FIG00640604: hypothetical protein [Escherichia coli O10:K5(L):H4
str. ATCC 23506]
gi|442608333|ref|ZP_21023092.1| FIG00640604: hypothetical protein [Escherichia coli Nissle 1917]
gi|307554995|gb|ADN47770.1| conserved hypothetical protein [Escherichia coli ABU 83972]
gi|378008018|gb|EHV70980.1| hypothetical protein ECDEC6D_2440 [Escherichia coli DEC6D]
gi|388406733|gb|EIL67119.1| hypothetical protein EC5761_13814 [Escherichia coli 576-1]
gi|404290358|gb|EEH71713.2| hypothetical protein ESCG_00400 [Escherichia sp. 1_1_43]
gi|430873856|gb|ELB97422.1| hypothetical protein WCA_03941 [Escherichia coli KTE2]
gi|430916325|gb|ELC37392.1| hypothetical protein WE9_03817 [Escherichia coli KTE21]
gi|430924456|gb|ELC45177.1| hypothetical protein WEK_03503 [Escherichia coli KTE26]
gi|430934077|gb|ELC54458.1| hypothetical protein WG9_03628 [Escherichia coli KTE39]
gi|430953261|gb|ELC72164.1| hypothetical protein A139_02978 [Escherichia coli KTE181]
gi|430976048|gb|ELC92924.1| hypothetical protein A13W_03436 [Escherichia coli KTE193]
gi|430980657|gb|ELC97407.1| hypothetical protein A15C_03836 [Escherichia coli KTE201]
gi|430987709|gb|ELD04239.1| hypothetical protein A15I_02906 [Escherichia coli KTE204]
gi|430992161|gb|ELD08544.1| hypothetical protein A15K_03166 [Escherichia coli KTE205]
gi|431022716|gb|ELD35977.1| hypothetical protein A173_04201 [Escherichia coli KTE214]
gi|431026829|gb|ELD39897.1| hypothetical protein A177_03572 [Escherichia coli KTE216]
gi|431037305|gb|ELD48293.1| hypothetical protein A17E_02862 [Escherichia coli KTE220]
gi|431067710|gb|ELD76226.1| hypothetical protein A195_02928 [Escherichia coli KTE235]
gi|431072886|gb|ELD80625.1| hypothetical protein A197_03244 [Escherichia coli KTE236]
gi|431078490|gb|ELD85541.1| hypothetical protein A199_03619 [Escherichia coli KTE237]
gi|431089498|gb|ELD95311.1| hypothetical protein A1S7_03847 [Escherichia coli KTE49]
gi|431122188|gb|ELE25057.1| hypothetical protein A1SM_01110 [Escherichia coli KTE57]
gi|431167925|gb|ELE68179.1| hypothetical protein A1UW_03183 [Escherichia coli KTE80]
gi|431180041|gb|ELE79932.1| hypothetical protein A1W1_03387 [Escherichia coli KTE83]
gi|431189053|gb|ELE88483.1| hypothetical protein A1W7_03491 [Escherichia coli KTE87]
gi|431198894|gb|ELE97675.1| hypothetical protein A1Y3_04118 [Escherichia coli KTE116]
gi|431232402|gb|ELF28070.1| hypothetical protein A31I_03258 [Escherichia coli KTE162]
gi|431256361|gb|ELF49435.1| hypothetical protein WCG_00522 [Escherichia coli KTE6]
gi|431328036|gb|ELG15356.1| hypothetical protein A1SY_03742 [Escherichia coli KTE63]
gi|431436949|gb|ELH18462.1| hypothetical protein A133_03830 [Escherichia coli KTE173]
gi|431441848|gb|ELH22955.1| hypothetical protein A135_03895 [Escherichia coli KTE175]
gi|431471946|gb|ELH51838.1| hypothetical protein A15E_03729 [Escherichia coli KTE202]
gi|431487179|gb|ELH66824.1| hypothetical protein A15S_00976 [Escherichia coli KTE209]
gi|431490116|gb|ELH69737.1| hypothetical protein A15W_03501 [Escherichia coli KTE211]
gi|431503467|gb|ELH82202.1| hypothetical protein A17A_03750 [Escherichia coli KTE218]
gi|431506388|gb|ELH84985.1| hypothetical protein A17K_03617 [Escherichia coli KTE223]
gi|431544583|gb|ELI19399.1| hypothetical protein WIC_04663 [Escherichia coli KTE112]
gi|431571364|gb|ELI44251.1| hypothetical protein WIK_02314 [Escherichia coli KTE122]
gi|431572388|gb|ELI45228.1| hypothetical protein WIM_01516 [Escherichia coli KTE124]
gi|431576714|gb|ELI49384.1| hypothetical protein WIQ_04727 [Escherichia coli KTE128]
gi|431582609|gb|ELI54623.1| hypothetical protein WIO_02243 [Escherichia coli KTE125]
gi|431590484|gb|ELI61506.1| hypothetical protein WIS_01437 [Escherichia coli KTE129]
gi|431607590|gb|ELI76951.1| hypothetical protein WIY_01466 [Escherichia coli KTE137]
gi|431617798|gb|ELI86789.1| hypothetical protein WK3_01485 [Escherichia coli KTE139]
gi|431630841|gb|ELI99168.1| hypothetical protein WK7_01460 [Escherichia coli KTE148]
gi|431635653|gb|ELJ03852.1| hypothetical protein WKA_01481 [Escherichia coli KTE153]
gi|431636796|gb|ELJ04917.1| hypothetical protein WKE_04624 [Escherichia coli KTE160]
gi|431652175|gb|ELJ19331.1| hypothetical protein WKM_04633 [Escherichia coli KTE167]
gi|431668818|gb|ELJ35262.1| hypothetical protein WKQ_03197 [Escherichia coli KTE174]
gi|431703822|gb|ELJ68507.1| hypothetical protein WGM_02313 [Escherichia coli KTE82]
gi|431709705|gb|ELJ74154.1| hypothetical protein WGO_01349 [Escherichia coli KTE85]
gi|441605594|emb|CCP97640.1| FIG00640604: hypothetical protein [Escherichia coli O10:K5(L):H4
str. ATCC 23506]
gi|441710312|emb|CCQ09069.1| FIG00640604: hypothetical protein [Escherichia coli Nissle 1917]
Length = 328
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 87/201 (43%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPN---PSILRLTA 71
+I LAGQSN MA G+ ++R +L + P +C+ N P+ L
Sbjct: 15 VIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHCLHD 74
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS---- 127
H P AD+ + VG GL A +L +P I LVPC GG +
Sbjct: 75 VQDMSGYHHPA-ADLHKGEYGCVGQGLHIAKKLLPYIPEQAGILLVPCCRGGAAFTVGAE 133
Query: 128 --------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+W G++LYE ++ R +VAL + +V W QGE D ++ +
Sbjct: 134 GMYVPDTGATADAMRWGTGTALYEDLVARVKVALEYNRKNKLLSVCWMQGEFDLMSPD-- 191
Query: 172 KLYKERSDMFF---TDLRSDL 189
Y++ D+F+ T RS+L
Sbjct: 192 --YEKHPDLFYQMVTSFRSEL 210
>gi|432619817|ref|ZP_19855893.1| hypothetical protein A1UM_05276 [Escherichia coli KTE75]
gi|431146828|gb|ELE48256.1| hypothetical protein A1UM_05276 [Escherichia coli KTE75]
Length = 328
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 87/201 (43%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPN---PSILRLTA 71
+I LAGQSN MA G+ ++R +L + P +C+ N P+ L
Sbjct: 15 VIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHCLHD 74
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS---- 127
H P AD+ + VG GL A +L +P I LVPC GG +
Sbjct: 75 VQDMSGYHHPA-ADLHKGEYGCVGQGLHIAKKLLPYIPEQAGILLVPCCRGGAAFTVGAE 133
Query: 128 --------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+W G++LYE ++ R +VAL + +V W QGE D ++ +
Sbjct: 134 GMYVPDTGATADAMRWGTGTALYEDLVARVKVALEYNRKNKLLSVCWMQGEFDLMSPD-- 191
Query: 172 KLYKERSDMFF---TDLRSDL 189
Y++ D+F+ T RS+L
Sbjct: 192 --YEKHPDLFYQMVTSFRSEL 210
>gi|417149518|ref|ZP_11989609.1| PF03629 domain protein [Escherichia coli 1.2264]
gi|417164323|ref|ZP_11999128.1| PF03629 domain protein [Escherichia coli 99.0741]
gi|417614479|ref|ZP_12264934.1| hypothetical protein ECSTECEH250_3562 [Escherichia coli STEC_EH250]
gi|345360325|gb|EGW92494.1| hypothetical protein ECSTECEH250_3562 [Escherichia coli STEC_EH250]
gi|386161739|gb|EIH23542.1| PF03629 domain protein [Escherichia coli 1.2264]
gi|386172586|gb|EIH44609.1| PF03629 domain protein [Escherichia coli 99.0741]
Length = 328
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 87/201 (43%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPN---PSILRLTA 71
+I LAGQSN MA G+ ++R +L + P +C+ N P+ L
Sbjct: 15 VIALAGQSNGMAYGEGIPLPDTLDKPESRVKQLARRKTITPGGKECKFNEIIPADHCLHD 74
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS---- 127
H P AD+ + VG GL A +L +P I LVPC GG +
Sbjct: 75 VQDMSGYHHPA-ADLHKGEYGCVGQGLHIAKKLLPYIPEQAGILLVPCCRGGAAFTVGAE 133
Query: 128 --------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+W G++LYE ++ R +VAL + +V W QGE D ++ +
Sbjct: 134 GMYVPDTGATADAMRWGTGTALYEDLVARVKVALEYNRKNKLLSVCWMQGEFDLMSPD-- 191
Query: 172 KLYKERSDMFF---TDLRSDL 189
Y++ D+F+ T RS+L
Sbjct: 192 --YEKHPDLFYQMVTSFRSEL 210
>gi|149195713|ref|ZP_01872770.1| sialate O-acetylesterase [Lentisphaera araneosa HTCC2155]
gi|149141175|gb|EDM29571.1| sialate O-acetylesterase [Lentisphaera araneosa HTCC2155]
Length = 583
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 73/167 (43%), Gaps = 25/167 (14%)
Query: 97 GLPFANAVLT--KVPNFGVIGLVPCAIGGTNISQW--------RKGSSLYEQMIQRAQVA 146
G FA + K+P IGL+ GG+ I W R S M +
Sbjct: 190 GFVFAKKLQADLKIP----IGLIDANKGGSFIKFWEPPHALKARGESRPARNMFNSMLGS 245
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE- 205
G I+ +WYQGESD +NL+ A+ Y++ R + + P +P + V LAS E
Sbjct: 246 YAHGFPIKGFIWYQGESDAINLQKAQEYEKTFKTMIEGWRHEFKDPEMPFLFVQLASFER 305
Query: 206 GPFIE-----IVRKAQLSS-DLPNVRCVDAMGLPLEPDGLHLTTPAQ 246
P++ ++R AQ ++ +L N M + ++ +H P Q
Sbjct: 306 NPYMHGITYPVLRDAQTAALELDNT----GMAVAIDLGMIHDIHPPQ 348
>gi|417124382|ref|ZP_11973071.1| PF03629 domain protein [Escherichia coli 97.0246]
gi|386146277|gb|EIG92725.1| PF03629 domain protein [Escherichia coli 97.0246]
Length = 626
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/215 (29%), Positives = 87/215 (40%), Gaps = 42/215 (19%)
Query: 12 SEAWPVKCQYQQQQLIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPPQ---C 60
SE P + +++LAGQSN MA G+ D R +L VPP C
Sbjct: 61 SEPLPNNKTPEWYYVVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVPPGGEGC 120
Query: 61 QPNPSILRLTAKLKWVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLV 117
N I+ L V L+ AD+ + VG GL A +L +PN I LV
Sbjct: 121 TYN-DIIPADHCLHDVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLV 179
Query: 118 PCAIGGTNISQ------------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVL 157
PC GG+ +Q W G LY+ +I R + AL+ + AV
Sbjct: 180 PCCRGGSAFTQGAEGTFSTTTGASQDSARWGAGKPLYQDLIARTKAALQKNPKNVLLAVC 239
Query: 158 WYQGESDTVNLEDAKLYKERSDMF---FTDLRSDL 189
W QGE D A Y ++ D+F R+DL
Sbjct: 240 WMQGEFDM----SAATYAQQPDLFTAMLKQFRTDL 270
>gi|421612350|ref|ZP_16053458.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
gi|408496805|gb|EKK01356.1| iduronate-2-sulfatase [Rhodopirellula baltica SH28]
Length = 747
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 89/192 (46%), Gaps = 23/192 (11%)
Query: 22 QQQQLIILAGQSNMAGRGGVTNDTRTNKL-TWDGIVPPQCQPNPSILRLTAKLKWVLAHE 80
+ +LAGQSNM GRG V++ + K T D I+ + P S T + +
Sbjct: 42 DHHDVYLLAGQSNMDGRGQVSDLSEEQKQSTGDAIIFYRSVPRESDGWQTLAPGFSV--- 98
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKG------- 132
P D+ + GP + FA ++ PN + L+ + GGT++ + W+ G
Sbjct: 99 PPKYKGDL-PSPTFGPEIGFARSMSNANPN-QKLALIKGSKGGTSLRADWKPGVKGDPKS 156
Query: 133 -SSLYEQMIQ-----RAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDL 185
Y I+ Q++ RG TIR +LW+QGESD+ + + Y+ R + +
Sbjct: 157 QGPRYRDFIETIRMATKQLSDRGDQFTIRGLLWHQGESDSKS--STERYRRRLEELIVRI 214
Query: 186 RSDLQSPLLPII 197
R D+ P LP++
Sbjct: 215 REDVGVPDLPVV 226
>gi|419850632|ref|ZP_14373612.1| PF03629 domain protein [Bifidobacterium longum subsp. longum 35B]
gi|419851551|ref|ZP_14374477.1| PF03629 domain protein [Bifidobacterium longum subsp. longum 2-2B]
gi|386408474|gb|EIJ23384.1| PF03629 domain protein [Bifidobacterium longum subsp. longum 35B]
gi|386413268|gb|EIJ27881.1| PF03629 domain protein [Bifidobacterium longum subsp. longum 2-2B]
Length = 444
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 61/134 (45%), Gaps = 10/134 (7%)
Query: 89 NKTNGVGPG-LP--FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQV 145
N++N G LP FA + PN IG++ A GGT+I++ + +Y I
Sbjct: 132 NESNAKKLGYLPQLFAEQLRLHHPNIP-IGIIQTAWGGTDIARHLRDGDIYANHI----- 185
Query: 146 ALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE 205
A G + +LWYQGE+D E A Y+ R L LP + V LA
Sbjct: 186 APLDGYNVAGILWYQGENDAAEQEPALQYEANFSTLINQYREVLGDSDLPFLYVQLARYT 245
Query: 206 G-PFIEIVRKAQLS 218
G + IVR+AQ S
Sbjct: 246 GYAYTPIVRQAQFS 259
>gi|32471069|ref|NP_864062.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
gi|32396771|emb|CAD71736.1| iduronate-2-sulfatase [Rhodopirellula baltica SH 1]
Length = 745
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 89/191 (46%), Gaps = 23/191 (12%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKL-TWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
+ +LAGQSNM GRG V++ + K T D I+ + P S T + + P
Sbjct: 43 HHDVYLLAGQSNMDGRGQVSDLSEEQKQSTGDAIIFYRSVPRESDGWQTLAPGFSV---P 99
Query: 82 LHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI-SQWRKG-------- 132
D+ + GP + FA ++ PN + L+ + GGT++ + W+ G
Sbjct: 100 PKYKGDL-PSPTFGPEIGFARSMSNANPN-QKLALIKGSKGGTSLRADWKPGVQGDPKSQ 157
Query: 133 SSLYEQMIQ-----RAQVALRGGG-TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLR 186
Y I+ Q++ RG TIR +LW+QGESD+ + + Y+ R + +R
Sbjct: 158 GPRYRDFIETIRMATKQLSDRGDQFTIRGLLWHQGESDSKS--STERYRRRLEELIVRIR 215
Query: 187 SDLQSPLLPII 197
D+ P LP++
Sbjct: 216 EDVGVPDLPVV 226
>gi|325105290|ref|YP_004274944.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974138|gb|ADY53122.1| protein of unknown function DUF303 acetylesterase [Pedobacter
saltans DSM 12145]
Length = 279
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 116/288 (40%), Gaps = 51/288 (17%)
Query: 4 WLLCLILVSEAWPVKCQYQQQ-----QLIILAGQSNMAGRGGVTNDTR--TNKLTWDGIV 56
+L+ LI+++ A + QQ + + GQSNM G V DT NK +
Sbjct: 5 YLITLIIIAGA--TLNSFSQQVNKNFHIYLCFGQSNMEGHARVETDTNLPVNKR----VK 58
Query: 57 PPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGL 116
Q + R + W A PL V G+GP F A+ +P+ IG+
Sbjct: 59 LLQAVSCEDLKR--EQGNWYDAIAPL-----VRCNTGLGPADYFGRAMANSLPDSVTIGI 111
Query: 117 VPCAIGGTNISQWRKGSSL---------------------YEQMIQRAQVALRGGGTIRA 155
V A+GG I + K S Y+++I+ A++A + G I+
Sbjct: 112 VNVAVGGCKIELFHKKYSESYINTAPDWMVSALKAYSNNPYQRLIEMAKIAQQ-SGVIKG 170
Query: 156 VLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFI-----E 210
+L +QGES+T + + K+ DL +L +P++ + E + E
Sbjct: 171 ILLHQGESNTGDTSWPQKVKDVYGDLIQDL--NLNPSKVPLLAGEVVHAEQNGVCAGMNE 228
Query: 211 IVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEAL 258
I+ QL +PN + + G D LH T N ++++ L
Sbjct: 229 II--GQLPGFIPNAHVISSKGCEAGADRLHFTAVGYKELGNRYASKML 274
>gi|336427499|ref|ZP_08607500.1| hypothetical protein HMPREF0994_03506 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336009587|gb|EGN39579.1| hypothetical protein HMPREF0994_03506 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 480
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 99/239 (41%), Gaps = 47/239 (19%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
L ++AGQSN AG G D P P+ + + +W LA P++
Sbjct: 123 LFLIAGQSNSAGYGK------------DYCTDP---PHLCVHLFRNRNQWDLASHPMNES 167
Query: 86 IDVNK-------TNGVGPGLPFANAV--LTKVPNFGVIGLVPCAIGGTNISQWR-KGSSL 135
GV P L F LT +P +GL+ A GG++I +W K L
Sbjct: 168 TAAGSLPNEEMGIPGVSPYLSFGKKYYELTGMP----VGLIQTAQGGSSIERWNPKDGDL 223
Query: 136 YEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL-- 193
Y M+ + + G VLWYQG DT E A+ Y E F +L L++ L
Sbjct: 224 YGNMMNKIR---ETKGRYAGVLWYQGCEDT-RPEQAEAYGEH----FRELAEALRAALGY 275
Query: 194 -LPIIRVALASG-EGPFIE---IVRKAQLSSDL--PNVRCVDAMGLPLEPDGLHLTTPA 245
+P + L GPF E +VR+AQ + L P V + L L D +H + A
Sbjct: 276 EIPFFTMQLNRFINGPFDEAWGMVREAQRRAALSIPAVFVLPTTNLSLS-DSVHNSAQA 333
>gi|159468526|ref|XP_001692425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278138|gb|EDP03903.1| predicted protein [Chlamydomonas reinhardtii]
Length = 304
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 92 NGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-WRKGSSLYEQMIQRAQVALRGG 150
+ GP L F VL ++ G +G VP A GGTN++ W G LY+ M Q A+R
Sbjct: 169 DSCGPDLGFGR-VLLQLGVSGRVGFVPTAAGGTNLADMWCPGCPLYKDMAQTVVRAMRAA 227
Query: 151 G---TIRAVLWYQGESDTVN 167
G +R +LW QGESD N
Sbjct: 228 GPNARLRGMLWVQGESDANN 247
>gi|419166557|ref|ZP_13711006.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC6E]
gi|378006781|gb|EHV69754.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC6E]
Length = 304
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 61/134 (45%), Gaps = 28/134 (20%)
Query: 79 HEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS----------- 127
H P AD+ + VG GL A +L +P I LVPC GG +
Sbjct: 58 HHPA-ADLHKGEYGCVGQGLHIAKKLLPYIPEQAGILLVPCCRGGAAFTVGAEGMYVPDT 116
Query: 128 -------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERS 178
+W G++LYE ++ R +VAL + +V W QGE D ++ + Y++
Sbjct: 117 GATADAMRWGTGTALYEDLVARVKVALEYNRKNKLLSVCWMQGEFDLMSPD----YEKHP 172
Query: 179 DMFF---TDLRSDL 189
D+F+ T RS+L
Sbjct: 173 DLFYQMVTSFRSEL 186
>gi|254787498|ref|YP_003074927.1| acetylxylan esterase / xylanase [Teredinibacter turnerae T7901]
gi|237684484|gb|ACR11748.1| acetylxylan esterase / xylanase [Teredinibacter turnerae T7901]
Length = 952
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 34/260 (13%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL-- 82
+ ++ GQSNM G+G +++ + G++ Q N ++ + +W A PL
Sbjct: 43 HIYLMFGQSNMEGQGQISSQDQQVPT---GLLAMQADNNCTVGGASYG-EWRTATPPLIR 98
Query: 83 -HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK---------- 131
+ G+GPG F +L +GLV A G +I+ +RK
Sbjct: 99 CYNTAHAWNNGGLGPGDYFGRTMLENSGAGVRVGLVGAAYQGQSINFFRKNCAALGSCQP 158
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G+ Y M+ A+ A G I+ ++++QGESDT + + R +
Sbjct: 159 SGANGSVPGGAGGYAWMLDLARKAQEDG-VIKGIIFHQGESDT----GSSTWSSRVNEVV 213
Query: 183 TDLRSD--LQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLH 240
TDLR+D L + +P I + G R ++ S + N V A GL D H
Sbjct: 214 TDLRTDLGLSASEVPFIAGEMVPGACCTSHDARVHEIPSVVANGHYVSAAGLGSR-DQYH 272
Query: 241 LTTPAQGSTLNSWSNEALRV 260
++N+ L +
Sbjct: 273 FNAAGYREIGRRYANKMLEL 292
>gi|415806678|ref|ZP_11501582.1| hypothetical protein ECE128010_5344, partial [Escherichia coli
E128010]
gi|323158346|gb|EFZ44402.1| hypothetical protein ECE128010_5344 [Escherichia coli E128010]
Length = 354
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 83/201 (41%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G SLY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKSLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|423044841|ref|ZP_17035502.1| hypothetical protein EUMG_04433, partial [Escherichia coli O104:H4
str. 11-4632 C3]
gi|354919056|gb|EHF79011.1| hypothetical protein EUMG_04433, partial [Escherichia coli O104:H4
str. 11-4632 C3]
Length = 342
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|425271789|ref|ZP_18663281.1| hypothetical protein ECTW15901_1068, partial [Escherichia coli
TW15901]
gi|408196278|gb|EKI21564.1| hypothetical protein ECTW15901_1068, partial [Escherichia coli
TW15901]
Length = 228
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 197
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 198 ATHAQQPALFTAMLTQFRADL 218
>gi|425360100|ref|ZP_18745843.1| hypothetical protein ECEC1856_2272, partial [Escherichia coli
EC1856]
gi|408280486|gb|EKJ00035.1| hypothetical protein ECEC1856_2272, partial [Escherichia coli
EC1856]
Length = 262
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419890195|ref|ZP_14410494.1| hypothetical protein ECO9570_27646, partial [Escherichia coli
O111:H8 str. CVM9570]
gi|388355318|gb|EIL20167.1| hypothetical protein ECO9570_27646, partial [Escherichia coli
O111:H8 str. CVM9570]
Length = 254
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 57 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 115
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 116 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 175
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 176 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 231
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 232 ATHAQQPALFTAMLTQFRADL 252
>gi|425282409|ref|ZP_18673512.1| hypothetical protein ECTW00353_1059, partial [Escherichia coli
TW00353]
gi|408205086|gb|EKI29990.1| hypothetical protein ECTW00353_1059, partial [Escherichia coli
TW00353]
Length = 271
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425248895|ref|ZP_18641900.1| hypothetical protein EC5905_2541, partial [Escherichia coli 5905]
gi|408166016|gb|EKH93656.1| hypothetical protein EC5905_2541, partial [Escherichia coli 5905]
Length = 218
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 197
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 198 ATHAQQPALFTAMLTQFRADL 218
>gi|218695501|ref|YP_002403168.1| hypothetical protein EC55989_2114 [Escherichia coli 55989]
gi|417833175|ref|ZP_12479623.1| hypothetical protein HUSEC41_10452 [Escherichia coli O104:H4 str.
01-09591]
gi|218352233|emb|CAU97985.1| conserved hypothetical protein [Escherichia coli 55989]
gi|340734057|gb|EGR63187.1| hypothetical protein HUSEC41_10452 [Escherichia coli O104:H4 str.
01-09591]
Length = 626
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSTTTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y + D+F
Sbjct: 206 SARWGAGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQHPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|420275221|ref|ZP_14777526.1| hypothetical protein ECPA40_2460 [Escherichia coli PA40]
gi|421823771|ref|ZP_16259172.1| hypothetical protein ECFRIK920_2188 [Escherichia coli FRIK920]
gi|390759559|gb|EIO28941.1| hypothetical protein ECPA40_2460 [Escherichia coli PA40]
gi|408071522|gb|EKH05858.1| hypothetical protein ECFRIK920_2188 [Escherichia coli FRIK920]
Length = 315
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424462028|ref|ZP_17912622.1| hypothetical protein ECPA39_2377, partial [Escherichia coli PA39]
gi|425102752|ref|ZP_18505378.1| hypothetical protein EC52239_1370, partial [Escherichia coli
5.2239]
gi|425342022|ref|ZP_18729010.1| hypothetical protein ECEC1848_2455, partial [Escherichia coli
EC1848]
gi|425354131|ref|ZP_18740286.1| hypothetical protein ECEC1850_2448, partial [Escherichia coli
EC1850]
gi|390772308|gb|EIO40912.1| hypothetical protein ECPA39_2377, partial [Escherichia coli PA39]
gi|408262651|gb|EKI83576.1| hypothetical protein ECEC1848_2455, partial [Escherichia coli
EC1848]
gi|408278410|gb|EKI98159.1| hypothetical protein ECEC1850_2448, partial [Escherichia coli
EC1850]
gi|408557437|gb|EKK33907.1| hypothetical protein EC52239_1370, partial [Escherichia coli
5.2239]
Length = 261
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|254442451|ref|ZP_05055927.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
gi|198256759|gb|EDY81067.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
Length = 277
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 105/269 (39%), Gaps = 53/269 (19%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ--- 61
LL I +S + Q + + + GQSNM G G+ + G V P+ Q
Sbjct: 10 LLACITISN---TQAQDEDFYVFLCFGQSNMEGYPGIPESEK-------GPVDPRFQVLA 59
Query: 62 --PNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
P + R W A PL G+ P F A++ +P +G++
Sbjct: 60 AVDFPEMDRQQGH--WYTATPPLS-----RPPTGLSPADYFGRALVAGLPKDKKVGVINV 112
Query: 120 AIGGTNISQWRKGS---------------------SLYEQMIQRAQVALRGGGTIRAVLW 158
A+GGT I + + + Y ++I+ ++A + G I+ +L
Sbjct: 113 AVGGTRIELFDEATREAYLADAPDWLHNISAAYDKDPYARLIEMGKLAQK-DGVIKGILL 171
Query: 159 YQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPII--RVALASGEGP---FIEIVR 213
+QGES+T + E K DL DL + +P++ V A G EI+R
Sbjct: 172 HQGESNTGDKEWPAKVKAIYQNILRDL--DLDASDVPLLAGEVVAADQNGKCASMNEIIR 229
Query: 214 KAQLSSDLPNVRCVDAMGLPLEPDGLHLT 242
L +P V + G P PD LH +
Sbjct: 230 T--LPETIPTAHVVSSEGCPDGPDDLHFS 256
>gi|425380803|ref|ZP_18764816.1| hypothetical protein ECEC1865_3806, partial [Escherichia coli
EC1865]
gi|408295445|gb|EKJ13764.1| hypothetical protein ECEC1865_3806, partial [Escherichia coli
EC1865]
Length = 207
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 7 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 65
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 66 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 125
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 126 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 181
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 182 ATHAQQPALFTAMLTQFRADL 202
>gi|425283817|ref|ZP_18674857.1| hypothetical protein ECTW00353_2414, partial [Escherichia coli
TW00353]
gi|408201869|gb|EKI27011.1| hypothetical protein ECTW00353_2414, partial [Escherichia coli
TW00353]
Length = 271
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425193054|ref|ZP_18589939.1| hypothetical protein ECNE1487_2712, partial [Escherichia coli
NE1487]
gi|408112252|gb|EKH43918.1| hypothetical protein ECNE1487_2712, partial [Escherichia coli
NE1487]
Length = 255
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 197
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 198 ATHAQQPALFTAMLTQFRADL 218
>gi|416268859|ref|ZP_11642294.1| hypothetical protein SDB_02534 [Shigella dysenteriae CDC 74-1112]
gi|320174953|gb|EFW50069.1| hypothetical protein SDB_02534 [Shigella dysenteriae CDC 74-1112]
Length = 618
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|300901718|ref|ZP_07119774.1| conserved hypothetical protein [Escherichia coli MS 198-1]
gi|300354892|gb|EFJ70762.1| conserved hypothetical protein [Escherichia coli MS 198-1]
Length = 625
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|423049253|ref|ZP_17039910.1| hypothetical protein EUMG_01741 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|354904783|gb|EHF64872.1| hypothetical protein EUMG_01741 [Escherichia coli O104:H4 str.
11-4632 C3]
Length = 617
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|419236688|ref|ZP_13779436.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC9C]
gi|378089152|gb|EHW50997.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC9C]
Length = 281
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|423053374|ref|ZP_17042182.1| hypothetical protein EUNG_01780, partial [Escherichia coli O104:H4
str. 11-4632 C4]
gi|354919917|gb|EHF79856.1| hypothetical protein EUNG_01780, partial [Escherichia coli O104:H4
str. 11-4632 C4]
Length = 495
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|432499124|ref|ZP_19740898.1| hypothetical protein A177_01220 [Escherichia coli KTE216]
gi|433108791|ref|ZP_20294725.1| hypothetical protein WK7_04655 [Escherichia coli KTE148]
gi|431031470|gb|ELD44357.1| hypothetical protein A177_01220 [Escherichia coli KTE216]
gi|431620823|gb|ELI89649.1| hypothetical protein WK7_04655 [Escherichia coli KTE148]
Length = 626
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|419142686|ref|ZP_13687431.1| hypothetical protein ECDEC6A_2327 [Escherichia coli DEC6A]
gi|377995745|gb|EHV58859.1| hypothetical protein ECDEC6A_2327 [Escherichia coli DEC6A]
Length = 618
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|415777841|ref|ZP_11488990.1| conserved hypothetical protein [Escherichia coli 3431]
gi|315616049|gb|EFU96673.1| conserved hypothetical protein [Escherichia coli 3431]
Length = 618
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SVRWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|432960921|ref|ZP_20150966.1| hypothetical protein A15E_01882 [Escherichia coli KTE202]
gi|433062911|ref|ZP_20249848.1| hypothetical protein WIO_01731 [Escherichia coli KTE125]
gi|431477477|gb|ELH57246.1| hypothetical protein A15E_01882 [Escherichia coli KTE202]
gi|431583778|gb|ELI55770.1| hypothetical protein WIO_01731 [Escherichia coli KTE125]
Length = 618
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|429954195|ref|ZP_19420031.1| hypothetical protein S91_00569 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|429444276|gb|EKZ80222.1| hypothetical protein S91_00569 [Escherichia coli O104:H4 str.
Ec12-0466]
Length = 626
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|420379732|ref|ZP_14879207.1| hypothetical protein SD22575_1684 [Shigella dysenteriae 225-75]
gi|391303704|gb|EIQ61535.1| hypothetical protein SD22575_1684 [Shigella dysenteriae 225-75]
Length = 618
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|419080556|ref|ZP_13626017.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4A]
gi|377928925|gb|EHU92827.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4A]
Length = 315
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|407468784|ref|YP_006784774.1| hypothetical protein O3O_10525 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|407064819|gb|AFS85866.1| hypothetical protein O3O_10525 [Escherichia coli O104:H4 str.
2009EL-2071]
Length = 617
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|417311174|ref|ZP_12097960.1| hypothetical protein PPECC33_45320 [Escherichia coli PCN033]
gi|338767242|gb|EGP22076.1| hypothetical protein PPECC33_45320 [Escherichia coli PCN033]
Length = 627
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSTTTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|407481629|ref|YP_006778778.1| hypothetical protein O3K_10415 [Escherichia coli O104:H4 str.
2011C-3493]
gi|423019378|ref|ZP_17010087.1| hypothetical protein EUHG_02488 [Escherichia coli O104:H4 str.
11-4404]
gi|423024544|ref|ZP_17015241.1| hypothetical protein EUIG_02489 [Escherichia coli O104:H4 str.
11-4522]
gi|423030365|ref|ZP_17021053.1| hypothetical protein EUJG_01124 [Escherichia coli O104:H4 str.
11-4623]
gi|423038193|ref|ZP_17028867.1| hypothetical protein EUKG_02470 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423043314|ref|ZP_17033981.1| hypothetical protein EULG_02489 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423060554|ref|ZP_17049350.1| hypothetical protein EUOG_02494 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|429771304|ref|ZP_19303327.1| hypothetical protein C212_01059 [Escherichia coli O104:H4 str.
11-02030]
gi|429781234|ref|ZP_19313165.1| hypothetical protein C213_01055 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429784884|ref|ZP_19316789.1| hypothetical protein C214_01056 [Escherichia coli O104:H4 str.
11-02092]
gi|429790865|ref|ZP_19322722.1| hypothetical protein C215_01056 [Escherichia coli O104:H4 str.
11-02093]
gi|429796688|ref|ZP_19328499.1| hypothetical protein C216_01056 [Escherichia coli O104:H4 str.
11-02281]
gi|429798290|ref|ZP_19330091.1| hypothetical protein C217_01056 [Escherichia coli O104:H4 str.
11-02318]
gi|429806803|ref|ZP_19338530.1| hypothetical protein C218_01056 [Escherichia coli O104:H4 str.
11-02913]
gi|429811636|ref|ZP_19343326.1| hypothetical protein C219_01055 [Escherichia coli O104:H4 str.
11-03439]
gi|429817223|ref|ZP_19348864.1| hypothetical protein C220_01056 [Escherichia coli O104:H4 str.
11-04080]
gi|429822434|ref|ZP_19354032.1| hypothetical protein C221_01055 [Escherichia coli O104:H4 str.
11-03943]
gi|429913822|ref|ZP_19379770.1| hypothetical protein O7C_00711 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429924674|ref|ZP_19390588.1| hypothetical protein O7G_01534 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429940830|ref|ZP_19406704.1| hypothetical protein O7M_02533 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429943510|ref|ZP_19409373.1| hypothetical protein O7O_00029 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|354890735|gb|EHF50973.1| hypothetical protein EUHG_02488 [Escherichia coli O104:H4 str.
11-4404]
gi|354894070|gb|EHF54267.1| hypothetical protein EUIG_02489 [Escherichia coli O104:H4 str.
11-4522]
gi|354895695|gb|EHF55874.1| hypothetical protein EUKG_02470 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354898226|gb|EHF58381.1| hypothetical protein EUJG_01124 [Escherichia coli O104:H4 str.
11-4623]
gi|354899871|gb|EHF60009.1| hypothetical protein EULG_02489 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354913495|gb|EHF73486.1| hypothetical protein EUOG_02494 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|407053926|gb|AFS73977.1| hypothetical protein O3K_10415 [Escherichia coli O104:H4 str.
2011C-3493]
gi|429347263|gb|EKY84037.1| hypothetical protein C213_01055 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429349661|gb|EKY86397.1| hypothetical protein C214_01056 [Escherichia coli O104:H4 str.
11-02092]
gi|429360787|gb|EKY97444.1| hypothetical protein C212_01059 [Escherichia coli O104:H4 str.
11-02030]
gi|429362218|gb|EKY98865.1| hypothetical protein C215_01056 [Escherichia coli O104:H4 str.
11-02093]
gi|429363538|gb|EKZ00171.1| hypothetical protein C216_01056 [Escherichia coli O104:H4 str.
11-02281]
gi|429365607|gb|EKZ02219.1| hypothetical protein C217_01056 [Escherichia coli O104:H4 str.
11-02318]
gi|429376462|gb|EKZ12990.1| hypothetical protein C218_01056 [Escherichia coli O104:H4 str.
11-02913]
gi|429380504|gb|EKZ16993.1| hypothetical protein C221_01055 [Escherichia coli O104:H4 str.
11-03943]
gi|429380950|gb|EKZ17438.1| hypothetical protein C219_01055 [Escherichia coli O104:H4 str.
11-03439]
gi|429392725|gb|EKZ29124.1| hypothetical protein C220_01056 [Escherichia coli O104:H4 str.
11-04080]
gi|429406360|gb|EKZ42619.1| hypothetical protein O7C_00711 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429422561|gb|EKZ58675.1| hypothetical protein O7G_01534 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429424933|gb|EKZ61030.1| hypothetical protein O7M_02533 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429446350|gb|EKZ82280.1| hypothetical protein O7O_00029 [Escherichia coli O104:H4 str.
Ec11-6006]
Length = 617
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|422994835|ref|ZP_16985599.1| hypothetical protein EUBG_02486 [Escherichia coli O104:H4 str.
C236-11]
gi|423010152|ref|ZP_17000886.1| hypothetical protein EUFG_02478 [Escherichia coli O104:H4 str.
11-3677]
gi|354861670|gb|EHF22108.1| hypothetical protein EUBG_02486 [Escherichia coli O104:H4 str.
C236-11]
gi|354879635|gb|EHF39971.1| hypothetical protein EUFG_02478 [Escherichia coli O104:H4 str.
11-3677]
Length = 617
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|417175963|ref|ZP_12005759.1| PF08410 domain protein [Escherichia coli 3.2608]
gi|386178655|gb|EIH56134.1| PF08410 domain protein [Escherichia coli 3.2608]
Length = 315
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419148676|ref|ZP_13693338.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6B]
gi|377994218|gb|EHV57346.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6B]
Length = 617
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|422987944|ref|ZP_16978717.1| hypothetical protein EUAG_03059, partial [Escherichia coli O104:H4
str. C227-11]
gi|423053579|ref|ZP_17042386.1| hypothetical protein EUNG_03296, partial [Escherichia coli O104:H4
str. 11-4632 C4]
gi|354866955|gb|EHF27377.1| hypothetical protein EUAG_03059, partial [Escherichia coli O104:H4
str. C227-11]
gi|354919408|gb|EHF79356.1| hypothetical protein EUNG_03296, partial [Escherichia coli O104:H4
str. 11-4632 C4]
Length = 478
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|407469491|ref|YP_006784067.1| hypothetical protein O3O_14110 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|407481847|ref|YP_006778996.1| hypothetical protein O3K_11525 [Escherichia coli O104:H4 str.
2011C-3493]
gi|410482397|ref|YP_006769943.1| hypothetical protein O3M_11490 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|422987741|ref|ZP_16978517.1| hypothetical protein EUAG_04729 [Escherichia coli O104:H4 str.
C227-11]
gi|422994624|ref|ZP_16985388.1| hypothetical protein EUBG_02275 [Escherichia coli O104:H4 str.
C236-11]
gi|423009937|ref|ZP_17000675.1| hypothetical protein EUFG_02274 [Escherichia coli O104:H4 str.
11-3677]
gi|423019166|ref|ZP_17009875.1| hypothetical protein EUHG_02276 [Escherichia coli O104:H4 str.
11-4404]
gi|423024332|ref|ZP_17015029.1| hypothetical protein EUIG_02277 [Escherichia coli O104:H4 str.
11-4522]
gi|423030149|ref|ZP_17020837.1| hypothetical protein EUJG_00908 [Escherichia coli O104:H4 str.
11-4623]
gi|423037981|ref|ZP_17028655.1| hypothetical protein EUKG_02258 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423043102|ref|ZP_17033769.1| hypothetical protein EULG_02277 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423060340|ref|ZP_17049136.1| hypothetical protein EUOG_02280 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|429719196|ref|ZP_19254136.1| hypothetical protein MO3_01921 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429724541|ref|ZP_19259409.1| hypothetical protein MO5_00528 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429776239|ref|ZP_19308224.1| hypothetical protein C212_00843 [Escherichia coli O104:H4 str.
11-02030]
gi|429780692|ref|ZP_19312639.1| hypothetical protein C213_00840 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429783279|ref|ZP_19315195.1| hypothetical protein C214_00843 [Escherichia coli O104:H4 str.
11-02092]
gi|429790457|ref|ZP_19322326.1| hypothetical protein C215_00841 [Escherichia coli O104:H4 str.
11-02093]
gi|429794419|ref|ZP_19326260.1| hypothetical protein C216_00841 [Escherichia coli O104:H4 str.
11-02281]
gi|429798072|ref|ZP_19329876.1| hypothetical protein C217_00841 [Escherichia coli O104:H4 str.
11-02318]
gi|429806585|ref|ZP_19338315.1| hypothetical protein C218_00841 [Escherichia coli O104:H4 str.
11-02913]
gi|429810937|ref|ZP_19342638.1| hypothetical protein C219_00842 [Escherichia coli O104:H4 str.
11-03439]
gi|429816377|ref|ZP_19348035.1| hypothetical protein C220_00841 [Escherichia coli O104:H4 str.
11-04080]
gi|429821064|ref|ZP_19352678.1| hypothetical protein C221_00840 [Escherichia coli O104:H4 str.
11-03943]
gi|429912739|ref|ZP_19378695.1| hypothetical protein MO7_00511 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429913609|ref|ZP_19379557.1| hypothetical protein O7C_00498 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429918651|ref|ZP_19384584.1| hypothetical protein O7E_00515 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429928396|ref|ZP_19394298.1| hypothetical protein O7I_00192 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429934949|ref|ZP_19400836.1| hypothetical protein O7K_01761 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429940619|ref|ZP_19406493.1| hypothetical protein O7M_02322 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429948252|ref|ZP_19414107.1| hypothetical protein O7O_04855 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429950897|ref|ZP_19416745.1| hypothetical protein S7Y_02320 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|354865699|gb|EHF26128.1| hypothetical protein EUBG_02275 [Escherichia coli O104:H4 str.
C236-11]
gi|354869868|gb|EHF30276.1| hypothetical protein EUAG_04729 [Escherichia coli O104:H4 str.
C227-11]
gi|354881305|gb|EHF41635.1| hypothetical protein EUFG_02274 [Escherichia coli O104:H4 str.
11-3677]
gi|354891608|gb|EHF51836.1| hypothetical protein EUHG_02276 [Escherichia coli O104:H4 str.
11-4404]
gi|354894493|gb|EHF54687.1| hypothetical protein EUIG_02277 [Escherichia coli O104:H4 str.
11-4522]
gi|354896775|gb|EHF56944.1| hypothetical protein EUKG_02258 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354899740|gb|EHF59884.1| hypothetical protein EUJG_00908 [Escherichia coli O104:H4 str.
11-4623]
gi|354901899|gb|EHF62023.1| hypothetical protein EULG_02277 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354914564|gb|EHF74548.1| hypothetical protein EUOG_02280 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|406777559|gb|AFS56983.1| hypothetical protein O3M_11490 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|407054144|gb|AFS74195.1| hypothetical protein O3K_11525 [Escherichia coli O104:H4 str.
2011C-3493]
gi|407065526|gb|AFS86573.1| hypothetical protein O3O_14110 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|429347985|gb|EKY84757.1| hypothetical protein C212_00843 [Escherichia coli O104:H4 str.
11-02030]
gi|429350493|gb|EKY87224.1| hypothetical protein C213_00840 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429354666|gb|EKY91362.1| hypothetical protein C214_00843 [Escherichia coli O104:H4 str.
11-02092]
gi|429364785|gb|EKZ01404.1| hypothetical protein C215_00841 [Escherichia coli O104:H4 str.
11-02093]
gi|429372435|gb|EKZ08985.1| hypothetical protein C216_00841 [Escherichia coli O104:H4 str.
11-02281]
gi|429374385|gb|EKZ10925.1| hypothetical protein C217_00841 [Escherichia coli O104:H4 str.
11-02318]
gi|429377714|gb|EKZ14232.1| hypothetical protein C218_00841 [Escherichia coli O104:H4 str.
11-02913]
gi|429384490|gb|EKZ20947.1| hypothetical protein C219_00842 [Escherichia coli O104:H4 str.
11-03439]
gi|429386574|gb|EKZ23022.1| hypothetical protein C221_00840 [Escherichia coli O104:H4 str.
11-03943]
gi|429394193|gb|EKZ30574.1| hypothetical protein MO3_01921 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429394489|gb|EKZ30865.1| hypothetical protein MO5_00528 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429396498|gb|EKZ32850.1| hypothetical protein C220_00841 [Escherichia coli O104:H4 str.
11-04080]
gi|429407373|gb|EKZ43626.1| hypothetical protein O7C_00498 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429418766|gb|EKZ54908.1| hypothetical protein O7K_01761 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429426364|gb|EKZ62453.1| hypothetical protein O7M_02322 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429426770|gb|EKZ62857.1| hypothetical protein O7I_00192 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429431334|gb|EKZ67383.1| hypothetical protein O7E_00515 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429440696|gb|EKZ76673.1| hypothetical protein O7O_04855 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429449903|gb|EKZ85801.1| hypothetical protein S7Y_02320 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429453766|gb|EKZ89634.1| hypothetical protein MO7_00511 [Escherichia coli O104:H4 str.
Ec11-9941]
Length = 626
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|432583833|ref|ZP_19820234.1| hypothetical protein A1SM_03055 [Escherichia coli KTE57]
gi|431117003|gb|ELE20275.1| hypothetical protein A1SM_03055 [Escherichia coli KTE57]
Length = 626
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 146 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 205
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 206 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 261
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 262 MLKQFRTDL 270
>gi|429928609|ref|ZP_19394511.1| hypothetical protein O7I_00405 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429420770|gb|EKZ56893.1| hypothetical protein O7I_00405 [Escherichia coli O104:H4 str.
Ec11-4987]
Length = 618
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|425271653|ref|ZP_18663148.1| hypothetical protein ECTW15901_0933, partial [Escherichia coli
TW15901]
gi|408196981|gb|EKI22254.1| hypothetical protein ECTW15901_0933, partial [Escherichia coli
TW15901]
Length = 224
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 19 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 77
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 78 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 137
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 138 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 193
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 194 ATHAQQPALFTAMLTQFRADL 214
>gi|419098155|ref|ZP_13643369.1| hypothetical protein ECDEC4D_2136, partial [Escherichia coli DEC4D]
gi|377944940|gb|EHV08640.1| hypothetical protein ECDEC4D_2136, partial [Escherichia coli DEC4D]
Length = 266
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMFFT---DLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTTMLAQFRADL 261
>gi|356534309|ref|XP_003535699.1| PREDICTED: probable carbohydrate esterase At4g34215-like [Glycine
max]
Length = 147
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/46 (60%), Positives = 32/46 (69%), Gaps = 1/46 (2%)
Query: 196 IIRVALASGEGPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHL 241
++RVALASG E VR+AQ DLPNV CVDA GL L+ D LHL
Sbjct: 97 LLRVALASGSD-HTEKVREAQKVIDLPNVICVDAKGLQLKEDNLHL 141
>gi|417809020|ref|ZP_12455740.1| hypothetical protein HUSEC_28704 [Escherichia coli O104:H4 str.
LB226692]
gi|340736397|gb|EGR70857.1| hypothetical protein HUSEC_28704 [Escherichia coli O104:H4 str.
LB226692]
Length = 617
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|421829520|ref|ZP_16264846.1| hypothetical protein ECPA7_1681 [Escherichia coli PA7]
gi|408071494|gb|EKH05838.1| hypothetical protein ECPA7_1681 [Escherichia coli PA7]
Length = 272
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 142 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 197
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 198 ATHAQQPALFTAMLTQFRADL 218
>gi|347755320|ref|YP_004862884.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347587838|gb|AEP12368.1| protein of unknown function (DUF303) [Candidatus
Chloracidobacterium thermophilum B]
Length = 367
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 82/193 (42%), Gaps = 25/193 (12%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
++ I+AGQSN AG T + + L G+V L W H+P
Sbjct: 142 EVFIVAGQSNAAGSC-TTLFSAASPLVRTGLVDED-----------GHLTWRTGHDP--- 186
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGV-IGLVPCAIGGTNISQWRKGSSLYEQMIQRA 143
NG G P +L V G +G V A+GG++I W G+ +++++Q
Sbjct: 187 ----QVLNGGGSVWPLVGDLL--VQRLGTPVGFVNVAVGGSSIRDWAPGAPHFQRLVQVL 240
Query: 144 QVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS 203
Q G RA+LW+QGESD+ D + + + T ++PL ++ A +
Sbjct: 241 QTL--GPHGARAILWHQGESDSAMAADEYATRLTAIIEATRAAVRTETPLTWVVARA-SF 297
Query: 204 GEGPFIEIVRKAQ 216
EG VR Q
Sbjct: 298 KEGQTFAGVRDGQ 310
>gi|429918853|ref|ZP_19384785.1| hypothetical protein O7E_00716, partial [Escherichia coli O104:H4
str. Ec11-5604]
gi|429430134|gb|EKZ66200.1| hypothetical protein O7E_00716, partial [Escherichia coli O104:H4
str. Ec11-5604]
Length = 485
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|417276637|ref|ZP_12063964.1| PF08410 domain protein, partial [Escherichia coli 3.2303]
gi|386240572|gb|EII77495.1| PF08410 domain protein, partial [Escherichia coli 3.2303]
Length = 297
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|421825439|ref|ZP_16260795.1| hypothetical protein ECFRIK920_3842 [Escherichia coli FRIK920]
gi|408066007|gb|EKH00473.1| hypothetical protein ECFRIK920_3842 [Escherichia coli FRIK920]
Length = 315
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 83/204 (40%), Gaps = 40/204 (19%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMFFTDLRSD-LQSPLL 194
+ ++ +F L S L SP L
Sbjct: 241 ATHAQQPALFTAMLHSFVLTSPCL 264
>gi|419108652|ref|ZP_13653748.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4F]
gi|377963492|gb|EHV26938.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4F]
Length = 315
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417866938|ref|ZP_12511977.1| hypothetical protein C22711_3867 [Escherichia coli O104:H4 str.
C227-11]
gi|341920227|gb|EGT69835.1| hypothetical protein C22711_3867 [Escherichia coli O104:H4 str.
C227-11]
Length = 489
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 124
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 125 MLKQFRTDL 133
>gi|424749660|ref|ZP_18177744.1| hypothetical protein CFSAN001629_12445, partial [Escherichia coli
O26:H11 str. CFSAN001629]
gi|424771559|ref|ZP_18198696.1| hypothetical protein CFSAN001632_14662, partial [Escherichia coli
O111:H8 str. CFSAN001632]
gi|421939970|gb|EKT97457.1| hypothetical protein CFSAN001632_14662, partial [Escherichia coli
O111:H8 str. CFSAN001632]
gi|421941709|gb|EKT99090.1| hypothetical protein CFSAN001629_12445, partial [Escherichia coli
O26:H11 str. CFSAN001629]
Length = 278
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|417865440|ref|ZP_12510484.1| hypothetical protein C22711_2372 [Escherichia coli O104:H4 str.
C227-11]
gi|341918729|gb|EGT68342.1| hypothetical protein C22711_2372 [Escherichia coli O104:H4 str.
C227-11]
Length = 552
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 72 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 131
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 132 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 187
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 188 MLKQFRTDL 196
>gi|419896435|ref|ZP_14416127.1| hypothetical protein ECO9574_16838, partial [Escherichia coli
O111:H8 str. CVM9574]
gi|388357769|gb|EIL22292.1| hypothetical protein ECO9574_16838, partial [Escherichia coli
O111:H8 str. CVM9574]
Length = 161
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 33 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 92
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 93 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 148
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 149 MLTQFRADL 157
>gi|417298924|ref|ZP_12086162.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386257963|gb|EIJ13446.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 455
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419031857|ref|ZP_13578990.1| hypothetical protein ECDEC2C_4946, partial [Escherichia coli DEC2C]
gi|377871271|gb|EHU35936.1| hypothetical protein ECDEC2C_4946, partial [Escherichia coli DEC2C]
Length = 317
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424753268|ref|ZP_18181227.1| hypothetical protein CFSAN001629_23061, partial [Escherichia coli
O26:H11 str. CFSAN001629]
gi|421935836|gb|EKT93517.1| hypothetical protein CFSAN001629_23061, partial [Escherichia coli
O26:H11 str. CFSAN001629]
Length = 302
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419221213|ref|ZP_13764150.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
gi|378068021|gb|EHW30127.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
Length = 153
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 124
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 125 MLTQFRADL 133
>gi|420291837|ref|ZP_14793984.1| yjhS [Escherichia coli TW11039]
gi|390799658|gb|EIO66793.1| yjhS [Escherichia coli TW11039]
Length = 240
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 62 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 121
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 122 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 177
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 178 MLTQFRADL 186
>gi|291527415|emb|CBK93001.1| Domain of unknown function (DUF303) [Eubacterium rectale M104/1]
Length = 266
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 78/178 (43%), Gaps = 26/178 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILR-LTAKLKWVLAHEPLHA 84
LI+ AGQSNMAGRG VT+ W P + R +TA + EP A
Sbjct: 6 LILFAGQSNMAGRGIVTD-------KWPQKAPVLVKGAGYEYRAITAPDRLCPIEEPFGA 58
Query: 85 DIDVNKTNGV-GPGLPFANAVLTKVPNFGVIGLVP-----CAIGGTNISQWRKGSSLYEQ 138
D N +G+ PG+ + V V + + +P + GG++IS+W+ +
Sbjct: 59 --DENNPDGIFEPGMKTGSMVTAFVNEYYKLTHIPVLAVSASKGGSSISEWQGNNDFLSD 116
Query: 139 MIQR----AQVALRGGGTIRA--VLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQ 190
I R + A + IR VLW QGE+D D + Y + F ++ S LQ
Sbjct: 117 AIARYRKATEYAQKNHIEIRHKYVLWCQGETDGDRATDIEAYGK----LFINMFSQLQ 170
>gi|419033191|ref|ZP_13580289.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC2D]
gi|377883610|gb|EHU48128.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC2D]
Length = 315
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|423053375|ref|ZP_17042183.1| hypothetical protein EUNG_01781 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|354919732|gb|EHF79673.1| hypothetical protein EUNG_01781 [Escherichia coli O104:H4 str.
11-4632 C4]
Length = 486
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 6 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 65
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ D+F
Sbjct: 66 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPDLFTA 121
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 122 MLKQFRTDL 130
>gi|419109203|ref|ZP_13654277.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4F]
gi|377959857|gb|EHV23349.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4F]
Length = 326
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|197103936|ref|YP_002129313.1| hypothetical protein PHZ_c0470 [Phenylobacterium zucineum HLK1]
gi|196477356|gb|ACG76884.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
Length = 267
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 106/254 (41%), Gaps = 37/254 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHAD 85
++++AGQSN G G LT + P P+P + R+ ++ +P+ A
Sbjct: 32 IVVVAGQSNALGYG----------LTAADLPPSLASPDPDV-RIWDGARF----QPMAAG 76
Query: 86 IDVN---KTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS-----QWRKGS---- 133
+ + GP FA A P+ + +V A G T ++ W G+
Sbjct: 77 RNTGFGPQPGAWGPEAGFARAWRAAHPD-APLHVVKFARGSTPLAASPGRDWSPGTQELF 135
Query: 134 SLYEQMIQRAQVALR-GGGTIR--AVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQ 190
+ I+ A+ AL GG R A+LW QGE+D V+ A Y +R D
Sbjct: 136 AAATTEIEEAKAALAVNGGPARVVAILWVQGEADAVDPAKAAAYGPNLAGLIQAIRRDWS 195
Query: 191 SPLLPIIRVALASGEG-PFIEIVRKAQLSSDLPNVR--CVDAMGLPLEPDGLHLTTPAQG 247
S PI V +G G P+ + VR Q + P R VD LP + DGLH+ Q
Sbjct: 196 S-EAPI--VVGQTGPGLPYAKAVRAGQAAVASPEGRVAVVDTGPLPRQADGLHIAAEGQA 252
Query: 248 STLNSWSNEALRVN 261
+ + A R++
Sbjct: 253 RLGAAMAEAAQRLS 266
>gi|419069874|ref|ZP_13615505.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC3E]
gi|377913402|gb|EHU77540.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC3E]
Length = 337
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419321519|ref|ZP_13863255.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC12B]
gi|378173770|gb|EHX34604.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC12B]
Length = 341
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|317474170|ref|ZP_07933447.1| alpha-L-arabinofuranosidase [Bacteroides eggerthii 1_2_48FAA]
gi|316909741|gb|EFV31418.1| alpha-L-arabinofuranosidase [Bacteroides eggerthii 1_2_48FAA]
Length = 976
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 37/197 (18%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-- 131
KW A P+ + N +GP F ++ +P+ +G++ ++ G I W +
Sbjct: 36 KWYTAIPPI-----CREGNNLGPVDFFGRKMIDILPSEYHVGVINVSVAGAKIQLWDRED 90
Query: 132 -------------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAK 172
G + YE+++ A++A + G I+ +L +QGES N ED
Sbjct: 91 YKDYIDNERDWMKNIVSQYGGNPYERLVNMARLAQK-DGVIKGILMHQGES---NSEDP- 145
Query: 173 LYKERSDMFFTDLRSDL-----QSPLLP-IIRVALASGEGPFIEIVRKAQLSSDLPNVRC 226
L+ ER + +L DL Q+PLL ++ A G +L LPN
Sbjct: 146 LWPERVKKIYDNLCKDLNLNPKQTPLLAGELKYAEQGGVCAAFNSSIMPKLPKVLPNAHI 205
Query: 227 VDAMGLPLEPDGLHLTT 243
+ A+G D H +T
Sbjct: 206 ISALGCESTGDQFHFST 222
>gi|218130640|ref|ZP_03459444.1| hypothetical protein BACEGG_02229 [Bacteroides eggerthii DSM 20697]
gi|217986984|gb|EEC53315.1| hypothetical protein BACEGG_02229 [Bacteroides eggerthii DSM 20697]
Length = 1019
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 37/197 (18%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK-- 131
KW A P+ + N +GP F ++ +P+ +G++ ++ G I W +
Sbjct: 79 KWYTAIPPI-----CREGNNLGPVDFFGRKMIDILPSEYHVGVINVSVAGAKIQLWDRED 133
Query: 132 -------------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAK 172
G + YE+++ A++A + G I+ +L +QGES N ED
Sbjct: 134 YKDYIDNERDWMKNIVSQYGGNPYERLVNMARLAQK-DGVIKGILMHQGES---NSEDP- 188
Query: 173 LYKERSDMFFTDLRSDL-----QSPLLP-IIRVALASGEGPFIEIVRKAQLSSDLPNVRC 226
L+ ER + +L DL Q+PLL ++ A G +L LPN
Sbjct: 189 LWPERVKKIYDNLCKDLNLNPKQTPLLAGELKYAEQGGVCAAFNSSIMPKLPKVLPNAHI 248
Query: 227 VDAMGLPLEPDGLHLTT 243
+ A+G D H +T
Sbjct: 249 ISALGCESTGDQFHFST 265
>gi|419009261|ref|ZP_13556682.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC1C]
gi|377841840|gb|EHU06900.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC1C]
Length = 328
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419265802|ref|ZP_13808182.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10C]
gi|378116827|gb|EHW78346.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10C]
Length = 315
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417600069|ref|ZP_12250679.1| hypothetical protein EC30301_5330 [Escherichia coli 3030-1]
gi|345345394|gb|EGW77734.1| hypothetical protein EC30301_5330 [Escherichia coli 3030-1]
Length = 318
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417125315|ref|ZP_11973456.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386145907|gb|EIG92362.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 455
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|374598537|ref|ZP_09671539.1| protein of unknown function DUF303 acetylesterase [Myroides
odoratus DSM 2801]
gi|423323222|ref|ZP_17301064.1| hypothetical protein HMPREF9716_00421 [Myroides odoratimimus CIP
103059]
gi|373910007|gb|EHQ41856.1| protein of unknown function DUF303 acetylesterase [Myroides
odoratus DSM 2801]
gi|404609688|gb|EKB09053.1| hypothetical protein HMPREF9716_00421 [Myroides odoratimimus CIP
103059]
Length = 364
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 116 LVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRG--GGTIRAVLWYQGESDTVNLEDAKL 173
++P G++I+ W+KG +LY I+R L G + A+LW+ GE++
Sbjct: 114 IIPAGYSGSSIANWKKGGNLYTDAIERVNYVLDNIHGSRVVAILWHHGEANV----GWAP 169
Query: 174 YKERSDMFFTDLRSDLQ 190
Y+E D D+RSD+
Sbjct: 170 YQETLDTMIADMRSDIH 186
>gi|15801507|ref|NP_287524.1| phage protein YjhS encoded within prophage CP-933O [Escherichia
coli O157:H7 str. EDL933]
gi|12515009|gb|AAG56136.1|AE005344_12 similar to conserved hypothetical phage protein YjhS encoded within
prophage CP-933O [Escherichia coli O157:H7 str. EDL933]
Length = 316
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|424550449|ref|ZP_17992406.1| hypothetical protein ECEC4439_2294, partial [Escherichia coli
EC4439]
gi|424575244|ref|ZP_18015427.1| hypothetical protein ECEC1845_2273, partial [Escherichia coli
EC1845]
gi|425155835|ref|ZP_18555174.1| hypothetical protein ECPA34_2436, partial [Escherichia coli PA34]
gi|390881111|gb|EIP41745.1| hypothetical protein ECEC4439_2294, partial [Escherichia coli
EC4439]
gi|390922479|gb|EIP80554.1| hypothetical protein ECEC1845_2273, partial [Escherichia coli
EC1845]
gi|408077713|gb|EKH11905.1| hypothetical protein ECPA34_2436, partial [Escherichia coli PA34]
Length = 260
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 81/200 (40%), Gaps = 42/200 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSD 188
+ ++ +F T R+D
Sbjct: 241 ATHAQQPALFTAMLTQFRAD 260
>gi|419254750|ref|ZP_13797273.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC10A]
gi|378101792|gb|EHW63476.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC10A]
Length = 336
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419248961|ref|ZP_13791551.1| hypothetical protein ECDEC9E_2184, partial [Escherichia coli DEC9E]
gi|378096853|gb|EHW58621.1| hypothetical protein ECDEC9E_2184, partial [Escherichia coli DEC9E]
Length = 304
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|419230799|ref|ZP_13773593.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|378083048|gb|EHW44985.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
Length = 455
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|425379245|ref|ZP_18763370.1| hypothetical protein ECEC1865_2326, partial [Escherichia coli
EC1865]
gi|408298944|gb|EKJ16832.1| hypothetical protein ECEC1865_2326, partial [Escherichia coli
EC1865]
Length = 163
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHVQQPALFTA 149
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 150 MLTQFRADL 158
>gi|420274453|ref|ZP_14776774.1| hypothetical protein ECPA40_1697 [Escherichia coli PA40]
gi|390760642|gb|EIO29955.1| hypothetical protein ECPA40_1697 [Escherichia coli PA40]
Length = 315
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419248556|ref|ZP_13791153.1| hypothetical protein ECDEC9E_1786 [Escherichia coli DEC9E]
gi|378098298|gb|EHW60040.1| hypothetical protein ECDEC9E_1786 [Escherichia coli DEC9E]
Length = 313
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417756044|ref|ZP_12404126.1| hypothetical protein ECDEC2B_2367, partial [Escherichia coli DEC2B]
gi|377875338|gb|EHU39951.1| hypothetical protein ECDEC2B_2367, partial [Escherichia coli DEC2B]
Length = 311
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|424480767|ref|ZP_17929819.1| hypothetical protein ECTW07945_2337, partial [Escherichia coli
TW07945]
gi|390797492|gb|EIO64739.1| hypothetical protein ECTW07945_2337, partial [Escherichia coli
TW07945]
Length = 173
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 72 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 127
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 128 MLTQFRADL 136
>gi|419230444|ref|ZP_13773250.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|378084445|gb|EHW46354.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
Length = 455
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSASTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|419200755|ref|ZP_13744011.1| hypothetical protein ECDEC8A_5831 [Escherichia coli DEC8A]
gi|378038652|gb|EHW01165.1| hypothetical protein ECDEC8A_5831 [Escherichia coli DEC8A]
Length = 285
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|417622888|ref|ZP_12273200.1| hypothetical protein ECSTECH18_1641 [Escherichia coli STEC_H.1.8]
gi|345381092|gb|EGX12979.1| hypothetical protein ECSTECH18_1641 [Escherichia coli STEC_H.1.8]
Length = 616
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 83/201 (41%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYDCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417600109|ref|ZP_12250714.1| hypothetical protein EC30301_5365, partial [Escherichia coli
3030-1]
gi|345345104|gb|EGW77454.1| hypothetical protein EC30301_5365 [Escherichia coli 3030-1]
Length = 385
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425300368|ref|ZP_18690313.1| hypothetical protein EC07798_2224, partial [Escherichia coli 07798]
gi|408216830|gb|EKI41140.1| hypothetical protein EC07798_2224, partial [Escherichia coli 07798]
Length = 169
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 127
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 128 MLTQFRADL 136
>gi|15801549|ref|NP_287566.1| phage protein YjhS encoded within prophage CP-933O [Escherichia
coli O157:H7 str. EDL933]
gi|12515060|gb|AAG56178.1|AE005347_10 hypothetical phage protein similar to YjhS encoded within prophage
CP-933O [Escherichia coli O157:H7 str. EDL933]
Length = 316
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 78/197 (39%), Gaps = 38/197 (19%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPXADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD---TVNL 168
W G LY+ +I R + AL+ + AV W QGE D +
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDMSAATHA 244
Query: 169 EDAKLYKERSDMFFTDL 185
+ L+ F DL
Sbjct: 245 QQPALFTAMLXQFRADL 261
>gi|416825847|ref|ZP_11896956.1| hypothetical protein ECO5905_20963, partial [Escherichia coli
O55:H7 str. USDA 5905]
gi|320659548|gb|EFX27117.1| hypothetical protein ECO5905_20963 [Escherichia coli O55:H7 str.
USDA 5905]
Length = 415
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|293370621|ref|ZP_06617173.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
gi|292634355|gb|EFF52892.1| conserved domain protein [Bacteroides ovatus SD CMC 3f]
Length = 607
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 89/215 (41%), Gaps = 36/215 (16%)
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
K +W A PL G+ P F ++ +P IG+V AIGG I ++K
Sbjct: 33 KGEWYPARAPL-----CRPNTGLTPADYFGRTLVENLPPHVRIGVVHVAIGGCRIELFQK 87
Query: 132 ---------------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED 170
+ Y ++++ A++A + G I+ +L +QGES+T + E
Sbjct: 88 DKCEEYIKTAPDWMVNTLKEYDNDPYTRLVEMARIAQK-SGVIKGILLHQGESNTGDKEW 146
Query: 171 AKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFI-----EIVRKAQLSSDLPNVR 225
++ K D DL LQ+ +P+I + + + + E++ A L + N
Sbjct: 147 SQKVKSVYDNLLADLH--LQADEVPLIAGEVVNADHGGVCAGMNEVI--AMLPQVIKNCA 202
Query: 226 CVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRV 260
V + GL PD LH ++ +AL +
Sbjct: 203 IVSSKGLSCAPDHLHFDAAGYRVLGRRYAAQALHL 237
>gi|417192950|ref|ZP_12014797.1| PF08410 domain protein, partial [Escherichia coli 4.0522]
gi|386190131|gb|EIH78879.1| PF08410 domain protein, partial [Escherichia coli 4.0522]
Length = 415
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419030121|ref|ZP_13577279.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
gi|377876458|gb|EHU41061.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
Length = 432
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|419104350|ref|ZP_13649487.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC4E]
gi|377948728|gb|EHV12373.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC4E]
Length = 427
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|416776361|ref|ZP_11874763.1| hypothetical protein ECO5101_19667, partial [Escherichia coli
O157:H7 str. G5101]
gi|320640716|gb|EFX10232.1| hypothetical protein ECO5101_19667 [Escherichia coli O157:H7 str.
G5101]
Length = 273
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419338573|ref|ZP_13880059.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
gi|378193477|gb|EHX54016.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
Length = 617
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419200969|ref|ZP_13744210.1| hypothetical protein ECDEC8A_6036, partial [Escherichia coli DEC8A]
gi|378036519|gb|EHV99060.1| hypothetical protein ECDEC8A_6036, partial [Escherichia coli DEC8A]
Length = 400
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|194430405|ref|ZP_03062890.1| YjhS [Escherichia coli B171]
gi|194411543|gb|EDX27880.1| YjhS [Escherichia coli B171]
Length = 617
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424761819|ref|ZP_18189353.1| hypothetical protein CFSAN001630_17115, partial [Escherichia coli
O111:H11 str. CFSAN001630]
gi|421942005|gb|EKT99369.1| hypothetical protein CFSAN001630_17115, partial [Escherichia coli
O111:H11 str. CFSAN001630]
Length = 372
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425366229|ref|ZP_18751526.1| hypothetical protein ECEC1862_2274, partial [Escherichia coli
EC1862]
gi|408292342|gb|EKJ10889.1| hypothetical protein ECEC1862_2274, partial [Escherichia coli
EC1862]
Length = 215
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 194
>gi|329851877|ref|ZP_08266558.1| hypothetical protein ABI_46470 [Asticcacaulis biprosthecum C19]
gi|328839726|gb|EGF89299.1| hypothetical protein ABI_46470 [Asticcacaulis biprosthecum C19]
Length = 284
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 104/258 (40%), Gaps = 51/258 (19%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPL-H 83
+ +LAGQSNMAGRG + + L P+P I + A +P+ H
Sbjct: 28 HVFLLAGQSNMAGRGVIPQPMDADGL-----------PSPDIFMWDPDAGIIPATDPIPH 76
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI--------SQWRK---- 131
+ V K +GPGL FA A + G + LV A GGT +W K
Sbjct: 77 PERGV-KPTAIGPGLSFAKA--WRAAKGGRVLLVGAAWGGTGFFVKVPKYGQRWLKTADP 133
Query: 132 --GSSLYEQMIQRAQVALR-----GGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTD 184
G L+ + RA A++ G T +LW+QGESD +++ Y
Sbjct: 134 TVGGDLFRGAVTRANAAIKAARATGPVTFGGILWHQGESD-ISIGAMAGYATAHVELMQA 192
Query: 185 LRSDLQSPL-LPIIRVALA----SGEGPFIEIVRKAQL----------SSDLPNVRCVDA 229
LR+++ PI+ L + EG ++ + QL LP+ V +
Sbjct: 193 LRTEITGAADAPIVVGELTPQYLAREGEALQKLDPEQLRLFLNYIHNIDKHLPHAGWVSS 252
Query: 230 MGLPLEP-DGLHLTTPAQ 246
GL +P D +H AQ
Sbjct: 253 AGLTCKPGDPVHFDAAAQ 270
>gi|419050526|ref|ZP_13597418.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3B]
gi|377897739|gb|EHU62114.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3B]
Length = 259
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419091002|ref|ZP_13636319.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4C]
gi|377949161|gb|EHV12801.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4C]
Length = 315
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419268410|ref|ZP_13810757.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10C]
gi|378109490|gb|EHW71097.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10C]
Length = 253
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419079992|ref|ZP_13625462.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
gi|377930780|gb|EHU94656.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
Length = 500
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 61 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 119
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 120 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 179
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 180 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 235
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 236 ATHAQQPALFTAMLTQFRADL 256
>gi|424114252|ref|ZP_17848379.1| hypothetical protein ECPA3_1207, partial [Escherichia coli PA3]
gi|425136767|ref|ZP_18537473.1| hypothetical protein EC100833_1421, partial [Escherichia coli
10.0833]
gi|425255267|ref|ZP_18647847.1| hypothetical protein ECCB7326_2889, partial [Escherichia coli
CB7326]
gi|390687638|gb|EIN62817.1| hypothetical protein ECPA3_1207, partial [Escherichia coli PA3]
gi|408176197|gb|EKI03067.1| hypothetical protein ECCB7326_2889, partial [Escherichia coli
CB7326]
gi|408589072|gb|EKK63606.1| hypothetical protein EC100833_1421, partial [Escherichia coli
10.0833]
Length = 258
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|424486946|ref|ZP_17935584.1| hypothetical protein ECTW09098_2421, partial [Escherichia coli
TW09098]
gi|424544183|ref|ZP_17986718.1| hypothetical protein ECEC4402_2343, partial [Escherichia coli
EC4402]
gi|390810636|gb|EIO77385.1| hypothetical protein ECTW09098_2421, partial [Escherichia coli
TW09098]
gi|390874345|gb|EIP35479.1| hypothetical protein ECEC4402_2343, partial [Escherichia coli
EC4402]
Length = 257
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|425294546|ref|ZP_18684842.1| hypothetical protein ECPA38_2300, partial [Escherichia coli PA38]
gi|408220765|gb|EKI44780.1| hypothetical protein ECPA38_2300, partial [Escherichia coli PA38]
Length = 254
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419214926|ref|ZP_13757946.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC8D]
gi|378066310|gb|EHW28447.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC8D]
Length = 367
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|418997887|ref|ZP_13545480.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC1A]
gi|377842948|gb|EHU07994.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC1A]
Length = 348
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|424498862|ref|ZP_17946111.1| hypothetical protein ECEC4203_1183, partial [Escherichia coli
EC4203]
gi|390835996|gb|EIP00608.1| hypothetical protein ECEC4203_1183, partial [Escherichia coli
EC4203]
Length = 212
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 194
>gi|419271203|ref|ZP_13813531.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10D]
gi|378121225|gb|EHW82683.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC10D]
Length = 190
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKSVLLAVCWMQGEFDM----SAATHAQQPALFTA 127
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 128 MLTQFRADL 136
>gi|281422538|ref|ZP_06253537.1| putative esterase [Prevotella copri DSM 18205]
gi|281403362|gb|EFB34042.1| putative esterase [Prevotella copri DSM 18205]
Length = 627
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 112/286 (39%), Gaps = 49/286 (17%)
Query: 5 LLCLILVSEAWPVKCQYQQQ---QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ 61
+ CL ++++ K Q + Q+ + GQSNM G + + RT V P+ Q
Sbjct: 4 MACLPMMAQKTGSKVQEKPDPNFQIYLCFGQSNMEGNAAIEDIDRTG-------VNPRFQ 56
Query: 62 PNPSILRLTA---KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVP 118
++ A K +W A P + G+ P F ++ +P+ +G +
Sbjct: 57 AMYAVDDEKAGWKKGQWHTAVPP-----QARPSTGLTPVDYFGRQMVDNLPDSIKVGTIT 111
Query: 119 CAIGGTNIS---------------QWRKG------SSLYEQMIQRAQVALRGGGTIRAVL 157
A+GG +I W K + Y ++I+ A++A + G I+ +L
Sbjct: 112 VAVGGASIDLFDKRTYKAYLKKQPDWMKNFASQYNGNPYARLIELAKIA-KKQGVIKGIL 170
Query: 158 WYQGESDTVNLEDAKLYKERSDMFFTDLRSDL-----QSPLLPIIRVALASGEGPFIEIV 212
+QGE+ N DA + R + D+ DL PLL V G + I
Sbjct: 171 LHQGET---NNGDAN-WPNRVKTVYNDILKDLNLKAEDVPLLVGETVQKDMGGKCWAHIA 226
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEAL 258
++ +P + + G P DGLH + + ++N L
Sbjct: 227 IVDDIAKTIPTAHVISSKGCPQRGDGLHFIAESYRTMGKRYANMML 272
>gi|419108303|ref|ZP_13653408.1| hypothetical protein ECDEC4F_1133, partial [Escherichia coli DEC4F]
gi|377965250|gb|EHV28674.1| hypothetical protein ECDEC4F_1133, partial [Escherichia coli DEC4F]
Length = 302
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425200034|ref|ZP_18596338.1| hypothetical protein ECNE037_3205, partial [Escherichia coli NE037]
gi|408117194|gb|EKH48415.1| hypothetical protein ECNE037_3205, partial [Escherichia coli NE037]
Length = 255
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|424563038|ref|ZP_18004107.1| hypothetical protein ECEC4437_2428, partial [Escherichia coli
EC4437]
gi|390897054|gb|EIP56406.1| hypothetical protein ECEC4437_2428, partial [Escherichia coli
EC4437]
Length = 213
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 23 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 81
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 82 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 141
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 142 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 194
>gi|416831589|ref|ZP_11899115.1| hypothetical protein ECOSU61_19540, partial [Escherichia coli
O157:H7 str. LSU-61]
gi|320667451|gb|EFX34396.1| hypothetical protein ECOSU61_19540 [Escherichia coli O157:H7 str.
LSU-61]
Length = 255
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|424127942|ref|ZP_17860929.1| hypothetical protein ECPA9_2450, partial [Escherichia coli PA9]
gi|424153148|ref|ZP_17884171.1| hypothetical protein ECPA24_2253, partial [Escherichia coli PA24]
gi|424569114|ref|ZP_18009776.1| hypothetical protein ECEC4448_2323, partial [Escherichia coli
EC4448]
gi|425317164|ref|ZP_18706024.1| hypothetical protein ECEC1736_2284, partial [Escherichia coli
EC1736]
gi|390685997|gb|EIN61420.1| hypothetical protein ECPA9_2450, partial [Escherichia coli PA9]
gi|390727959|gb|EIO00341.1| hypothetical protein ECPA24_2253, partial [Escherichia coli PA24]
gi|390901483|gb|EIP60662.1| hypothetical protein ECEC4448_2323, partial [Escherichia coli
EC4448]
gi|408241743|gb|EKI64371.1| hypothetical protein ECEC1736_2284, partial [Escherichia coli
EC1736]
Length = 259
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417589230|ref|ZP_12239983.1| hypothetical protein EC253486_5483 [Escherichia coli 2534-86]
gi|345351349|gb|EGW83611.1| hypothetical protein EC253486_5483 [Escherichia coli 2534-86]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424767957|ref|ZP_18195256.1| hypothetical protein CFSAN001632_01435, partial [Escherichia coli
O111:H8 str. CFSAN001632]
gi|421946949|gb|EKU04048.1| hypothetical protein CFSAN001632_01435, partial [Escherichia coli
O111:H8 str. CFSAN001632]
Length = 254
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417295850|ref|ZP_12083097.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386259294|gb|EIJ14768.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 622
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG------GVTN--DTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G + D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPGSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417251054|ref|ZP_12042820.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|386218818|gb|EII35300.1| PF08410 domain protein [Escherichia coli 4.0967]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417194062|ref|ZP_12015483.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|386189704|gb|EIH78458.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417179545|ref|ZP_12007535.1| PF08410 domain protein [Escherichia coli 93.0624]
gi|386186207|gb|EIH68924.1| PF08410 domain protein [Escherichia coli 93.0624]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425131567|ref|ZP_18532491.1| hypothetical protein EC82524_2251, partial [Escherichia coli
8.2524]
gi|408583719|gb|EKK58819.1| hypothetical protein EC82524_2251, partial [Escherichia coli
8.2524]
Length = 255
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417276453|ref|ZP_12063783.1| PF08410 domain protein [Escherichia coli 3.2303]
gi|386240923|gb|EII77843.1| PF08410 domain protein [Escherichia coli 3.2303]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419338246|ref|ZP_13879736.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
gi|378193775|gb|EHX54301.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
Length = 455
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|387507476|ref|YP_006159732.1| hypothetical protein ECO55CA74_12995 [Escherichia coli O55:H7 str.
RM12579]
gi|374359470|gb|AEZ41177.1| hypothetical protein ECO55CA74_12995 [Escherichia coli O55:H7 str.
RM12579]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419893483|ref|ZP_14413466.1| hypothetical protein ECO9574_16001, partial [Escherichia coli
O111:H8 str. CVM9574]
gi|388367217|gb|EIL30908.1| hypothetical protein ECO9574_16001, partial [Escherichia coli
O111:H8 str. CVM9574]
Length = 254
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|416331102|ref|ZP_11669847.1| hypothetical protein ECF_04834 [Escherichia coli O157:H7 str. 1125]
gi|326338866|gb|EGD62683.1| hypothetical protein ECF_04834 [Escherichia coli O157:H7 str. 1125]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|416810473|ref|ZP_11889362.1| hypothetical protein ECO7815_21982, partial [Escherichia coli
O55:H7 str. 3256-97]
gi|320656851|gb|EFX24717.1| hypothetical protein ECO7815_21982 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
Length = 256
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|298481662|ref|ZP_06999853.1| hypothetical protein HMPREF0106_02110 [Bacteroides sp. D22]
gi|298272203|gb|EFI13773.1| hypothetical protein HMPREF0106_02110 [Bacteroides sp. D22]
Length = 641
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/266 (23%), Positives = 105/266 (39%), Gaps = 51/266 (19%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ-----PNPSILRLTAKLKWVLAHE 80
+ + GQSNM G D +V + Q N + R+ K +W A
Sbjct: 26 IYLCLGQSNMEGNARYE--------AQDTLVDARFQVLAAVDNKELGRV--KGEWYPARA 75
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK--------- 131
PL G+ P F ++ +P IG+V AIGG I ++K
Sbjct: 76 PL-----CRPNTGLTPADYFGRTLVENLPPHVRIGVVHVAIGGCRIELFQKDKCEEYIKT 130
Query: 132 ------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
+ Y ++++ A++A + G I+ +L +QGES+T + E + K D
Sbjct: 131 APDWMVNTLKEYDNDPYTRLVEMARIAQK-SGVIKGILLHQGESNTGDKEWPQKVKSVYD 189
Query: 180 MFFTDLRSDLQSPLLPIIRVALASGEGPFI-----EIVRKAQLSSDLPNVRCVDAMGLPL 234
DL LQ+ +P+I + + + + E++ A L + N V + GL
Sbjct: 190 NLLADLH--LQADEVPLIAGEVVNADHGGVCAGMNEVI--AMLPQVIKNCAIVSSKGLSC 245
Query: 235 EPDGLHLTTPAQGSTLNSWSNEALRV 260
PD LH ++ +AL +
Sbjct: 246 APDHLHFDAAGYRVLGRRYAAQALHL 271
>gi|420314976|ref|ZP_14816862.1| hypothetical protein ECEC1734_2187 [Escherichia coli EC1734]
gi|390909405|gb|EIP68190.1| hypothetical protein ECEC1734_2187 [Escherichia coli EC1734]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417123911|ref|ZP_11972821.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386147302|gb|EIG93747.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 455
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGTACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419052331|ref|ZP_13599201.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
gi|377892671|gb|EHU57115.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|291282304|ref|YP_003499122.1| hypothetical protein G2583_1556 [Escherichia coli O55:H7 str.
CB9615]
gi|416816093|ref|ZP_11892364.1| hypothetical protein ECO7815_04531 [Escherichia coli O55:H7 str.
3256-97]
gi|290762177|gb|ADD56138.1| hypothetical protein G2583_1556 [Escherichia coli O55:H7 str.
CB9615]
gi|320653653|gb|EFX21737.1| hypothetical protein ECO7815_04531 [Escherichia coli O55:H7 str.
3256-97 TW 07815]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|415779556|ref|ZP_11490237.1| conserved hypothetical protein [Escherichia coli 3431]
gi|315614767|gb|EFU95406.1| conserved hypothetical protein [Escherichia coli 3431]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|193065605|ref|ZP_03046672.1| YjhS [Escherichia coli E22]
gi|194430189|ref|ZP_03062689.1| YjhS [Escherichia coli B171]
gi|417174410|ref|ZP_12004206.1| PF08410 domain protein [Escherichia coli 3.2608]
gi|419327743|ref|ZP_13869372.1| hypothetical protein ECDEC12C_0951 [Escherichia coli DEC12C]
gi|419333172|ref|ZP_13874731.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
gi|192926790|gb|EDV81417.1| YjhS [Escherichia coli E22]
gi|194411770|gb|EDX28092.1| YjhS [Escherichia coli B171]
gi|378175746|gb|EHX36561.1| hypothetical protein ECDEC12C_0951 [Escherichia coli DEC12C]
gi|378190369|gb|EHX50954.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
gi|386177102|gb|EIH54581.1| PF08410 domain protein [Escherichia coli 3.2608]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|15832000|ref|NP_310773.1| hypothetical protein ECs2746 [Escherichia coli O157:H7 str. Sakai]
gi|168757639|ref|ZP_02782646.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|168763872|ref|ZP_02788879.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|168770260|ref|ZP_02795267.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|168777477|ref|ZP_02802484.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|168789372|ref|ZP_02814379.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|195939780|ref|ZP_03085162.1| hypothetical protein EscherichcoliO157_25835 [Escherichia coli
O157:H7 str. EC4024]
gi|208810353|ref|ZP_03252229.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208816991|ref|ZP_03258111.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|209397280|ref|YP_002271122.1| hypothetical protein ECH74115_2790 [Escherichia coli O157:H7 str.
EC4115]
gi|217329598|ref|ZP_03445677.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254793659|ref|YP_003078496.1| hypothetical protein ECSP_2614 [Escherichia coli O157:H7 str.
TW14359]
gi|387883103|ref|YP_006313405.1| hypothetical protein CDCO157_2534 [Escherichia coli Xuzhou21]
gi|416312051|ref|ZP_11657252.1| hypothetical protein ECoA_02982 [Escherichia coli O157:H7 str.
1044]
gi|416321355|ref|ZP_11663410.1| hypothetical protein ECoD_03723 [Escherichia coli O157:H7 str.
EC1212]
gi|13362214|dbj|BAB36169.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|187767297|gb|EDU31141.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|189355392|gb|EDU73811.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|189360791|gb|EDU79210.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|189366020|gb|EDU84436.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|189371048|gb|EDU89464.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|208724869|gb|EDZ74576.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208731334|gb|EDZ80023.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|209158680|gb|ACI36113.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217317366|gb|EEC25795.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254593059|gb|ACT72420.1| hypothetical protein ECSP_2614 [Escherichia coli O157:H7 str.
TW14359]
gi|320189669|gb|EFW64326.1| hypothetical protein ECoD_03723 [Escherichia coli O157:H7 str.
EC1212]
gi|326341918|gb|EGD65699.1| hypothetical protein ECoA_02982 [Escherichia coli O157:H7 str.
1044]
gi|386796561|gb|AFJ29595.1| hypothetical protein CDCO157_2534 [Escherichia coli Xuzhou21]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|422819186|ref|ZP_16867397.1| hypothetical protein ESMG_03709 [Escherichia coli M919]
gi|385537283|gb|EIF84161.1| hypothetical protein ESMG_03709 [Escherichia coli M919]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419119896|ref|ZP_13664873.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
gi|377970449|gb|EHV33809.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419081092|ref|ZP_13626545.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
gi|420304486|ref|ZP_14806491.1| hypothetical protein ECTW10119_3138 [Escherichia coli TW10119]
gi|377927162|gb|EHU91083.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
gi|390816579|gb|EIO83058.1| hypothetical protein ECTW10119_3138 [Escherichia coli TW10119]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419050872|ref|ZP_13597757.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
gi|377896290|gb|EHU60688.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|421824314|ref|ZP_16259702.1| hypothetical protein ECFRIK920_2726, partial [Escherichia coli
FRIK920]
gi|408070145|gb|EKH04516.1| hypothetical protein ECFRIK920_2726, partial [Escherichia coli
FRIK920]
Length = 451
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|420294073|ref|ZP_14796188.1| hypothetical protein ECTW11039_4222 [Escherichia coli TW11039]
gi|390795687|gb|EIO62971.1| hypothetical protein ECTW11039_4222 [Escherichia coli TW11039]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|15831215|ref|NP_309988.1| hypothetical protein ECs1961 [Escherichia coli O157:H7 str. Sakai]
gi|168763156|ref|ZP_02788163.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|217329058|ref|ZP_03445138.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|416311239|ref|ZP_11656927.1| hypothetical protein ECoA_02630 [Escherichia coli O157:H7 str.
1044]
gi|419044220|ref|ZP_13591189.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
gi|419056371|ref|ZP_13603207.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
gi|13361426|dbj|BAB35384.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|189366633|gb|EDU85049.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|217317497|gb|EEC25925.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|326343195|gb|EGD66962.1| hypothetical protein ECoA_02630 [Escherichia coli O157:H7 str.
1044]
gi|377899174|gb|EHU63525.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
gi|377910196|gb|EHU74389.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|387506409|ref|YP_006158665.1| hypothetical protein ECO55CA74_07570 [Escherichia coli O55:H7 str.
RM12579]
gi|419114252|ref|ZP_13659281.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5A]
gi|419125446|ref|ZP_13670341.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
gi|419131117|ref|ZP_13675964.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|419136013|ref|ZP_13680816.1| hypothetical protein ECDEC5E_1507 [Escherichia coli DEC5E]
gi|374358403|gb|AEZ40110.1| hypothetical protein ECO55CA74_07570 [Escherichia coli O55:H7 str.
RM12579]
gi|377963953|gb|EHV27393.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5A]
gi|377977711|gb|EHV40994.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
gi|377979688|gb|EHV42965.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|377986037|gb|EHV49242.1| hypothetical protein ECDEC5E_1507 [Escherichia coli DEC5E]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|209427766|ref|YP_002274178.1| hypothetical protein YYZ_gp42 [Enterobacteria phage YYZ-2008]
gi|208970834|gb|ACI32378.1| conserved hypothetical protein [Escherichia coli]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GKFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQXALFTAMLTQFRADL 261
>gi|415819205|ref|ZP_11508682.1| hypothetical protein ECOK1180_1402 [Escherichia coli OK1180]
gi|323179809|gb|EFZ65369.1| hypothetical protein ECOK1180_1402 [Escherichia coli OK1180]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|420390203|ref|ZP_14889471.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
gi|391314527|gb|EIQ72077.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424524738|ref|ZP_17968739.1| hypothetical protein ECEC4421_1169, partial [Escherichia coli
EC4421]
gi|390857191|gb|EIP19648.1| hypothetical protein ECEC4421_1169, partial [Escherichia coli
EC4421]
Length = 253
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417171497|ref|ZP_12001825.1| PF08410 domain protein [Escherichia coli 3.2608]
gi|386180767|gb|EIH58238.1| PF08410 domain protein [Escherichia coli 3.2608]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419196644|ref|ZP_13740042.1| hypothetical protein ECDEC8A_1748 [Escherichia coli DEC8A]
gi|378049960|gb|EHW12296.1| hypothetical protein ECDEC8A_1748 [Escherichia coli DEC8A]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419086745|ref|ZP_13632112.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|420269916|ref|ZP_14772285.1| hypothetical protein ECPA22_2841 [Escherichia coli PA22]
gi|377932002|gb|EHU95858.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|390714924|gb|EIN87793.1| hypothetical protein ECPA22_2841 [Escherichia coli PA22]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260868059|ref|YP_003234461.1| hypothetical protein ECO111_2033 [Escherichia coli O111:H- str.
11128]
gi|415817304|ref|ZP_11507472.1| hypothetical protein ECOK1180_0164 [Escherichia coli OK1180]
gi|257764415|dbj|BAI35910.1| hypothetical protein ECO111_2033 [Escherichia coli O111:H- str.
11128]
gi|323181039|gb|EFZ66576.1| hypothetical protein ECOK1180_0164 [Escherichia coli OK1180]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424505023|ref|ZP_17951774.1| hypothetical protein ECEC4196_1115, partial [Escherichia coli
EC4196]
gi|390838695|gb|EIP02895.1| hypothetical protein ECEC4196_1115, partial [Escherichia coli
EC4196]
Length = 256
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419266160|ref|ZP_13808534.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|378115588|gb|EHW77124.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
Length = 455
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260844545|ref|YP_003222323.1| hypothetical protein ECO103_2404, partial [Escherichia coli O103:H2
str. 12009]
gi|257759692|dbj|BAI31189.1| hypothetical protein ECO103_2404 [Escherichia coli O103:H2 str.
12009]
Length = 474
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417195296|ref|ZP_12015710.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|386189338|gb|EIH78104.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 601
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419056895|ref|ZP_13603719.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
gi|377907892|gb|EHU72114.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260855354|ref|YP_003229245.1| hypothetical protein ECO26_2254 [Escherichia coli O26:H11 str.
11368]
gi|257754003|dbj|BAI25505.1| hypothetical protein ECO26_2254 [Escherichia coli O26:H11 str.
11368]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424536915|ref|ZP_17980155.1| hypothetical protein ECEC4013_1393, partial [Escherichia coli
EC4013]
gi|390874506|gb|EIP35621.1| hypothetical protein ECEC4013_1393, partial [Escherichia coli
EC4013]
Length = 249
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 58 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 116
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 117 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 176
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 177 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 229
>gi|419289201|ref|ZP_13831299.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
gi|378133075|gb|EHW94423.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
Length = 616
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417756034|ref|ZP_12404117.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2B]
gi|418999599|ref|ZP_13547170.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1A]
gi|419016307|ref|ZP_13563637.1| hypothetical protein ECDEC1D_5231 [Escherichia coli DEC1D]
gi|419026688|ref|ZP_13573895.1| hypothetical protein ECDEC2A_4887 [Escherichia coli DEC2A]
gi|419034938|ref|ZP_13582028.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2D]
gi|377838342|gb|EHU03463.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1A]
gi|377852295|gb|EHU17222.1| hypothetical protein ECDEC1D_5231 [Escherichia coli DEC1D]
gi|377856958|gb|EHU21815.1| hypothetical protein ECDEC2A_4887 [Escherichia coli DEC2A]
gi|377875378|gb|EHU39989.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2B]
gi|377881255|gb|EHU45817.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2D]
Length = 617
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425206503|ref|ZP_18602369.1| hypothetical protein ECFRIK2001_3289, partial [Escherichia coli
FRIK2001]
gi|408123332|gb|EKH54107.1| hypothetical protein ECFRIK2001_3289, partial [Escherichia coli
FRIK2001]
Length = 252
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417289318|ref|ZP_12076603.1| PF08410 domain protein [Escherichia coli TW07793]
gi|386248110|gb|EII94283.1| PF08410 domain protein [Escherichia coli TW07793]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419315674|ref|ZP_13857499.1| hypothetical protein ECDEC12A_0976 [Escherichia coli DEC12A]
gi|378174128|gb|EHX34956.1| hypothetical protein ECDEC12A_0976 [Escherichia coli DEC12A]
Length = 453
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419215109|ref|ZP_13758127.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8D]
gi|378065850|gb|EHW27992.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8D]
Length = 487
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|9955659|emb|CAC05558.1| unnamed protein product [Escherichia coli]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260843388|ref|YP_003221166.1| hypothetical protein ECO103_1194 [Escherichia coli O103:H2 str.
12009]
gi|260855073|ref|YP_003228964.1| hypothetical protein ECO26_1946 [Escherichia coli O26:H11 str.
11368]
gi|260855733|ref|YP_003229624.1| hypothetical protein ECO26_2643 [Escherichia coli O26:H11 str.
11368]
gi|419215117|ref|ZP_13758134.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8D]
gi|257753722|dbj|BAI25224.1| hypothetical protein ECO26_1946 [Escherichia coli O26:H11 str.
11368]
gi|257754382|dbj|BAI25884.1| hypothetical protein ECO26_2643 [Escherichia coli O26:H11 str.
11368]
gi|257758535|dbj|BAI30032.1| hypothetical protein ECO103_1194 [Escherichia coli O103:H2 str.
12009]
gi|378065430|gb|EHW27576.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8D]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425277760|ref|ZP_18669032.1| hypothetical protein ECARS42123_1876, partial [Escherichia coli
ARS4.2123]
gi|408203551|gb|EKI28592.1| hypothetical protein ECARS42123_1876, partial [Escherichia coli
ARS4.2123]
Length = 535
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424556699|ref|ZP_17998185.1| hypothetical protein ECEC4436_2280, partial [Escherichia coli
EC4436]
gi|390885578|gb|EIP45794.1| hypothetical protein ECEC4436_2280, partial [Escherichia coli
EC4436]
Length = 239
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419335648|ref|ZP_13877171.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
gi|378180940|gb|EHX41619.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|168751765|ref|ZP_02776787.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|168758010|ref|ZP_02783017.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|168768888|ref|ZP_02793895.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|168774191|ref|ZP_02799198.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|168781553|ref|ZP_02806560.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|168790255|ref|ZP_02815262.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|195937064|ref|ZP_03082446.1| hypothetical protein EscherichcoliO157_11512 [Escherichia coli
O157:H7 str. EC4024]
gi|208811075|ref|ZP_03252908.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208820207|ref|ZP_03260527.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399696|ref|YP_002270570.1| hypothetical protein ECH74115_2188 [Escherichia coli O157:H7 str.
EC4115]
gi|217329790|ref|ZP_03445867.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254793116|ref|YP_003077953.1| hypothetical protein ECSP_2056 [Escherichia coli O157:H7 str.
TW14359]
gi|416315471|ref|ZP_11659360.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|416320909|ref|ZP_11663209.1| hypothetical protein ECoD_03515 [Escherichia coli O157:H7 str.
EC1212]
gi|416331655|ref|ZP_11669947.1| hypothetical protein ECF_04942 [Escherichia coli O157:H7 str. 1125]
gi|419103572|ref|ZP_13648724.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
gi|187770168|gb|EDU34012.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|188014247|gb|EDU52369.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|189000903|gb|EDU69889.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|189355085|gb|EDU73504.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|189361940|gb|EDU80359.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|189370258|gb|EDU88674.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|208724581|gb|EDZ74289.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208740330|gb|EDZ88012.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209161096|gb|ACI38529.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217317209|gb|EEC25640.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254592516|gb|ACT71877.1| hypothetical protein ECSP_2056 [Escherichia coli O157:H7 str.
TW14359]
gi|320189797|gb|EFW64451.1| hypothetical protein ECoD_03515 [Escherichia coli O157:H7 str.
EC1212]
gi|326337980|gb|EGD61813.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|326338676|gb|EGD62499.1| hypothetical protein ECF_04942 [Escherichia coli O157:H7 str. 1125]
gi|377951856|gb|EHV15466.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417126270|ref|ZP_11973995.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386145314|gb|EIG91774.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419097733|ref|ZP_13642960.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
gi|377947093|gb|EHV10761.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424770900|ref|ZP_18198076.1| hypothetical protein CFSAN001632_11962, partial [Escherichia coli
O111:H8 str. CFSAN001632]
gi|421941408|gb|EKT98807.1| hypothetical protein CFSAN001632_11962, partial [Escherichia coli
O111:H8 str. CFSAN001632]
Length = 474
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260854635|ref|YP_003228526.1| hypothetical protein ECO26_1485 [Escherichia coli O26:H11 str.
11368]
gi|257753284|dbj|BAI24786.1| hypothetical protein ECO26_1485 [Escherichia coli O26:H11 str.
11368]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|237719761|ref|ZP_04550242.1| glycoside hydrolase family 43 protein [Bacteroides sp. 2_2_4]
gi|229451030|gb|EEO56821.1| glycoside hydrolase family 43 protein [Bacteroides sp. 2_2_4]
Length = 641
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 62/266 (23%), Positives = 105/266 (39%), Gaps = 51/266 (19%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQ-----PNPSILRLTAKLKWVLAHE 80
+ + GQSNM G D +V + Q N + R+ K +W A
Sbjct: 26 IYLCLGQSNMEGNARYE--------AQDTLVDARFQVLAAVDNKELGRV--KGEWYPARA 75
Query: 81 PLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK--------- 131
PL G+ P F ++ +P IG+V AIGG I ++K
Sbjct: 76 PL-----CRPNTGLTPADYFGRTLVENLPPHVRIGVVHVAIGGCRIELFQKDKCEEYIKT 130
Query: 132 ------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
+ Y ++++ A++A + G I+ +L +QGES+T + E + K D
Sbjct: 131 APDWMVNTLKEYDNDPYTRLVKMARIAQK-SGVIKGILLHQGESNTGDKEWPQKVKSVYD 189
Query: 180 MFFTDLRSDLQSPLLPIIRVALASGEGPFI-----EIVRKAQLSSDLPNVRCVDAMGLPL 234
DL LQ+ +P+I + + + + E++ A L + N V + GL
Sbjct: 190 NLLADLH--LQADEVPLIAGEVVNADHGGVCAGMNEVI--AMLPQVIKNCAIVSSKGLSC 245
Query: 235 EPDGLHLTTPAQGSTLNSWSNEALRV 260
PD LH ++ +AL +
Sbjct: 246 APDHLHFDAAGYRVLGRRYAAQALHL 271
>gi|416820986|ref|ZP_11893843.1| YjhS, partial [Escherichia coli O55:H7 str. USDA 5905]
gi|320662610|gb|EFX29980.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
Length = 460
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|419253812|ref|ZP_13796346.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
gi|378104813|gb|EHW66470.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419021233|ref|ZP_13568525.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1E]
gi|377855351|gb|EHU20223.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1E]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|193071339|ref|ZP_03052256.1| YjhS [Escherichia coli E110019]
gi|192955323|gb|EDV85809.1| YjhS [Escherichia coli E110019]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419304026|ref|ZP_13845969.1| hypothetical protein ECDEC11C_5974 [Escherichia coli DEC11C]
gi|378139171|gb|EHX00414.1| hypothetical protein ECDEC11C_5974 [Escherichia coli DEC11C]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GKFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|416788080|ref|ZP_11879679.1| hypothetical protein ECO9389_17253, partial [Escherichia coli
O157:H- str. 493-89]
gi|320646058|gb|EFX15026.1| hypothetical protein ECO9389_17253 [Escherichia coli O157:H- str.
493-89]
Length = 463
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425204649|ref|ZP_18600757.1| hypothetical protein ECFRIK2001_1629, partial [Escherichia coli
FRIK2001]
gi|408130719|gb|EKH60826.1| hypothetical protein ECFRIK2001_1629, partial [Escherichia coli
FRIK2001]
Length = 309
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419086187|ref|ZP_13631561.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|377934170|gb|EHU98006.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|373952817|ref|ZP_09612777.1| protein of unknown function DUF303 acetylesterase [Mucilaginibacter
paludis DSM 18603]
gi|373889417|gb|EHQ25314.1| protein of unknown function DUF303 acetylesterase [Mucilaginibacter
paludis DSM 18603]
Length = 273
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 102/259 (39%), Gaps = 37/259 (14%)
Query: 8 LILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSIL 67
+ILV + Q + + + GQSNM G T + + C P I
Sbjct: 1 MILVLLSKGAFSQDKNFYIFLCFGQSNMEGNAKFEPQDTTVDQRFKVLQAVDC---PEIG 57
Query: 68 RLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS 127
R+ K W A PL G+ P F ++ +P IG++ ++ G I
Sbjct: 58 RV--KNNWYTAVPPL-----CRCKTGITPADYFGRTLVANLPKKIRIGIINVSVAGAKIE 110
Query: 128 ---------------QWRK------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTV 166
W K + Y ++++ A++A + G I+ VL +QGES+T
Sbjct: 111 VFGQDTYQSYSATAPDWMKSMIGEYNGNPYARLLELAKLAQKSG-VIKGVLLHQGESNTN 169
Query: 167 NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE-GPFIEIVRK--AQLSSDLPN 223
+ K K D DL +L++ +P++ + + + G + K A L +PN
Sbjct: 170 DTLWTKKVKAVYDNLVKDL--NLKAKSVPLLAGEVVNADQGGICSSMNKIIATLPKTIPN 227
Query: 224 VRCVDAMGLPLEPDGLHLT 242
+ + G P D LH T
Sbjct: 228 TYVISSAGCPCSADHLHFT 246
>gi|419230370|ref|ZP_13773180.1| hypothetical protein ECDEC9B_5452, partial [Escherichia coli DEC9B]
gi|378084523|gb|EHW46426.1| hypothetical protein ECDEC9B_5452, partial [Escherichia coli DEC9B]
Length = 247
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419208905|ref|ZP_13752011.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8C]
gi|378057678|gb|EHW19902.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8C]
Length = 474
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/203 (28%), Positives = 81/203 (39%), Gaps = 46/203 (22%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPPQCQ--------PNPSILRL 69
+++LAGQSN MA G+ D R +L V P + P LR
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYNDIIPADHCLRD 125
Query: 70 TAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ- 128
+ L H AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 126 VQDMS-TLNHP--KADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQG 182
Query: 129 -----------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLE 169
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 183 AEGTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM---- 238
Query: 170 DAKLYKERSDMF---FTDLRSDL 189
A + ++ +F T R+DL
Sbjct: 239 SAATHAQQPALFTAMLTQFRADL 261
>gi|420392589|ref|ZP_14891837.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
gi|391311188|gb|EIQ68824.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|424134087|ref|ZP_17866644.1| hypothetical protein ECPA10_2434, partial [Escherichia coli PA10]
gi|425311240|ref|ZP_18700493.1| hypothetical protein ECEC1735_2395, partial [Escherichia coli
EC1735]
gi|425323271|ref|ZP_18711712.1| hypothetical protein ECEC1737_2295, partial [Escherichia coli
EC1737]
gi|425347828|ref|ZP_18734407.1| hypothetical protein ECEC1849_2204, partial [Escherichia coli
EC1849]
gi|390702281|gb|EIN76461.1| hypothetical protein ECPA10_2434, partial [Escherichia coli PA10]
gi|408230501|gb|EKI53892.1| hypothetical protein ECEC1735_2395, partial [Escherichia coli
EC1735]
gi|408245698|gb|EKI68062.1| hypothetical protein ECEC1737_2295, partial [Escherichia coli
EC1737]
gi|408268328|gb|EKI88710.1| hypothetical protein ECEC1849_2204, partial [Escherichia coli
EC1849]
Length = 238
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|377805722|gb|AFB75447.1| hypothetical protein PP_44 [Escherichia coli]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|419260560|ref|ZP_13802993.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
gi|378110244|gb|EHW71840.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
Length = 615
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419055899|ref|ZP_13602748.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3C]
gi|377912409|gb|EHU76570.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3C]
Length = 320
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|415842808|ref|ZP_11523272.1| hypothetical protein ECRN5871_5071 [Escherichia coli RN587/1]
gi|323186681|gb|EFZ72006.1| hypothetical protein ECRN5871_5071 [Escherichia coli RN587/1]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|421824714|ref|ZP_16260084.1| hypothetical protein ECFRIK920_3120 [Escherichia coli FRIK920]
gi|408068581|gb|EKH03000.1| hypothetical protein ECFRIK920_3120 [Escherichia coli FRIK920]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GKFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419293596|ref|ZP_13835655.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC11B]
gi|378145793|gb|EHX06949.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC11B]
Length = 603
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419294234|ref|ZP_13836283.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11B]
gi|378143670|gb|EHX04858.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11B]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417251578|ref|ZP_12043343.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|419289205|ref|ZP_13831302.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
gi|419294111|ref|ZP_13836163.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11B]
gi|378132881|gb|EHW94232.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
gi|378144215|gb|EHX05390.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11B]
gi|386218427|gb|EII34910.1| PF08410 domain protein [Escherichia coli 4.0967]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419042536|ref|ZP_13589546.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
gi|377885158|gb|EHU49661.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
Length = 603
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|260854941|ref|YP_003228832.1| hypothetical protein ECO26_1798 [Escherichia coli O26:H11 str.
11368]
gi|417297657|ref|ZP_12084901.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|419208879|ref|ZP_13751986.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
gi|257753590|dbj|BAI25092.1| hypothetical protein ECO26_1798 [Escherichia coli O26:H11 str.
11368]
gi|378057988|gb|EHW20209.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
gi|386258869|gb|EIJ14346.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 617
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419255065|ref|ZP_13797587.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
gi|378101229|gb|EHW62916.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
Length = 581
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|312965241|ref|ZP_07779477.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|312290125|gb|EFR18009.1| conserved hypothetical protein [Escherichia coli 2362-75]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|420298025|ref|ZP_14800090.1| hypothetical protein ECTW09109_2486 [Escherichia coli TW09109]
gi|390808650|gb|EIO75481.1| hypothetical protein ECTW09109_2486 [Escherichia coli TW09109]
Length = 623
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417282766|ref|ZP_12070065.1| PF08410 domain protein [Escherichia coli 3003]
gi|386244399|gb|EII86130.1| PF08410 domain protein [Escherichia coli 3003]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417128555|ref|ZP_11975425.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386143839|gb|EIG90314.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 616
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPNSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424767136|ref|ZP_18194471.1| hypothetical protein CFSAN001630_29498, partial [Escherichia coli
O111:H11 str. CFSAN001630]
gi|421932939|gb|EKT90735.1| hypothetical protein CFSAN001630_29498, partial [Escherichia coli
O111:H11 str. CFSAN001630]
Length = 204
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 20 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 79
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 80 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 135
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 136 MLTQFRADL 144
>gi|419241182|ref|ZP_13783858.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9D]
gi|378098592|gb|EHW60326.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9D]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|291283930|ref|YP_003500748.1| YjhS [Escherichia coli O55:H7 str. CB9615]
gi|290763803|gb|ADD57764.1| YjhS [Escherichia coli O55:H7 str. CB9615]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|420303653|ref|ZP_14805668.1| hypothetical protein ECTW10119_2339 [Escherichia coli TW10119]
gi|390817715|gb|EIO84135.1| hypothetical protein ECTW10119_2339 [Escherichia coli TW10119]
Length = 455
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|260867516|ref|YP_003233918.1| hypothetical protein ECO111_1432 [Escherichia coli O111:H- str.
11128]
gi|419202380|ref|ZP_13745595.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
gi|257763872|dbj|BAI35367.1| hypothetical protein ECO111_1432 [Escherichia coli O111:H- str.
11128]
gi|378054316|gb|EHW16595.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419203111|ref|ZP_13746313.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8B]
gi|378052465|gb|EHW14772.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8B]
Length = 245
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419272779|ref|ZP_13815080.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
gi|378117496|gb|EHW79010.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|193069552|ref|ZP_03050505.1| YjhS [Escherichia coli E110019]
gi|192957099|gb|EDV87549.1| YjhS [Escherichia coli E110019]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|432675076|ref|ZP_19910542.1| hypothetical protein A1YU_01616 [Escherichia coli KTE142]
gi|431214847|gb|ELF12596.1| hypothetical protein A1YU_01616 [Escherichia coli KTE142]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419092943|ref|ZP_13638233.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
gi|377943133|gb|EHV06855.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLIPRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419288874|ref|ZP_13830977.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
gi|378133950|gb|EHW95282.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
Length = 616
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417254581|ref|ZP_12046335.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|386215525|gb|EII32019.1| PF08410 domain protein [Escherichia coli 4.0967]
Length = 546
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|15831443|ref|NP_310216.1| hypothetical protein ECs2189 [Escherichia coli O157:H7 str. Sakai]
gi|387882594|ref|YP_006312896.1| hypothetical protein CDCO157_2030 [Escherichia coli Xuzhou21]
gi|13361655|dbj|BAB35612.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|386796052|gb|AFJ29086.1| hypothetical protein CDCO157_2030 [Escherichia coli Xuzhou21]
Length = 602
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|420268980|ref|ZP_14771366.1| hypothetical protein ECPA22_1946, partial [Escherichia coli PA22]
gi|390717350|gb|EIN90136.1| hypothetical protein ECPA22_1946, partial [Escherichia coli PA22]
Length = 592
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|417180608|ref|ZP_12008316.1| PF08410 domain protein [Escherichia coli 93.0624]
gi|386185963|gb|EIH68689.1| PF08410 domain protein [Escherichia coli 93.0624]
Length = 603
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GKFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424768610|ref|ZP_18195877.1| hypothetical protein CFSAN001632_03733, partial [Escherichia coli
O111:H8 str. CFSAN001632]
gi|421945874|gb|EKU03051.1| hypothetical protein CFSAN001632_03733, partial [Escherichia coli
O111:H8 str. CFSAN001632]
Length = 245
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 57 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 115
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 116 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 175
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 176 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 228
>gi|415790653|ref|ZP_11495184.1| hypothetical protein ECEPECA14_4819 [Escherichia coli EPECa14]
gi|323153274|gb|EFZ39533.1| hypothetical protein ECEPECA14_4819 [Escherichia coli EPECa14]
Length = 603
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|425267062|ref|ZP_18658770.1| hypothetical protein EC5412_2353, partial [Escherichia coli 5412]
gi|408185101|gb|EKI11357.1| hypothetical protein EC5412_2353, partial [Escherichia coli 5412]
Length = 190
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 59 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 118
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 119 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 174
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 175 MLAQFRADL 183
>gi|419884919|ref|ZP_14405779.1| hypothetical protein ECO9545_12305, partial [Escherichia coli
O111:H11 str. CVM9545]
gi|388352273|gb|EIL17402.1| hypothetical protein ECO9545_12305, partial [Escherichia coli
O111:H11 str. CVM9545]
Length = 202
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 18 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 77
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 78 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 133
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 134 MLTQFRADL 142
>gi|420390508|ref|ZP_14889775.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
gi|391314371|gb|EIQ71927.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli EPEC C342-62]
Length = 617
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSALNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTRFRADL 261
>gi|312965253|ref|ZP_07779488.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|312290089|gb|EFR17974.1| conserved hypothetical protein [Escherichia coli 2362-75]
Length = 617
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|433456987|ref|ZP_20415010.1| hypothetical protein D477_08580, partial [Arthrobacter
crystallopoietes BAB-32]
gi|432195541|gb|ELK52065.1| hypothetical protein D477_08580, partial [Arthrobacter
crystallopoietes BAB-32]
Length = 321
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 79/188 (42%), Gaps = 30/188 (15%)
Query: 25 QLIILAGQSNMAGRGGVTN---DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
+++L GQSNM G G + D R + + P K + A +P
Sbjct: 72 DVVLLLGQSNMQGAGTPYDPGLDIRMDGIDQFAGSGPHAG------------KVLPAEDP 119
Query: 82 LH--ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS-----QWRKGSS 134
LH N VGPG+ FA + P + LVP A GGT+ + W ++
Sbjct: 120 LHHVTTYLFNGAASVGPGMEFARQFWLRQPADRRVLLVPAARGGTSFAGGADYSWDPDNT 179
Query: 135 -----LYEQMIQ--RAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRS 187
L + I + +AL + A+LW+QGESD + + A+ Y+ + LR+
Sbjct: 180 TARVNLAHRAISECKTALALNPNHRLAAILWHQGESDALPGKSARWYRNKLLQLIDLLRA 239
Query: 188 DL-QSPLL 194
+ Q P L
Sbjct: 240 EFGQVPFL 247
>gi|419038552|ref|ZP_13585609.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
gi|377897881|gb|EHU62252.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
Length = 522
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|415771585|ref|ZP_11485410.1| conserved hypothetical protein [Escherichia coli 3431]
gi|315619748|gb|EFV00268.1| conserved hypothetical protein [Escherichia coli 3431]
Length = 617
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|295086433|emb|CBK67956.1| Enterochelin esterase and related enzymes [Bacteroides
xylanisolvens XB1A]
Length = 607
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 88/215 (40%), Gaps = 36/215 (16%)
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
K +W A PL G+ P F ++ +P IG+V AIGG I ++K
Sbjct: 33 KGEWYPARAPL-----CRPNTGLTPADYFGRTLVENLPPHVRIGVVHVAIGGCRIELFQK 87
Query: 132 ---------------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED 170
+ Y ++++ A++A + G I+ +L +QGES+T + E
Sbjct: 88 DKCEEYIKTAPDWMVNTLKEYDNDPYTRLVKMARIAQK-SGVIKGILLHQGESNTGDKEW 146
Query: 171 AKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFI-----EIVRKAQLSSDLPNVR 225
+ K D DL LQ+ +P+I + + + + E++ A L + N
Sbjct: 147 PQKVKSVYDNLLADLH--LQADEVPLIAGEVVNADHGGVCAGMNEVI--AMLPQVIKNCA 202
Query: 226 CVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRV 260
V + GL PD LH ++ +AL +
Sbjct: 203 IVSSKGLSCAPDHLHFDAAGYRVLGRRYAAQALHL 237
>gi|419864217|ref|ZP_14386694.1| hypothetical protein ECO9340_14607, partial [Escherichia coli
O103:H25 str. CVM9340]
gi|388340711|gb|EIL06904.1| hypothetical protein ECO9340_14607, partial [Escherichia coli
O103:H25 str. CVM9340]
Length = 228
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 57 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 115
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 116 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 175
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 176 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 228
>gi|417228816|ref|ZP_12030574.1| PF08410 domain protein [Escherichia coli 5.0959]
gi|386208151|gb|EII12656.1| PF08410 domain protein [Escherichia coli 5.0959]
Length = 620
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 69 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 127
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 128 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 187
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 188 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 243
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 244 ATHAQQPALFTAMLTQFRADL 264
>gi|417804388|ref|ZP_12451406.1| prophage protein, partial [Escherichia coli O104:H4 str. LB226692]
gi|340741033|gb|EGR75196.1| prophage protein [Escherichia coli O104:H4 str. LB226692]
Length = 139
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 15 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGPSQD 74
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 75 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 123
>gi|416342670|ref|ZP_11676834.1| hypothetical protein ECoL_01769 [Escherichia coli EC4100B]
gi|320201061|gb|EFW75645.1| hypothetical protein ECoL_01769 [Escherichia coli EC4100B]
Length = 616
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|425335593|ref|ZP_18723091.1| hypothetical protein ECEC1847_2264, partial [Escherichia coli
EC1847]
gi|408260489|gb|EKI81598.1| hypothetical protein ECEC1847_2264, partial [Escherichia coli
EC1847]
Length = 155
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 134
>gi|419006475|ref|ZP_13553929.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC1C]
gi|377850357|gb|EHU15322.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC1C]
Length = 510
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|417219206|ref|ZP_12024048.1| PF03629 domain protein [Escherichia coli JB1-95]
gi|386192968|gb|EIH87276.1| PF03629 domain protein [Escherichia coli JB1-95]
Length = 581
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 31 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 89
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 90 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 149
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 150 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 205
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 206 ATHVQQPALFTAMLTQFRADL 226
>gi|419021820|ref|ZP_13569076.1| hypothetical protein ECDEC2A_5308, partial [Escherichia coli DEC2A]
gi|377869943|gb|EHU34640.1| hypothetical protein ECDEC2A_5308, partial [Escherichia coli DEC2A]
Length = 530
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|432673740|ref|ZP_19909233.1| hypothetical protein A1YU_00298 [Escherichia coli KTE142]
gi|431217522|gb|ELF15092.1| hypothetical protein A1YU_00298 [Escherichia coli KTE142]
Length = 616
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|215485826|ref|YP_002328257.1| hypothetical protein E2348C_0688 [Escherichia coli O127:H6 str.
E2348/69]
gi|419000934|ref|ZP_13548491.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1B]
gi|215263898|emb|CAS08236.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|377853110|gb|EHU18014.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1B]
Length = 616
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|15803152|ref|NP_289184.1| hypothetical protein Z3927 [Escherichia coli O157:H7 str. EDL933]
gi|12517059|gb|AAG57742.1|AE005492_11 unknown protein encoded by prophage CP-933Y [Escherichia coli
O157:H7 str. EDL933]
Length = 390
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQXALFTAMLAQFRADL 261
>gi|424233083|ref|ZP_17889624.1| hypothetical protein ECPA25_2122, partial [Escherichia coli PA25]
gi|390727707|gb|EIO00100.1| hypothetical protein ECPA25_2122, partial [Escherichia coli PA25]
Length = 134
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|420133454|ref|ZP_14641688.1| hypothetical protein ECO9952_18343, partial [Escherichia coli
O26:H11 str. CVM9952]
gi|394425609|gb|EJE98553.1| hypothetical protein ECO9952_18343, partial [Escherichia coli
O26:H11 str. CVM9952]
Length = 231
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 33 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 92
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 93 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 148
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 149 MLTQFRADL 157
>gi|419260741|ref|ZP_13803173.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC10B]
gi|378109944|gb|EHW71544.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC10B]
Length = 210
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 124
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 125 MLTQFRADL 133
>gi|424480733|ref|ZP_17929796.1| hypothetical protein ECTW07945_2312, partial [Escherichia coli
TW07945]
gi|390797871|gb|EIO65094.1| hypothetical protein ECTW07945_2312, partial [Escherichia coli
TW07945]
Length = 130
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|419243639|ref|ZP_13786279.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC9D]
gi|378091244|gb|EHW53076.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC9D]
Length = 377
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 80/201 (39%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|260866900|ref|YP_003233302.1| hypothetical protein ECO111_0789 [Escherichia coli O111:H- str.
11128]
gi|415818830|ref|ZP_11508446.1| hypothetical protein ECOK1180_1152 [Escherichia coli OK1180]
gi|417193103|ref|ZP_12014950.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|417589255|ref|ZP_12240001.1| hypothetical protein EC253486_5506 [Escherichia coli 2534-86]
gi|257763256|dbj|BAI34751.1| hypothetical protein ECO111_0789 [Escherichia coli O111:H- str.
11128]
gi|323179988|gb|EFZ65544.1| hypothetical protein ECOK1180_1152 [Escherichia coli OK1180]
gi|345349779|gb|EGW82055.1| hypothetical protein EC253486_5506 [Escherichia coli 2534-86]
gi|386190284|gb|EIH79032.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 616
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|424512710|ref|ZP_17958645.1| yjhS, partial [Escherichia coli TW14313]
gi|390851290|gb|EIP14591.1| yjhS, partial [Escherichia coli TW14313]
Length = 156
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 134
>gi|424096683|ref|ZP_17832115.1| hypothetical protein ECFRIK1985_2494, partial [Escherichia coli
FRIK1985]
gi|390665668|gb|EIN42946.1| hypothetical protein ECFRIK1985_2494, partial [Escherichia coli
FRIK1985]
Length = 153
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 134
>gi|374598538|ref|ZP_09671540.1| protein of unknown function DUF303 acetylesterase [Myroides
odoratus DSM 2801]
gi|423323221|ref|ZP_17301063.1| hypothetical protein HMPREF9716_00420 [Myroides odoratimimus CIP
103059]
gi|373910008|gb|EHQ41857.1| protein of unknown function DUF303 acetylesterase [Myroides
odoratus DSM 2801]
gi|404609687|gb|EKB09052.1| hypothetical protein HMPREF9716_00420 [Myroides odoratimimus CIP
103059]
Length = 374
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 68/163 (41%), Gaps = 24/163 (14%)
Query: 48 NKLTWDGIVPPQCQPNPSILRLTAKLKWVLA--HEPLHADIDVNKTNGVGPGLPF----- 100
N+L++D I+ L A+L V A +E D + N GP L F
Sbjct: 32 NELSYDVILVAGQSNTHYGYPLNAQLDTVNARVYELKRHDSKNFRINPAGPVLDFWTRQT 91
Query: 101 -ANAVLTKVPNFGV----------IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRG 149
N+ T N + + ++PC G++I+ W +G Y ++R L
Sbjct: 92 NRNSFATTFSNLYINTYLKDNNRKVLIIPCGYAGSSITDWTQGKRFYNDAMERVNYVLDN 151
Query: 150 --GGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQ 190
G + A+LW+QGE++ Y+ D TD+R+D+
Sbjct: 152 VPGSKLVAILWHQGEANV----GWNPYQTTLDGMITDMRNDVH 190
>gi|415782021|ref|ZP_11491342.1| hypothetical protein ECEPECA14_0891, partial [Escherichia coli
EPECa14]
gi|323157232|gb|EFZ43353.1| hypothetical protein ECEPECA14_0891 [Escherichia coli EPECa14]
Length = 377
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|424229047|ref|ZP_17889605.1| hypothetical protein ECPA25_2100, partial [Escherichia coli PA25]
gi|390728058|gb|EIO00412.1| hypothetical protein ECPA25_2100, partial [Escherichia coli PA25]
Length = 129
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|416797672|ref|ZP_11883904.1| hypothetical protein ECO2687_06928, partial [Escherichia coli
O157:H- str. H 2687]
gi|320652134|gb|EFX20461.1| hypothetical protein ECO2687_06928 [Escherichia coli O157:H- str. H
2687]
Length = 394
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|187736159|ref|YP_001878271.1| hypothetical protein Amuc_1672 [Akkermansia muciniphila ATCC
BAA-835]
gi|187426211|gb|ACD05490.1| protein of unknown function DUF303 acetylesterase putative
[Akkermansia muciniphila ATCC BAA-835]
Length = 303
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 102/239 (42%), Gaps = 38/239 (15%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC 60
+F+WLLC + V A P+ + +I++ GQSN G+G V N +PP
Sbjct: 14 LFSWLLCSLTVFPAPPLHAD--EVNVILIGGQSNATGQGYVNN------------IPPCF 59
Query: 61 QPNPSI-LRLTAKLKWVLAHEPLHADIDVNKT-NGVGPGLPFANAVLTKVPNFGVIGLVP 118
+ + I L + LK E L +++ + G L A+ K P ++
Sbjct: 60 KTDKRILLYYSGSLKGTEPAEQLVPLSPASESPDRFGVELSLGTALQKKFPQ-KKWAIIK 118
Query: 119 CAIGGTNI-SQWRKGSSLYEQM----------IQRAQVALRGGG---TIRAVLWYQGESD 164
A G+N+ QW G + ++ ++ AL+ G ++A++W QGE D
Sbjct: 119 HARSGSNLFRQWNPGKTSQDKQGEEYVKLLRTVRNGMEALKKQGHAPVLKAMVWQQGEGD 178
Query: 165 T---VNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL----ASGEGPFIEIVRKAQ 216
+++A Y + +R+DL++P L I ++ A P E VR+ Q
Sbjct: 179 ARDIAGIKNALSYGANLNNLIKRIRADLEAPGLAFIYGSVLPVPALARFPGREKVRQGQ 237
>gi|81239425|gb|ABB60239.1| hypothetical protein [Escherichia coli]
Length = 344
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419320394|ref|ZP_13862149.1| hypothetical protein ECDEC12A_5754 [Escherichia coli DEC12A]
gi|419323020|ref|ZP_13864725.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
gi|419339054|ref|ZP_13880538.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
gi|378158487|gb|EHX19511.1| hypothetical protein ECDEC12A_5754 [Escherichia coli DEC12A]
gi|378167292|gb|EHX28206.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
gi|378193058|gb|EHX53604.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
Length = 616
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|419311796|ref|ZP_13853658.1| hypothetical protein ECDEC11E_2325, partial [Escherichia coli
DEC11E]
gi|378157424|gb|EHX18455.1| hypothetical protein ECDEC11E_2325, partial [Escherichia coli
DEC11E]
Length = 251
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 77/190 (40%), Gaps = 39/190 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF 181
Y ++ +F
Sbjct: 241 ATYAQQPALF 250
>gi|420104206|ref|ZP_14614943.1| hypothetical protein ECO9455_13939, partial [Escherichia coli
O111:H11 str. CVM9455]
gi|394404979|gb|EJE80281.1| hypothetical protein ECO9455_13939, partial [Escherichia coli
O111:H11 str. CVM9455]
Length = 266
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 18 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 77
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 78 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 133
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 134 MLTQFRADL 142
>gi|419236693|ref|ZP_13779440.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
gi|378089116|gb|EHW50962.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
Length = 603
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|15801265|ref|NP_287282.1| hypothetical protein Z1793 [Escherichia coli O157:H7 str. EDL933]
gi|12514704|gb|AAG55894.1|AE005323_10 unknown protein encoded by prophage CP-933N [Escherichia coli
O157:H7 str. EDL933]
Length = 617
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQXALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|417299176|ref|ZP_12086407.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386257348|gb|EIJ12838.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 624
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIVRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419259571|ref|ZP_13802020.1| hypothetical protein ECDEC10B_1160, partial [Escherichia coli
DEC10B]
gi|378115119|gb|EHW76667.1| hypothetical protein ECDEC10B_1160, partial [Escherichia coli
DEC10B]
Length = 147
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 134
>gi|417831012|ref|ZP_12477546.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella flexneri J1713]
gi|420318409|ref|ZP_14820269.1| hypothetical protein SF285071_0008 [Shigella flexneri 2850-71]
gi|335572465|gb|EGM58845.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella flexneri J1713]
gi|391255252|gb|EIQ14400.1| hypothetical protein SF285071_0008 [Shigella flexneri 2850-71]
Length = 617
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|420297011|ref|ZP_14799103.1| hypothetical protein ECTW09109_1486 [Escherichia coli TW09109]
gi|390811249|gb|EIO77973.1| hypothetical protein ECTW09109_1486 [Escherichia coli TW09109]
Length = 432
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419044653|ref|ZP_13591618.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3A]
gi|377898108|gb|EHU62470.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3A]
Length = 363
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|215486517|ref|YP_002328948.1| hypothetical protein E2348C_1409 [Escherichia coli O127:H6 str.
E2348/69]
gi|312966529|ref|ZP_07780750.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|419001637|ref|ZP_13549183.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1B]
gi|419028410|ref|ZP_13575595.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
gi|419039178|ref|ZP_13586227.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
gi|215264589|emb|CAS08957.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|312288804|gb|EFR16703.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|377851892|gb|EHU16828.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1B]
gi|377882490|gb|EHU47030.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
gi|377896268|gb|EHU60668.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
Length = 617
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|193065485|ref|ZP_03046554.1| YjhS [Escherichia coli E22]
gi|417249076|ref|ZP_12040861.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|192926890|gb|EDV81515.1| YjhS [Escherichia coli E22]
gi|386221059|gb|EII37522.1| PF08410 domain protein [Escherichia coli 4.0967]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIVRTKAALQKNQKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|424090151|ref|ZP_17826190.1| hypothetical protein ECFRIK1996_2374, partial [Escherichia coli
FRIK1996]
gi|390645878|gb|EIN25017.1| hypothetical protein ECFRIK1996_2374, partial [Escherichia coli
FRIK1996]
Length = 127
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|417295000|ref|ZP_12082256.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386261363|gb|EIJ16828.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 390
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419061783|ref|ZP_13608546.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
gi|377915046|gb|EHU79156.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
Length = 542
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 62 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 121
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 122 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 177
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 178 MLTQFRADL 186
>gi|416793656|ref|ZP_11882798.1| hypothetical protein ECO9389_10369, partial [Escherichia coli
O157:H- str. 493-89]
gi|320642686|gb|EFX11911.1| hypothetical protein ECO9389_10369 [Escherichia coli O157:H- str.
493-89]
Length = 364
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|425300338|ref|ZP_18690295.1| hypothetical protein EC07798_2205, partial [Escherichia coli 07798]
gi|408217333|gb|EKI41606.1| hypothetical protein EC07798_2205, partial [Escherichia coli 07798]
Length = 169
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD---TVNLEDAKLYKERSDM 180
W G LY+ +I R + AL+ + AV W QGE D + + L+
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDMSAATHAQQPALFTAMLKQ 131
Query: 181 FFTDL 185
F DL
Sbjct: 132 FRADL 136
>gi|419200794|ref|ZP_13744049.1| hypothetical protein ECDEC8A_5872 [Escherichia coli DEC8A]
gi|378038297|gb|EHW00813.1| hypothetical protein ECDEC8A_5872 [Escherichia coli DEC8A]
Length = 390
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|416836651|ref|ZP_11902223.1| hypothetical protein ECOSU61_17643, partial [Escherichia coli
O157:H7 str. LSU-61]
gi|320664132|gb|EFX31292.1| hypothetical protein ECOSU61_17643 [Escherichia coli O157:H7 str.
LSU-61]
Length = 382
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|417120452|ref|ZP_11970010.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386149107|gb|EIG95539.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 469
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|260842980|ref|YP_003220758.1| hypothetical protein ECO103_0770 [Escherichia coli O103:H2 str.
12009]
gi|257758127|dbj|BAI29624.1| hypothetical protein ECO103_0770 [Escherichia coli O103:H2 str.
12009]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|225352515|ref|ZP_03743538.1| hypothetical protein BIFPSEUDO_04138 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225156709|gb|EEG70103.1| hypothetical protein BIFPSEUDO_04138 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 566
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 59/129 (45%), Gaps = 11/129 (8%)
Query: 114 IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
IG++ A GGT I + +G +Y I + G + VLWYQG +D+ N A
Sbjct: 271 IGIIQTAWGGTPIRRHVQGGDIYANHIAPLE-----GFHVAGVLWYQGCNDSTNEATALA 325
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLSS----DLPNVRCVD 228
Y+ + + R LP + V LA G + + VR AQL++ L N V
Sbjct: 326 YESQMTLLINQYREVFDQDDLPFLYVQLARWPGYQYTQNVRFAQLNTLSNAGLRNASNV- 384
Query: 229 AMGLPLEPD 237
AM + L+ D
Sbjct: 385 AMTVSLDTD 393
>gi|419220038|ref|ZP_13762991.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
gi|378071890|gb|EHW33957.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
Length = 541
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 62 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 121
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 122 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 177
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 178 MLTQFRADL 186
>gi|420268165|ref|ZP_14770569.1| hypothetical protein ECPA22_1252 [Escherichia coli PA22]
gi|390719472|gb|EIN92197.1| hypothetical protein ECPA22_1252 [Escherichia coli PA22]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419241439|ref|ZP_13784096.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC9D]
gi|378096208|gb|EHW57981.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC9D]
Length = 394
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPL-HADIDVNK--TNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L H D++K VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGLYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419120849|ref|ZP_13665811.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
gi|377967927|gb|EHV31325.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
Length = 439
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419097032|ref|ZP_13642272.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
gi|377949439|gb|EHV13073.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|15830342|ref|NP_309115.1| hypothetical protein ECs1088 [Escherichia coli O157:H7 str. Sakai]
gi|168751240|ref|ZP_02776262.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|168754248|ref|ZP_02779255.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|168763162|ref|ZP_02788169.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|168780947|ref|ZP_02805954.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|168787434|ref|ZP_02812441.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|168801298|ref|ZP_02826305.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|195935187|ref|ZP_03080569.1| hypothetical protein EscherichcoliO157_01807 [Escherichia coli
O157:H7 str. EC4024]
gi|208808028|ref|ZP_03250365.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208815228|ref|ZP_03256407.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208822243|ref|ZP_03262562.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399945|ref|YP_002269663.1| hypothetical protein ECH74115_1168 [Escherichia coli O157:H7 str.
EC4115]
gi|217324289|ref|ZP_03440373.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254792197|ref|YP_003077034.1| hypothetical protein ECSP_1106 [Escherichia coli O157:H7 str.
TW14359]
gi|387881609|ref|YP_006311911.1| hypothetical protein CDCO157_1056 [Escherichia coli Xuzhou21]
gi|416310648|ref|ZP_11656455.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|416322562|ref|ZP_11664331.1| hypothetical protein ECoD_04688 [Escherichia coli O157:H7 str.
EC1212]
gi|416331036|ref|ZP_11669842.1| hypothetical protein ECF_04827 [Escherichia coli O157:H7 str. 1125]
gi|419062260|ref|ZP_13609010.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
gi|419085755|ref|ZP_13631139.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|420302761|ref|ZP_14804787.1| hypothetical protein ECTW10119_1664 [Escherichia coli TW10119]
gi|421822771|ref|ZP_16258205.1| hypothetical protein ECFRIK920_1214 [Escherichia coli FRIK920]
gi|13360548|dbj|BAB34511.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|188014661|gb|EDU52783.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|189001387|gb|EDU70373.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|189358359|gb|EDU76778.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|189366628|gb|EDU85044.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|189372714|gb|EDU91130.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|189376540|gb|EDU94956.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|208727829|gb|EDZ77430.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208731876|gb|EDZ80564.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208737728|gb|EDZ85411.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209161345|gb|ACI38778.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|217320510|gb|EEC28934.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|254591597|gb|ACT70958.1| hypothetical protein ECSP_1106 [Escherichia coli O157:H7 str.
TW14359]
gi|320188736|gb|EFW63396.1| hypothetical protein ECoD_04688 [Escherichia coli O157:H7 str.
EC1212]
gi|326338932|gb|EGD62748.1| hypothetical protein ECF_04827 [Escherichia coli O157:H7 str. 1125]
gi|326344336|gb|EGD68095.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|377913391|gb|EHU77530.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
gi|377935130|gb|EHU98946.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|386795067|gb|AFJ28101.1| hypothetical protein CDCO157_1056 [Escherichia coli Xuzhou21]
gi|390818586|gb|EIO84955.1| hypothetical protein ECTW10119_1664 [Escherichia coli TW10119]
gi|408075173|gb|EKH09415.1| hypothetical protein ECFRIK920_1214 [Escherichia coli FRIK920]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419260225|ref|ZP_13802663.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
gi|378111870|gb|EHW73453.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
Length = 616
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|416786337|ref|ZP_11878984.1| hypothetical protein ECO9389_09456, partial [Escherichia coli
O157:H- str. 493-89]
gi|320646856|gb|EFX15718.1| hypothetical protein ECO9389_09456 [Escherichia coli O157:H- str.
493-89]
Length = 408
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420274096|ref|ZP_14776426.1| yjhS, partial [Escherichia coli PA40]
gi|390761597|gb|EIO30879.1| yjhS, partial [Escherichia coli PA40]
Length = 415
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419319956|ref|ZP_13861740.1| hypothetical protein ECDEC12A_5340 [Escherichia coli DEC12A]
gi|378162116|gb|EHX23082.1| hypothetical protein ECDEC12A_5340 [Escherichia coli DEC12A]
Length = 617
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|260844405|ref|YP_003222183.1| hypothetical protein ECO103_2260 [Escherichia coli O103:H2 str.
12009]
gi|419303717|ref|ZP_13845680.1| hypothetical protein ECDEC11C_5683 [Escherichia coli DEC11C]
gi|257759552|dbj|BAI31049.1| hypothetical protein ECO103_2260 [Escherichia coli O103:H2 str.
12009]
gi|378141671|gb|EHX02880.1| hypothetical protein ECDEC11C_5683 [Escherichia coli DEC11C]
Length = 616
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419136860|ref|ZP_13681658.1| hypothetical protein ECDEC5E_2354 [Escherichia coli DEC5E]
gi|377984746|gb|EHV47974.1| hypothetical protein ECDEC5E_2354 [Escherichia coli DEC5E]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420297580|ref|ZP_14799654.1| hypothetical protein ECTW09109_2043, partial [Escherichia coli
TW09109]
gi|390809569|gb|EIO76356.1| hypothetical protein ECTW09109_2043, partial [Escherichia coli
TW09109]
Length = 380
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|291283185|ref|YP_003500003.1| YjhS [Escherichia coli O55:H7 str. CB9615]
gi|387507250|ref|YP_006159506.1| YjhS [Escherichia coli O55:H7 str. RM12579]
gi|419115234|ref|ZP_13660253.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5A]
gi|419126430|ref|ZP_13671318.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
gi|290763058|gb|ADD57019.1| YjhS [Escherichia coli O55:H7 str. CB9615]
gi|374359244|gb|AEZ40951.1| YjhS [Escherichia coli O55:H7 str. RM12579]
gi|377961029|gb|EHV24503.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5A]
gi|377975821|gb|EHV39137.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420291219|ref|ZP_14793381.1| hypothetical protein ECTW11039_1363 [Escherichia coli TW11039]
gi|390800857|gb|EIO67932.1| hypothetical protein ECTW11039_1363 [Escherichia coli TW11039]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|15802440|ref|NP_288466.1| hypothetical protein Z3107 [Escherichia coli O157:H7 str. EDL933]
gi|12516124|gb|AAG57020.1|AE005421_8 unknown protein encoded within prophage CP-933U [Escherichia coli
O157:H7 str. EDL933]
Length = 617
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQXALFTAMLXQFRADL 261
>gi|421829507|ref|ZP_16264834.1| hypothetical protein ECPA7_1669 [Escherichia coli PA7]
gi|408071834|gb|EKH06169.1| hypothetical protein ECPA7_1669 [Escherichia coli PA7]
Length = 610
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420089175|ref|ZP_14601003.1| hypothetical protein ECO9602_14066, partial [Escherichia coli
O111:H8 str. CVM9602]
gi|394388503|gb|EJE65777.1| hypothetical protein ECO9602_14066, partial [Escherichia coli
O111:H8 str. CVM9602]
Length = 142
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 33 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 92
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 93 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 133
>gi|417124682|ref|ZP_11973140.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386145975|gb|EIG92426.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419074578|ref|ZP_13620135.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
gi|377928891|gb|EHU92794.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420314027|ref|ZP_14815931.1| hypothetical protein ECEC1734_1255 [Escherichia coli EC1734]
gi|390911217|gb|EIP69931.1| hypothetical protein ECEC1734_1255 [Escherichia coli EC1734]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419231784|ref|ZP_13774570.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC9B]
gi|378080545|gb|EHW42506.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC9B]
Length = 432
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPL-HADIDVNK--TNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L H D++K VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGLYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419200819|ref|ZP_13744072.1| hypothetical protein ECDEC8A_5897 [Escherichia coli DEC8A]
gi|378038155|gb|EHW00674.1| hypothetical protein ECDEC8A_5897 [Escherichia coli DEC8A]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLYIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|416808812|ref|ZP_11888568.1| YjhS, partial [Escherichia coli O55:H7 str. 3256-97]
gi|320657704|gb|EFX25493.1| YjhS [Escherichia coli O55:H7 str. 3256-97 TW 07815]
Length = 430
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419208170|ref|ZP_13751290.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8C]
gi|378060456|gb|EHW22648.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC8C]
Length = 529
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 50 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASHD 109
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 110 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 165
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 166 MLTQFRADL 174
>gi|416805138|ref|ZP_11887652.1| hypothetical protein ECO2687_05571, partial [Escherichia coli
O157:H- str. H 2687]
gi|320648040|gb|EFX16723.1| hypothetical protein ECO2687_05571 [Escherichia coli O157:H- str. H
2687]
Length = 384
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|260856740|ref|YP_003230631.1| hypothetical protein ECO26_3694 [Escherichia coli O26:H11 str.
11368]
gi|257755389|dbj|BAI26891.1| hypothetical protein ECO26_3694 [Escherichia coli O26:H11 str.
11368]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419309768|ref|ZP_13851646.1| hypothetical protein ECDEC11E_0274 [Escherichia coli DEC11E]
gi|378161887|gb|EHX22859.1| hypothetical protein ECDEC11E_0274 [Escherichia coli DEC11E]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|260854294|ref|YP_003228185.1| hypothetical protein ECO26_1129 [Escherichia coli O26:H11 str.
11368]
gi|257752943|dbj|BAI24445.1| hypothetical protein ECO26_1129 [Escherichia coli O26:H11 str.
11368]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|420275279|ref|ZP_14777580.1| hypothetical protein ECPA40_2516, partial [Escherichia coli PA40]
gi|390759060|gb|EIO28458.1| hypothetical protein ECPA40_2516, partial [Escherichia coli PA40]
Length = 370
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|415800259|ref|ZP_11499252.1| hypothetical protein ECE128010_2970 [Escherichia coli E128010]
gi|323160794|gb|EFZ46725.1| hypothetical protein ECE128010_2970 [Escherichia coli E128010]
Length = 616
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|419306798|ref|ZP_13848698.1| hypothetical protein ECDEC11D_2361 [Escherichia coli DEC11D]
gi|378148785|gb|EHX09918.1| hypothetical protein ECDEC11D_2361 [Escherichia coli DEC11D]
Length = 617
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|424109747|ref|ZP_17844078.1| hypothetical protein EC93001_2498, partial [Escherichia coli
93-001]
gi|390664214|gb|EIN41672.1| hypothetical protein EC93001_2498, partial [Escherichia coli
93-001]
Length = 127
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 11 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 70
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 71 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 111
>gi|419314670|ref|ZP_13856507.1| hypothetical protein ECDEC11E_5262, partial [Escherichia coli
DEC11E]
gi|378151520|gb|EHX12630.1| hypothetical protein ECDEC11E_5262, partial [Escherichia coli
DEC11E]
Length = 612
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 132 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 191
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 192 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 247
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 248 MLKQFRADL 256
>gi|419132612|ref|ZP_13677448.1| hypothetical protein ECDEC5D_3378, partial [Escherichia coli DEC5D]
gi|377975029|gb|EHV38353.1| hypothetical protein ECDEC5D_3378, partial [Escherichia coli DEC5D]
Length = 232
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 127
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 128 MLTQFRADL 136
>gi|419282921|ref|ZP_13825131.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
gi|378137804|gb|EHW99069.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
Length = 485
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 6 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 65
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 66 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 121
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 122 MLTQFRADL 130
>gi|416819226|ref|ZP_11893121.1| YjhS, partial [Escherichia coli O55:H7 str. USDA 5905]
gi|320663387|gb|EFX30684.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
Length = 450
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|417171229|ref|ZP_12001758.1| PF08410 domain protein [Escherichia coli 3.2608]
gi|386181153|gb|EIH58623.1| PF08410 domain protein [Escherichia coli 3.2608]
Length = 602
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 253 MLTQFRADL 261
>gi|419074662|ref|ZP_13620212.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
gi|377927275|gb|EHU91191.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
Length = 489
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 124
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 125 MLTQFRADL 133
>gi|425329957|ref|ZP_18717909.1| hypothetical protein ECEC1846_2767, partial [Escherichia coli
EC1846]
gi|408248803|gb|EKI70794.1| hypothetical protein ECEC1846_2767, partial [Escherichia coli
EC1846]
Length = 110
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 69 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 109
>gi|313203152|ref|YP_004041809.1| hypothetical protein Palpr_0668 [Paludibacter propionicigenes WB4]
gi|312442468|gb|ADQ78824.1| protein of unknown function DUF303 acetylesterase [Paludibacter
propionicigenes WB4]
Length = 297
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 87/215 (40%), Gaps = 35/215 (16%)
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
K W +A P++ NG+GP F ++ +P +G++ ++ G I W K
Sbjct: 74 KGNWYVATPPIN-----RPENGMGPVDFFGRTMVANLPKEYRVGVINVSVAGAKIELWDK 128
Query: 132 G---------------------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED 170
+ Y+++I+ A++A + G I+ +L +QGES+ +
Sbjct: 129 AGYKNYLDSAAGWMQNICKQYDGNPYQRLIEMAKIAQQ-DGVIKGILLHQGESNPNDKAW 187
Query: 171 AKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGE-----GPFIEIVRKAQLSSDLPNVR 225
+ K D DL +L++ +P + L S E F V A L LPN
Sbjct: 188 PQKVKAIYDNILKDL--NLKAKDVPFLAGELKSAEEHGVCAAFNTDVL-AYLPKALPNSY 244
Query: 226 CVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRV 260
+ + G+ PD H T ++ + L++
Sbjct: 245 IISSKGVKGSPDQFHFNTAGMREFGKRYAIQMLKI 279
>gi|420308823|ref|ZP_14810785.1| hypothetical protein ECEC1738_1701, partial [Escherichia coli
EC1738]
gi|390902549|gb|EIP61638.1| hypothetical protein ECEC1738_1701, partial [Escherichia coli
EC1738]
Length = 405
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|417124105|ref|ZP_11972876.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386146385|gb|EIG92832.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 455
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R DL
Sbjct: 241 ATHAQQPALFTAMLTQFRVDL 261
>gi|424306212|ref|ZP_17895472.1| hypothetical protein ECPA28_2400, partial [Escherichia coli PA28]
gi|390730366|gb|EIO02405.1| hypothetical protein ECPA28_2400, partial [Escherichia coli PA28]
Length = 114
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|419267244|ref|ZP_13809603.1| hypothetical protein ECDEC10C_2909 [Escherichia coli DEC10C]
gi|378112506|gb|EHW74083.1| hypothetical protein ECDEC10C_2909 [Escherichia coli DEC10C]
Length = 344
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKRLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 124
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 125 MLTQFRADL 133
>gi|419310789|ref|ZP_13852660.1| hypothetical protein ECDEC11E_1318 [Escherichia coli DEC11E]
gi|378160504|gb|EHX21501.1| hypothetical protein ECDEC11E_1318 [Escherichia coli DEC11E]
Length = 616
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R DL
Sbjct: 241 ATHAQQPALFTAMLTQFRVDL 261
>gi|425104046|ref|ZP_18506444.1| hypothetical protein EC52239_2481, partial [Escherichia coli
5.2239]
gi|425397005|ref|ZP_18780022.1| hypothetical protein ECEC1869_1330, partial [Escherichia coli
EC1869]
gi|408330181|gb|EKJ45497.1| hypothetical protein ECEC1869_1330, partial [Escherichia coli
EC1869]
gi|408552935|gb|EKK30083.1| hypothetical protein EC52239_2481, partial [Escherichia coli
5.2239]
Length = 118
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 14 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 73
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 74 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 114
>gi|424500138|ref|ZP_17947172.1| hypothetical protein ECEC4203_2304, partial [Escherichia coli
EC4203]
gi|390830966|gb|EIO96435.1| hypothetical protein ECEC4203_2304, partial [Escherichia coli
EC4203]
Length = 115
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|15801970|ref|NP_287991.1| hypothetical protein Z6054 [Escherichia coli O157:H7 str. EDL933]
gi|15831516|ref|NP_310289.1| hypothetical protein ECs2262 [Escherichia coli O157:H7 str. Sakai]
gi|168784406|ref|ZP_02809413.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|168802069|ref|ZP_02827076.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|208810548|ref|ZP_03252424.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208816710|ref|ZP_03257830.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208821046|ref|ZP_03261366.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399622|ref|YP_002270629.1| hypothetical protein ECH74115_2258 [Escherichia coli O157:H7 str.
EC4115]
gi|254793171|ref|YP_003078008.1| hypothetical protein ECSP_2114 [Escherichia coli O157:H7 str.
TW14359]
gi|387882661|ref|YP_006312963.1| hypothetical protein CDCO157_2097 [Escherichia coli Xuzhou21]
gi|416313898|ref|ZP_11658465.1| hypothetical protein ECoA_04276 [Escherichia coli O157:H7 str.
1044]
gi|416328129|ref|ZP_11667970.1| hypothetical protein ECF_02882 [Escherichia coli O157:H7 str. 1125]
gi|13259601|gb|AAK16970.1|AE006460_8 conserved hypothetical YjhS family protein encoded by cryptic
prophage CP-933P [Escherichia coli O157:H7 str. EDL933]
gi|13361728|dbj|BAB35685.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|188998427|gb|EDU67434.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|189375909|gb|EDU94325.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|208725064|gb|EDZ74771.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208731053|gb|EDZ79742.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208741169|gb|EDZ88851.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209161022|gb|ACI38455.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|254592571|gb|ACT71932.1| conserved hypothetical YjhS family protein encoded by cryptic
prophage CP-933P [Escherichia coli O157:H7 str. TW14359]
gi|326340102|gb|EGD63907.1| hypothetical protein ECoA_04276 [Escherichia coli O157:H7 str.
1044]
gi|326342514|gb|EGD66290.1| hypothetical protein ECF_02882 [Escherichia coli O157:H7 str. 1125]
gi|386796119|gb|AFJ29153.1| hypothetical protein CDCO157_2097 [Escherichia coli Xuzhou21]
Length = 616
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419272172|ref|ZP_13814481.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
gi|378119580|gb|EHW81073.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
Length = 616
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|419157661|ref|ZP_13702188.1| hypothetical protein ECDEC6D_0458 [Escherichia coli DEC6D]
gi|378014556|gb|EHV77459.1| hypothetical protein ECDEC6D_0458 [Escherichia coli DEC6D]
Length = 416
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 105 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 164
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 165 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 224
Query: 182 FTDL 185
TDL
Sbjct: 225 RTDL 228
>gi|432691182|ref|ZP_19926417.1| hypothetical protein A31G_03402 [Escherichia coli KTE161]
gi|431228207|gb|ELF25324.1| hypothetical protein A31G_03402 [Escherichia coli KTE161]
Length = 618
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ ++
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTKGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ ++ R + AL+ + A+ W QGE D A Y ++ D+F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDM----SAATYAQQPDLFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRTDL 261
>gi|425254699|ref|ZP_18647325.1| hypothetical protein ECCB7326_2349, partial [Escherichia coli
CB7326]
gi|408177768|gb|EKI04525.1| hypothetical protein ECCB7326_2349, partial [Escherichia coli
CB7326]
Length = 110
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 109
>gi|260854031|ref|YP_003227922.1| hypothetical protein ECO26_0861 [Escherichia coli O26:H11 str.
11368]
gi|257752680|dbj|BAI24182.1| hypothetical protein ECO26_0861 [Escherichia coli O26:H11 str.
11368]
Length = 513
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHVQQPALFTA 149
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 150 MLTQFRADL 158
>gi|168802138|ref|ZP_02827145.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|189375851|gb|EDU94267.1| YjhS [Escherichia coli O157:H7 str. EC508]
Length = 616
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|424581103|ref|ZP_18020836.1| hypothetical protein ECEC1863_2009, partial [Escherichia coli
EC1863]
gi|390921427|gb|EIP79624.1| hypothetical protein ECEC1863_2009, partial [Escherichia coli
EC1863]
Length = 112
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 11 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 70
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 71 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 111
>gi|419300439|ref|ZP_13842439.1| hypothetical protein ECDEC11C_2311 [Escherichia coli DEC11C]
gi|378151328|gb|EHX12440.1| hypothetical protein ECDEC11C_2311 [Escherichia coli DEC11C]
Length = 573
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 94 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSAATGASQD 153
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 154 SARWGVGKPLYQDLIARTRAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 209
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 210 MLKQFRADL 218
>gi|419068750|ref|ZP_13614586.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3E]
gi|377916417|gb|EHU80502.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3E]
Length = 617
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|425384196|ref|ZP_18768047.1| hypothetical protein ECEC1866_1006, partial [Escherichia coli
EC1866]
gi|408315067|gb|EKJ31401.1| hypothetical protein ECEC1866_1006, partial [Escherichia coli
EC1866]
Length = 116
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 13 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQD 72
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 73 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 113
>gi|417298962|ref|ZP_12086197.1| PF03629 domain protein [Escherichia coli 900105 (10e)]
gi|386257563|gb|EIJ13049.1| PF03629 domain protein [Escherichia coli 900105 (10e)]
Length = 513
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 34 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 93
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 94 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHVQQPALFTA 149
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 150 MLTQFRADL 158
>gi|419091783|ref|ZP_13637087.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
gi|377946294|gb|EHV09976.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
Length = 616
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|420283379|ref|ZP_14785605.1| hypothetical protein ECTW06591_4998 [Escherichia coli TW06591]
gi|390778868|gb|EIO46622.1| hypothetical protein ECTW06591_4998 [Escherichia coli TW06591]
Length = 616
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|422998834|ref|ZP_16989590.1| hypothetical protein EUEG_01262 [Escherichia coli O104:H4 str.
09-7901]
gi|423007294|ref|ZP_16998037.1| hypothetical protein EUDG_04293 [Escherichia coli O104:H4 str.
04-8351]
gi|354856682|gb|EHF17140.1| hypothetical protein EUDG_04293 [Escherichia coli O104:H4 str.
04-8351]
gi|354875011|gb|EHF35377.1| hypothetical protein EUEG_01262 [Escherichia coli O104:H4 str.
09-7901]
Length = 684
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 188 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 247
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 248 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 307
Query: 182 FTDL 185
TDL
Sbjct: 308 RTDL 311
>gi|425174631|ref|ZP_18572784.1| hypothetical protein ECFDA504_2926, partial [Escherichia coli
FDA504]
gi|408092944|gb|EKH26084.1| hypothetical protein ECFDA504_2926, partial [Escherichia coli
FDA504]
Length = 117
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 12 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 71
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 72 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|417209765|ref|ZP_12020968.1| PF08410 domain protein [Escherichia coli JB1-95]
gi|386196071|gb|EIH90298.1| PF08410 domain protein [Escherichia coli JB1-95]
Length = 497
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|386280366|ref|ZP_10058033.1| hypothetical protein ESBG_01401 [Escherichia sp. 4_1_40B]
gi|386122581|gb|EIG71191.1| hypothetical protein ESBG_01401 [Escherichia sp. 4_1_40B]
Length = 344
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|425384194|ref|ZP_18768046.1| hypothetical protein ECEC1866_1005, partial [Escherichia coli
EC1866]
gi|408315113|gb|EKJ31444.1| hypothetical protein ECEC1866_1005, partial [Escherichia coli
EC1866]
Length = 240
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|383179584|ref|YP_005457589.1| hypothetical protein SSON53_15405 [Shigella sonnei 53G]
gi|419147570|ref|ZP_13692253.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6B]
gi|419163957|ref|ZP_13708419.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6E]
gi|432704042|ref|ZP_19939156.1| hypothetical protein A31Q_01920 [Escherichia coli KTE171]
gi|377998589|gb|EHV61680.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6B]
gi|378012760|gb|EHV75688.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6E]
gi|431244739|gb|ELF39042.1| hypothetical protein A31Q_01920 [Escherichia coli KTE171]
Length = 684
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 188 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 247
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 248 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 307
Query: 182 FTDL 185
TDL
Sbjct: 308 RTDL 311
>gi|417836482|ref|ZP_12482793.1| hypothetical protein HUSEC41_27119 [Escherichia coli O104:H4 str.
01-09591]
gi|340730820|gb|EGR60086.1| hypothetical protein HUSEC41_27119 [Escherichia coli O104:H4 str.
01-09591]
Length = 684
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 188 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 247
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 248 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 307
Query: 182 FTDL 185
TDL
Sbjct: 308 RTDL 311
>gi|416799974|ref|ZP_11884591.1| hypothetical protein ECO2687_22780, partial [Escherichia coli
O157:H- str. H 2687]
gi|320651355|gb|EFX19779.1| hypothetical protein ECO2687_22780 [Escherichia coli O157:H- str. H
2687]
Length = 416
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|420112623|ref|ZP_14622417.1| YjhS, partial [Escherichia coli O26:H11 str. CVM10021]
gi|394414140|gb|EJE88103.1| YjhS, partial [Escherichia coli O26:H11 str. CVM10021]
Length = 446
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 80/201 (39%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|419074907|ref|ZP_13620455.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
gi|377927154|gb|EHU91076.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3F]
Length = 616
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|415771675|ref|ZP_11485482.1| conserved hypothetical protein [Escherichia coli 3431]
gi|315619657|gb|EFV00179.1| conserved hypothetical protein [Escherichia coli 3431]
Length = 677
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 181 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 240
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 241 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 300
Query: 182 FTDL 185
TDL
Sbjct: 301 RTDL 304
>gi|291281117|ref|YP_003497935.1| YjhS [Escherichia coli O55:H7 str. CB9615]
gi|290760990|gb|ADD54951.1| YjhS [Escherichia coli O55:H7 str. CB9615]
Length = 651
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 77/200 (38%), Gaps = 40/200 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPPQCQ--------PNPSILRL 69
+++LAGQSN G G+ D R +L V P + P L
Sbjct: 104 VVVLAGQSNAMSYGEGIPLPDSYDAPDPRIKQLARRSTVTPGGEACVFNDVIPADHCLHD 163
Query: 70 TAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT----- 124
+ V +H AD+ + VG GL A +L +P I LVPC+ GG+
Sbjct: 164 VQDMS-VFSHP--EADLSKGQYGCVGQGLHIAKRLLPYIPKNAGILLVPCSRGGSAFTAG 220
Query: 125 -------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLE 169
N ++W G LY+ +I R + AL + AV W QGE D N
Sbjct: 221 ADGTFSEATGASQNSARWGVGKPLYQDLILRTKAALAKNPENVLLAVCWMQGEFDMTNAG 280
Query: 170 DAKLYKERSDMFFTDLRSDL 189
A+ M RSDL
Sbjct: 281 YAQQPAAFQSM-VQQFRSDL 299
>gi|15801790|ref|NP_287808.1| hypothetical protein Z2377 [Escherichia coli O157:H7 str. EDL933]
gi|12515371|gb|AAG56422.1|AE005369_11 unknown protein encoded within prophage CP-933R [Escherichia coli
O157:H7 str. EDL933]
Length = 616
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|383178790|ref|YP_005456795.1| prophage protein [Shigella sonnei 53G]
gi|414576370|ref|ZP_11433556.1| hypothetical protein SS323385_2201 [Shigella sonnei 3233-85]
gi|418266070|ref|ZP_12885704.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella sonnei str. Moseley]
gi|420358898|ref|ZP_14859876.1| hypothetical protein SS322685_2684 [Shigella sonnei 3226-85]
gi|391283035|gb|EIQ41659.1| hypothetical protein SS322685_2684 [Shigella sonnei 3226-85]
gi|391285441|gb|EIQ44020.1| hypothetical protein SS323385_2201 [Shigella sonnei 3233-85]
gi|397900156|gb|EJL16521.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella sonnei str. Moseley]
Length = 617
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCCGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419322451|ref|ZP_13864173.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
gi|378170769|gb|EHX31646.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
Length = 617
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 80/201 (39%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|419232774|ref|ZP_13775553.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|419237108|ref|ZP_13779850.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
gi|419284479|ref|ZP_13826657.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
gi|378078387|gb|EHW40374.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|378087542|gb|EHW49401.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
gi|378133221|gb|EHW94567.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
Length = 616
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 80/201 (39%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|414577021|ref|ZP_11434202.1| hypothetical protein SS323385_2857 [Shigella sonnei 3233-85]
gi|419157052|ref|ZP_13701596.1| hypothetical protein ECDEC6C_5293 [Escherichia coli DEC6C]
gi|419157178|ref|ZP_13701714.1| hypothetical protein ECDEC6D_5296 [Escherichia coli DEC6D]
gi|419158932|ref|ZP_13703444.1| hypothetical protein ECDEC6D_1738 [Escherichia coli DEC6D]
gi|377989505|gb|EHV52672.1| hypothetical protein ECDEC6C_5293 [Escherichia coli DEC6C]
gi|378009900|gb|EHV72849.1| hypothetical protein ECDEC6D_1738 [Escherichia coli DEC6D]
gi|378016354|gb|EHV79237.1| hypothetical protein ECDEC6D_5296 [Escherichia coli DEC6D]
gi|391284238|gb|EIQ42837.1| hypothetical protein SS323385_2857 [Shigella sonnei 3233-85]
Length = 601
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 105 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGADGSFSEASGASAD 164
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R +VAL + AV+W QGE+D + + + L+ F
Sbjct: 165 SSRWGAGKPLYQDLVSRTKVALAKNPKNKLLAVVWMQGEADLASGSQQHNSLFTAMVQQF 224
Query: 182 FTDL 185
TDL
Sbjct: 225 RTDL 228
>gi|419045811|ref|ZP_13592755.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3A]
gi|377894617|gb|EHU59036.1| putative 9-O-acetyl-N-acetylneuraminate esterase, partial
[Escherichia coli DEC3A]
Length = 386
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG L A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQDLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|416339028|ref|ZP_11674966.1| hypothetical protein EcoM_04454 [Escherichia coli WV_060327]
gi|320193221|gb|EFW67859.1| hypothetical protein EcoM_04454 [Escherichia coli WV_060327]
Length = 616
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 80/201 (39%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|306824334|ref|ZP_07457703.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679]
gi|304552365|gb|EFM40283.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679]
Length = 571
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 114 IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
IG++ + GGT IS+ +G +Y I A G + VLWYQG +D L +
Sbjct: 276 IGIIQTSWGGTAISRHVQGGDIYANHI-----APLTGFRVAGVLWYQGCNDASTLSTSLD 330
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLSS-DLPNVRCVD--A 229
Y+ + R LP + V LA G + + VR+ QL + D N+R A
Sbjct: 331 YESQMTALINQYREVFDESTLPFLYVQLARWSGYQYTQNVRQGQLRTLDNANLRNSANVA 390
Query: 230 MGLPLEPD 237
M + ++ D
Sbjct: 391 MTVSIDTD 398
>gi|15831035|ref|NP_309808.1| hypothetical protein ECs1781 [Escherichia coli O157:H7 str. Sakai]
gi|168762427|ref|ZP_02787434.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|387882276|ref|YP_006312578.1| hypothetical protein CDCO157_1711 [Escherichia coli Xuzhou21]
gi|416310399|ref|ZP_11656415.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|419044858|ref|ZP_13591819.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
gi|419050461|ref|ZP_13597358.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
gi|419062029|ref|ZP_13608787.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
gi|419097634|ref|ZP_13642862.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
gi|419103497|ref|ZP_13648651.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
gi|419108924|ref|ZP_13654011.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4F]
gi|420315629|ref|ZP_14817509.1| hypothetical protein ECEC1734_2851 [Escherichia coli EC1734]
gi|13361246|dbj|BAB35204.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|189367279|gb|EDU85695.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|326344502|gb|EGD68253.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|377897659|gb|EHU62035.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
gi|377897978|gb|EHU62342.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3B]
gi|377914876|gb|EHU78997.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3D]
gi|377947605|gb|EHV11271.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4D]
gi|377952102|gb|EHV15704.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
gi|377962011|gb|EHV25475.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4F]
gi|386795734|gb|AFJ28768.1| hypothetical protein CDCO157_1711 [Escherichia coli Xuzhou21]
gi|390908333|gb|EIP67157.1| hypothetical protein ECEC1734_2851 [Escherichia coli EC1734]
Length = 617
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|449450532|ref|XP_004143016.1| PREDICTED: uncharacterized protein LOC101219489 [Cucumis sativus]
Length = 111
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 43/71 (60%), Gaps = 8/71 (11%)
Query: 172 KLYKERSDMFFTDLRSDLQSPLLPII--RVALASGEGPF----IEIVRKAQ--LSSDLPN 223
K+YK+ FFTD+R D++ LPII ++AL P + VR+AQ +S +LP+
Sbjct: 2 KIYKDNLKKFFTDIRDDIKPRFLPIIVVKIALYDFFRPHDTHNLPAVREAQEAVSKELPD 61
Query: 224 VRCVDAMGLPL 234
V +D++ LP+
Sbjct: 62 VVAIDSLKLPI 72
>gi|260066208|gb|ACX30648.1| Axe19 precursor [Sphingobacterium sp. TN19]
Length = 277
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 101/251 (40%), Gaps = 45/251 (17%)
Query: 20 QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAH 79
Q + + + GQSNM G + + + C P + R K +W LA
Sbjct: 24 QDKNFHIYLCFGQSNMEGHSKFEDQDTLGNNRFYSLQAVDC---PDLNR--KKGEWYLAK 78
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS------------ 127
P+ G+ P F + +P+ IG++ +IGG +I
Sbjct: 79 PPI-----TRSNTGLTPADYFGRTLAENLPDSIRIGIINVSIGGCHIQLFDRDSVTNYVE 133
Query: 128 ---QWRKG------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERS 178
QW KG ++ Y+++++ A+VA + G I+ +L +QGES+T + + + +
Sbjct: 134 RAPQWMKGMLAAYDNNPYDRLVEMAKVAQQ-TGVIKGILLHQGESNTGDQD----WPNKV 188
Query: 179 DMFFTDLRSDL-----QSPLLPIIRVALASGE--GPFIEIVRKAQLSSDLPNVRCVDAMG 231
+ ++ DL ++PLL +A G I+R L L N V +
Sbjct: 189 SRVYHNILEDLALQEEETPLLAGELLAADQGGRCASMNTIIRT--LPKTLKNAHIVSSKD 246
Query: 232 LPLEPDGLHLT 242
DGLH +
Sbjct: 247 CEGVADGLHFS 257
>gi|168750875|ref|ZP_02775897.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|168768722|ref|ZP_02793729.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|188014961|gb|EDU53083.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|189362043|gb|EDU80462.1| YjhS [Escherichia coli O157:H7 str. EC4486]
Length = 617
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419056627|ref|ZP_13603459.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
gi|377909315|gb|EHU73518.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3C]
Length = 617
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|171741969|ref|ZP_02917776.1| hypothetical protein BIFDEN_01072 [Bifidobacterium dentium ATCC
27678]
gi|171277583|gb|EDT45244.1| hypothetical protein BIFDEN_01072 [Bifidobacterium dentium ATCC
27678]
Length = 571
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 114 IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
IG++ + GGT IS+ +G +Y I A G + VLWYQG +D L +
Sbjct: 276 IGIIQTSWGGTAISRHVQGGDIYANHI-----APLTGFRVAGVLWYQGCNDASTLSTSLD 330
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLSS-DLPNVRCVD--A 229
Y+ + R LP + V LA G + + VR+ QL + D N+R A
Sbjct: 331 YESQMTALINQYRKVFDESTLPFLYVQLARWSGYQYTQNVRQGQLRTLDNANLRNSANVA 390
Query: 230 MGLPLEPD 237
M + ++ D
Sbjct: 391 MTVSIDTD 398
>gi|15800855|ref|NP_286871.1| hypothetical protein Z1349 [Escherichia coli O157:H7 str. EDL933]
gi|12514187|gb|AAG55482.1|AE005288_13 conserved hypothetical protein similar to yjhS for cryptic prophage
CP-933M [Escherichia coli O157:H7 str. EDL933]
Length = 616
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLXQFRADL 261
>gi|417194503|ref|ZP_12015541.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|386189649|gb|EIH78409.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 455
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|168755378|ref|ZP_02780385.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|168774836|ref|ZP_02799843.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|168778612|ref|ZP_02803619.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|168800514|ref|ZP_02825521.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|195939686|ref|ZP_03085068.1| hypothetical protein EscherichcoliO157_25325 [Escherichia coli
O157:H7 str. EC4024]
gi|208809446|ref|ZP_03251783.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208813328|ref|ZP_03254657.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208821588|ref|ZP_03261908.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209399258|ref|YP_002272106.1| hypothetical protein ECH74115_3879 [Escherichia coli O157:H7 str.
EC4115]
gi|254794581|ref|YP_003079418.1| hypothetical protein ECSP_3580 [Escherichia coli O157:H7 str.
TW14359]
gi|416325071|ref|ZP_11665539.1| hypothetical protein ECF_00344 [Escherichia coli O157:H7 str. 1125]
gi|187769576|gb|EDU33420.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|189003499|gb|EDU72485.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|189357339|gb|EDU75758.1| YjhS [Escherichia coli O157:H7 str. EC4401]
gi|189377183|gb|EDU95599.1| YjhS [Escherichia coli O157:H7 str. EC508]
gi|208729247|gb|EDZ78848.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208734605|gb|EDZ83292.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208741711|gb|EDZ89393.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209160658|gb|ACI38091.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|254593981|gb|ACT73342.1| hypothetical protein ECSP_3580 [Escherichia coli O157:H7 str.
TW14359]
gi|326346319|gb|EGD70056.1| hypothetical protein ECF_00344 [Escherichia coli O157:H7 str. 1125]
Length = 617
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|421830246|ref|ZP_16265561.1| hypothetical protein ECPA7_2402 [Escherichia coli PA7]
gi|408069345|gb|EKH03732.1| hypothetical protein ECPA7_2402 [Escherichia coli PA7]
Length = 617
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419220044|ref|ZP_13762996.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
gi|378071278|gb|EHW33348.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
Length = 624
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|15832752|ref|NP_311525.1| hypothetical protein ECs3498 [Escherichia coli O157:H7 str. Sakai]
gi|168789535|ref|ZP_02814542.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|217326899|ref|ZP_03442982.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|387883825|ref|YP_006314127.1| hypothetical protein CDCO157_3261 [Escherichia coli Xuzhou21]
gi|416321788|ref|ZP_11663636.1| hypothetical protein ECoD_03963 [Escherichia coli O157:H7 str.
EC1212]
gi|13362969|dbj|BAB36921.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|189370844|gb|EDU89260.1| YjhS [Escherichia coli O157:H7 str. EC869]
gi|217319266|gb|EEC27691.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|320188968|gb|EFW63627.1| hypothetical protein ECoD_03963 [Escherichia coli O157:H7 str.
EC1212]
gi|386797283|gb|AFJ30317.1| hypothetical protein CDCO157_3261 [Escherichia coli Xuzhou21]
Length = 617
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419407356|ref|ZP_13948046.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
gi|378254767|gb|EHY14629.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
Length = 475
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ ++
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTKGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D +A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----NAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|419866689|ref|ZP_14389039.1| YjhS [Escherichia coli O103:H25 str. CVM9340]
gi|388334172|gb|EIL00776.1| YjhS [Escherichia coli O103:H25 str. CVM9340]
Length = 616
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|419224181|ref|ZP_13767087.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
gi|378060267|gb|EHW22465.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
Length = 616
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|419391826|ref|ZP_13932640.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
gi|419396915|ref|ZP_13937685.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15B]
gi|419402244|ref|ZP_13942968.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|419412928|ref|ZP_13953583.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
gi|378237947|gb|EHX97960.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
gi|378245266|gb|EHY05204.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15B]
gi|378246778|gb|EHY06697.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|378259313|gb|EHY19126.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
Length = 617
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 55/129 (42%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ ++
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTKGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D +A Y ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----NAATYAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLKQFRADL 261
>gi|421823451|ref|ZP_16258865.1| hypothetical protein ECFRIK920_1880 [Escherichia coli FRIK920]
gi|408073760|gb|EKH08065.1| hypothetical protein ECFRIK920_1880 [Escherichia coli FRIK920]
Length = 618
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|415822173|ref|ZP_11510924.1| hypothetical protein ECOK1180_3718 [Escherichia coli OK1180]
gi|417202218|ref|ZP_12018468.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|417594560|ref|ZP_12245246.1| hypothetical protein EC253486_5224 [Escherichia coli 2534-86]
gi|323177639|gb|EFZ63224.1| hypothetical protein ECOK1180_3718 [Escherichia coli OK1180]
gi|345331667|gb|EGW64127.1| hypothetical protein EC253486_5224 [Escherichia coli 2534-86]
gi|386187105|gb|EIH75928.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 616
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|309802031|ref|ZP_07696144.1| putative lipoprotein [Bifidobacterium dentium JCVIHMP022]
gi|308221366|gb|EFO77665.1| putative lipoprotein [Bifidobacterium dentium JCVIHMP022]
Length = 538
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 114 IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
IG++ + GGT IS+ +G +Y I A G + VLWYQG +D L +
Sbjct: 243 IGIIQTSWGGTAISRHVQGGDIYANHI-----APLTGFRVAGVLWYQGCNDASTLSTSLD 297
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLSS-DLPNVRCVD--A 229
Y+ + R LP + V LA G + + VR+ QL + D N+R A
Sbjct: 298 YESQMTALINQYREVFDESTLPFLYVQLARWSGYQYTQNVRQGQLRTLDNANLRNSANVA 357
Query: 230 MGLPLEPD 237
M + ++ D
Sbjct: 358 MTVSIDTD 365
>gi|417199457|ref|ZP_12016909.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|386188438|gb|EIH77244.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 589
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|417591594|ref|ZP_12242297.1| hypothetical protein EC253486_2194 [Escherichia coli 2534-86]
gi|345341739|gb|EGW74142.1| hypothetical protein EC253486_2194 [Escherichia coli 2534-86]
Length = 589
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|387505227|ref|YP_006157483.1| YjhS [Escherichia coli O55:H7 str. RM12579]
gi|416815186|ref|ZP_11891792.1| YjhS [Escherichia coli O55:H7 str. 3256-97]
gi|416825925|ref|ZP_11896990.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
gi|419118630|ref|ZP_13663617.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
gi|419127162|ref|ZP_13672042.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
gi|419129867|ref|ZP_13674721.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|425250343|ref|ZP_18643286.1| hypothetical protein EC5905_3955 [Escherichia coli 5905]
gi|320654234|gb|EFX22294.1| YjhS [Escherichia coli O55:H7 str. 3256-97 TW 07815]
gi|320659259|gb|EFX26839.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
gi|374357221|gb|AEZ38928.1| YjhS [Escherichia coli O55:H7 str. RM12579]
gi|377973603|gb|EHV36942.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5C]
gi|377973960|gb|EHV37290.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5B]
gi|377981935|gb|EHV45192.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|408163200|gb|EKH91075.1| hypothetical protein EC5905_3955 [Escherichia coli 5905]
Length = 613
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 77/200 (38%), Gaps = 40/200 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPPQCQ--------PNPSILRL 69
+++LAGQSN G G+ D R +L V P + P L
Sbjct: 66 VVVLAGQSNAMSYGEGIPLPDSYDAPDPRIKQLARRSTVTPGGEACVFNDVIPADHCLHD 125
Query: 70 TAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT----- 124
+ V +H AD+ + VG GL A +L +P I LVPC+ GG+
Sbjct: 126 VQDMS-VFSHP--EADLSKGQYGCVGQGLHIAKRLLPYIPKNAGILLVPCSRGGSAFTAG 182
Query: 125 -------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLE 169
N ++W G LY+ +I R + AL + AV W QGE D N
Sbjct: 183 ADGTFSEATGASQNSARWGVGKPLYQDLILRTKAALAKNPENVLLAVCWMQGEFDMTNAG 242
Query: 170 DAKLYKERSDMFFTDLRSDL 189
A+ M RSDL
Sbjct: 243 YAQQPAAFQSM-VQQFRSDL 261
>gi|283456892|ref|YP_003361456.1| Sialic acid-specific 9-O-acetylesterase [Bifidobacterium dentium
Bd1]
gi|283103526|gb|ADB10632.1| Sialic acid-specific 9-O-acetylesterase [Bifidobacterium dentium
Bd1]
Length = 538
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 9/128 (7%)
Query: 114 IGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKL 173
IG++ + GGT IS+ +G +Y I A G + VLWYQG +D L +
Sbjct: 243 IGIIQTSWGGTAISRHVQGGDIYANHI-----APLTGFRVAGVLWYQGCNDASTLSTSLD 297
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG-PFIEIVRKAQLSS-DLPNVRCVD--A 229
Y+ + R LP + V LA G + + VR+ QL + D N+R A
Sbjct: 298 YESQMTALINQYRKVFDESTLPFLYVQLARWSGYQYTQNVRQGQLRTLDNANLRNSANVA 357
Query: 230 MGLPLEPD 237
M + ++ D
Sbjct: 358 MTVSIDTD 365
>gi|383818103|ref|ZP_09973401.1| hypothetical protein MPHLEI_02433 [Mycobacterium phlei RIVM601174]
gi|383339348|gb|EID17684.1| hypothetical protein MPHLEI_02433 [Mycobacterium phlei RIVM601174]
Length = 294
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 84/201 (41%), Gaps = 27/201 (13%)
Query: 9 ILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILR 68
+L PV ++ + GQSN G G L DG+ P + + +
Sbjct: 28 LLAPRGVPVDPPETPYLVVPILGQSNAFGMG--------VGLDPDGLDRPHPRVHQWAMC 79
Query: 69 LTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS- 127
+K VLA +PL +I GVG G+ FA + + L+P A G T+ +
Sbjct: 80 GRSKNTAVLARDPLLHEI---PGKGVGFGMTFARNLADATGR--TVLLIPGARGDTSFTP 134
Query: 128 ----QW-----RKGSSLYEQMIQRAQVALRG--GGTIRAVLWYQGESDTVNLEDAKLYKE 176
W R +LY + + LR G + VLW+QGE+D V L Y+
Sbjct: 135 KNGYTWDPADTRTRVNLYRRAVSAIDTVLRRYPGSEVAVVLWHQGETD-VPLMSGPDYQA 193
Query: 177 RSDMFFTDLRSDLQSPLLPII 197
+ D F DLRS S LPI+
Sbjct: 194 KLDSTFNDLRSRYGSD-LPIL 213
>gi|260870776|ref|YP_003237178.1| hypothetical protein ECO111_4888 [Escherichia coli O111:H- str.
11128]
gi|257767132|dbj|BAI38627.1| hypothetical protein ECO111_4888 [Escherichia coli O111:H- str.
11128]
Length = 616
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 71/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|417230975|ref|ZP_12032391.1| PF03629 domain protein [Escherichia coli 5.0959]
gi|386205556|gb|EII10066.1| PF03629 domain protein [Escherichia coli 5.0959]
Length = 489
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 69 SARWGVGKPLYQDLIVRTKAALQKNPKNVLLAVCWMQGEFDM----SAATYAQQPALFTA 124
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 125 MLKQFRADL 133
>gi|419147340|ref|ZP_13692029.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC6B]
gi|377999583|gb|EHV62661.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC6B]
Length = 592
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419289446|ref|ZP_13831541.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
gi|378131377|gb|EHW92734.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC11A]
Length = 616
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I L PC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLAPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHVQQPALFTAMLTQFRADL 261
>gi|432450027|ref|ZP_19692295.1| hypothetical protein A13W_00970 [Escherichia coli KTE193]
gi|433033681|ref|ZP_20221409.1| hypothetical protein WIC_02250 [Escherichia coli KTE112]
gi|430980786|gb|ELC97535.1| hypothetical protein A13W_00970 [Escherichia coli KTE193]
gi|431552970|gb|ELI26912.1| hypothetical protein WIC_02250 [Escherichia coli KTE112]
Length = 656
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 81/197 (41%), Gaps = 33/197 (16%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPPQ---CQPNPSIL--RLTAK 72
+++ AGQSN MA G+ D+R +L V P C N IL
Sbjct: 76 VVVSAGQSNSMAYGEGLPLPDSYDKPDSRIRQLARRSTVTPSGKACAYNDIILADHCLHD 135
Query: 73 LKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS----- 127
++ + + AD++ + V GL A +L +P I LVPC+ GG+ +
Sbjct: 136 VQDMSQYNHPKADLNKGQYGCVSQGLHIAKRLLPFIPANAGILLVPCSRGGSGFTTGDAG 195
Query: 128 -------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
+W + LY+ +I R + AL + AV W QGE+D ++A
Sbjct: 196 QFSEIGGATEKSCRWGTNTPLYKDLISRTKAALAKNPKNVLLAVCWTQGEADLEKEQNAA 255
Query: 173 LYKERSDMFFTDLRSDL 189
+K+ R+DL
Sbjct: 256 QHKDLFTAMVKQFRADL 272
>gi|423010142|ref|ZP_17000879.1| hypothetical protein EUFG_05117, partial [Escherichia coli O104:H4
str. 11-3677]
gi|354880970|gb|EHF41301.1| hypothetical protein EUFG_05117, partial [Escherichia coli O104:H4
str. 11-3677]
Length = 534
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419002439|ref|ZP_13549973.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC1B]
gi|377848784|gb|EHU13762.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC1B]
Length = 477
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419247729|ref|ZP_13790339.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
gi|378100914|gb|EHW62605.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
Length = 630
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +P I LVPC+ GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKRLLPYIPKNAGILLVPCSRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|386618386|ref|YP_006137966.1| hypothetical protein ECNA114_0869 [Escherichia coli NA114]
gi|333968887|gb|AEG35692.1| Hypothetical protein ECNA114_0869 [Escherichia coli NA114]
Length = 923
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 444 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 503
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 504 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 552
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFT 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|419896433|ref|ZP_14416126.1| hypothetical protein ECO9574_19261, partial [Escherichia coli
O111:H8 str. CVM9574]
gi|388357779|gb|EIL22299.1| hypothetical protein ECO9574_19261, partial [Escherichia coli
O111:H8 str. CVM9574]
Length = 163
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 51/119 (42%), Gaps = 27/119 (22%)
Query: 94 VGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ------------------WRKGSSL 135
VG GL A +L +PN I LVPC GG+ +Q W G L
Sbjct: 6 VGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQDSARWGVGKPL 65
Query: 136 YEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF---FTDLRSDL 189
Y+ +I R + AL+ + AV W QGE D A + ++ +F T R+DL
Sbjct: 66 YQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTAMLTQFRADL 120
>gi|420293631|ref|ZP_14795746.1| hypothetical protein ECTW11039_3767 [Escherichia coli TW11039]
gi|390795245|gb|EIO62529.1| hypothetical protein ECTW11039_3767 [Escherichia coli TW11039]
Length = 617
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 70/172 (40%), Gaps = 35/172 (20%)
Query: 27 IILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLKW 75
I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 67 IVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLHD 125
Query: 76 VLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ---- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 126 VQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEG 185
Query: 129 --------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 186 TFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 237
>gi|331675565|ref|ZP_08376313.1| conserved hypothetical YjhS family protein encoded by [Escherichia
coli TA280]
gi|331067339|gb|EGI38746.1| conserved hypothetical YjhS family protein encoded by [Escherichia
coli TA280]
Length = 736
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 53/129 (41%), Gaps = 23/129 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD + VG GL A +L +P+ I LVPC GG+ +Q
Sbjct: 148 ADPAAGQYGCVGQGLHIAKKLLPYIPDNAGILLVPCCRGGSAFTQGSDGTFSETSGATEA 207
Query: 129 ---WRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDTVNLEDAK---LYKERSDM 180
W G LY +I R + AL R AV+W QGE D A+ L+ +
Sbjct: 208 SARWGVGKPLYRDLISRTKAALDNNPKNRLLAVVWMQGEFDMAGANYAQQPALFTQMVQQ 267
Query: 181 FFTDLRSDL 189
F T+L S L
Sbjct: 268 FRTELASHL 276
>gi|419011124|ref|ZP_13558504.1| hypothetical protein ECDEC1D_5384 [Escherichia coli DEC1D]
gi|377866493|gb|EHU31262.1| hypothetical protein ECDEC1D_5384 [Escherichia coli DEC1D]
Length = 546
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|417804719|ref|ZP_12451712.1| hypothetical protein HUSEC_07147 [Escherichia coli O104:H4 str.
LB226692]
gi|340740702|gb|EGR74890.1| hypothetical protein HUSEC_07147 [Escherichia coli O104:H4 str.
LB226692]
Length = 546
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|218461796|ref|ZP_03501887.1| hypothetical protein RetlK5_20963 [Rhizobium etli Kim 5]
Length = 259
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 59/126 (46%), Gaps = 8/126 (6%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
AN ++ N VI L P A GG+ +++W G ++ + G I +VLW
Sbjct: 133 LANKLIASGQNDNVI-LAPLAYGGSEVARWAAGGDFNPLLVDTVKQLHDSGYRITSVLWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIE-----IV 212
QGE+D V A+ Y+ER LR +++P+ + I L G F E ++
Sbjct: 192 QGEADLVFGTTAETYQERFLSMVGTLRQHGVEAPVYISIASKCLEPSNGGFKEHIPDNVI 251
Query: 213 RKAQLS 218
+AQL+
Sbjct: 252 VQAQLA 257
>gi|74312380|ref|YP_310799.1| prophage protein [Shigella sonnei Ss046]
gi|420362108|ref|ZP_14863033.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella sonnei 4822-66]
gi|73855857|gb|AAZ88564.1| unknown protein encoded within prophage [Shigella sonnei Ss046]
gi|391296678|gb|EIQ54761.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Shigella sonnei 4822-66]
Length = 617
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|417234231|ref|ZP_12034450.1| PF08410 domain protein [Escherichia coli 5.0959]
gi|386203443|gb|EII07967.1| PF08410 domain protein [Escherichia coli 5.0959]
Length = 617
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +PN I LVPC GG+
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTLGAE 184
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+ ++W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIVRTKAALQKNQKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|429953654|ref|ZP_19419490.1| hypothetical protein S91_00026 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|429453023|gb|EKZ88895.1| hypothetical protein S91_00026 [Escherichia coli O104:H4 str.
Ec12-0466]
Length = 617
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|423052835|ref|ZP_17041643.1| hypothetical protein EUNG_01241, partial [Escherichia coli O104:H4
str. 11-4632 C4]
gi|354920733|gb|EHF80665.1| hypothetical protein EUNG_01241, partial [Escherichia coli O104:H4
str. 11-4632 C4]
Length = 478
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|387606803|ref|YP_006095659.1| putative phage protein [Escherichia coli 042]
gi|284921103|emb|CBG34168.1| putative phage protein [Escherichia coli 042]
Length = 617
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|417832466|ref|ZP_12478942.1| hypothetical protein HUSEC41_06902, partial [Escherichia coli
O104:H4 str. 01-09591]
gi|340734879|gb|EGR63981.1| hypothetical protein HUSEC41_06902 [Escherichia coli O104:H4 str.
01-09591]
Length = 529
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|416774487|ref|ZP_11874086.1| hypothetical protein ECO5101_01015, partial [Escherichia coli
O157:H7 str. G5101]
gi|320641485|gb|EFX10907.1| hypothetical protein ECO5101_01015 [Escherichia coli O157:H7 str.
G5101]
Length = 392
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|301029200|ref|ZP_07192315.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|423702107|ref|ZP_17676566.1| hypothetical protein ESSG_01638 [Escherichia coli H730]
gi|432563448|ref|ZP_19800052.1| hypothetical protein A1SA_02098 [Escherichia coli KTE51]
gi|433051012|ref|ZP_20238294.1| hypothetical protein WII_04918 [Escherichia coli KTE120]
gi|299877880|gb|EFI86091.1| conserved hypothetical protein [Escherichia coli MS 196-1]
gi|385711069|gb|EIG48035.1| hypothetical protein ESSG_01638 [Escherichia coli H730]
gi|431096192|gb|ELE01766.1| hypothetical protein A1SA_02098 [Escherichia coli KTE51]
gi|431558905|gb|ELI32487.1| hypothetical protein WII_04918 [Escherichia coli KTE120]
Length = 617
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|432558415|ref|ZP_19795099.1| hypothetical protein A1S7_02066 [Escherichia coli KTE49]
gi|431092871|gb|ELD98548.1| hypothetical protein A1S7_02066 [Escherichia coli KTE49]
Length = 617
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|432559136|ref|ZP_19795814.1| hypothetical protein A1S7_02785 [Escherichia coli KTE49]
gi|431092187|gb|ELD97895.1| hypothetical protein A1S7_02785 [Escherichia coli KTE49]
Length = 617
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419390489|ref|ZP_13931321.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
gi|378242279|gb|EHY02237.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
Length = 618
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|260867994|ref|YP_003234396.1| hypothetical protein ECO111_1953 [Escherichia coli O111:H- str.
11128]
gi|257764350|dbj|BAI35845.1| hypothetical protein ECO111_1953 [Escherichia coli O111:H- str.
11128]
Length = 594
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|420324987|ref|ZP_14826759.1| hypothetical protein SFCCH060_1316 [Shigella flexneri CCH060]
gi|391254027|gb|EIQ13190.1| hypothetical protein SFCCH060_1316 [Shigella flexneri CCH060]
Length = 617
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419354509|ref|ZP_13895782.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13C]
gi|419359737|ref|ZP_13900961.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13D]
gi|419364099|ref|ZP_13905279.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13E]
gi|378205797|gb|EHX66206.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13C]
gi|378206130|gb|EHX66536.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13D]
gi|378218035|gb|EHX78308.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC13E]
Length = 617
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|417617749|ref|ZP_12268175.1| hypothetical protein ECG581_1557 [Escherichia coli G58-1]
gi|345379212|gb|EGX11126.1| hypothetical protein ECG581_1557 [Escherichia coli G58-1]
Length = 618
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|420285284|ref|ZP_14787499.1| hypothetical protein ECTW10246_1357 [Escherichia coli TW10246]
gi|390794147|gb|EIO61446.1| hypothetical protein ECTW10246_1357 [Escherichia coli TW10246]
Length = 616
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|170770020|ref|ZP_02904473.1| YjhS [Escherichia albertii TW07627]
gi|170770165|ref|ZP_02904618.1| YjhS [Escherichia albertii TW07627]
gi|419310945|ref|ZP_13852815.1| hypothetical protein ECDEC11E_1477 [Escherichia coli DEC11E]
gi|432703876|ref|ZP_19938991.1| hypothetical protein A31Q_01755 [Escherichia coli KTE171]
gi|170120966|gb|EDS89897.1| YjhS [Escherichia albertii TW07627]
gi|170121086|gb|EDS90017.1| YjhS [Escherichia albertii TW07627]
gi|378159543|gb|EHX20547.1| hypothetical protein ECDEC11E_1477 [Escherichia coli DEC11E]
gi|431245001|gb|ELF39298.1| hypothetical protein A31Q_01755 [Escherichia coli KTE171]
Length = 618
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|187733174|ref|YP_001880054.1| YjhS [Shigella boydii CDC 3083-94]
gi|218694797|ref|YP_002402464.1| hypothetical protein EC55989_1381 [Escherichia coli 55989]
gi|187430166|gb|ACD09440.1| YjhS [Shigella boydii CDC 3083-94]
gi|218351529|emb|CAU97239.1| conserved hypothetical protein [Escherichia coli 55989]
Length = 618
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|432495218|ref|ZP_19737030.1| hypothetical protein A173_02386 [Escherichia coli KTE214]
gi|431025995|gb|ELD39080.1| hypothetical protein A173_02386 [Escherichia coli KTE214]
Length = 617
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|416829594|ref|ZP_11898444.1| hypothetical protein ECOSU61_06259, partial [Escherichia coli
O157:H7 str. LSU-61]
gi|320668209|gb|EFX35063.1| hypothetical protein ECOSU61_06259 [Escherichia coli O157:H7 str.
LSU-61]
Length = 394
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F R+DL
Sbjct: 241 ATHAQQPALFTAMLAQFRADL 261
>gi|432679752|ref|ZP_19915141.1| hypothetical protein A1YW_01507 [Escherichia coli KTE143]
gi|431222950|gb|ELF20221.1| hypothetical protein A1YW_01507 [Escherichia coli KTE143]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|419028397|ref|ZP_13575583.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
gi|377882700|gb|EHU47237.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2C]
Length = 616
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|407468948|ref|YP_006784610.1| prophage protein [Escherichia coli O104:H4 str. 2009EL-2071]
gi|407482386|ref|YP_006779535.1| prophage protein [Escherichia coli O104:H4 str. 2011C-3493]
gi|410482939|ref|YP_006770485.1| prophage protein [Escherichia coli O104:H4 str. 2009EL-2050]
gi|417864808|ref|ZP_12509853.1| hypothetical protein C22711_1740 [Escherichia coli O104:H4 str.
C227-11]
gi|422999279|ref|ZP_16990035.1| hypothetical protein EUEG_01707 [Escherichia coli O104:H4 str.
09-7901]
gi|423002879|ref|ZP_16993625.1| hypothetical protein EUDG_00363 [Escherichia coli O104:H4 str.
04-8351]
gi|423023594|ref|ZP_17014297.1| hypothetical protein EUHG_01747 [Escherichia coli O104:H4 str.
11-4404]
gi|423028742|ref|ZP_17019435.1| hypothetical protein EUIG_01746 [Escherichia coli O104:H4 str.
11-4522]
gi|423029608|ref|ZP_17020296.1| hypothetical protein EUJG_00367 [Escherichia coli O104:H4 str.
11-4623]
gi|423037447|ref|ZP_17028121.1| hypothetical protein EUKG_01724 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423042562|ref|ZP_17033229.1| hypothetical protein EULG_01737 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423059802|ref|ZP_17048598.1| hypothetical protein EUOG_01742 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|429723652|ref|ZP_19258533.1| hypothetical protein MO3_01710 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429723996|ref|ZP_19258870.1| hypothetical protein MO5_04503 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429775026|ref|ZP_19307028.1| hypothetical protein C212_00303 [Escherichia coli O104:H4 str.
11-02030]
gi|429777705|ref|ZP_19309674.1| hypothetical protein C213_00300 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429781948|ref|ZP_19313875.1| hypothetical protein C214_00302 [Escherichia coli O104:H4 str.
11-02092]
gi|429788452|ref|ZP_19320332.1| hypothetical protein C215_00301 [Escherichia coli O104:H4 str.
11-02093]
gi|429793881|ref|ZP_19325722.1| hypothetical protein C216_00301 [Escherichia coli O104:H4 str.
11-02281]
gi|429797535|ref|ZP_19329339.1| hypothetical protein C217_00302 [Escherichia coli O104:H4 str.
11-02318]
gi|429802738|ref|ZP_19334499.1| hypothetical protein C218_00300 [Escherichia coli O104:H4 str.
11-02913]
gi|429810399|ref|ZP_19342100.1| hypothetical protein C219_00302 [Escherichia coli O104:H4 str.
11-03439]
gi|429814505|ref|ZP_19346174.1| hypothetical protein C220_00301 [Escherichia coli O104:H4 str.
11-04080]
gi|429819868|ref|ZP_19351493.1| hypothetical protein C221_00301 [Escherichia coli O104:H4 str.
11-03943]
gi|429912196|ref|ZP_19378152.1| hypothetical protein MO7_02626 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429918033|ref|ZP_19383973.1| hypothetical protein O7C_05012 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429923072|ref|ZP_19388993.1| hypothetical protein O7E_05015 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429923922|ref|ZP_19389838.1| hypothetical protein O7G_00782 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429932816|ref|ZP_19398710.1| hypothetical protein O7I_04696 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429934419|ref|ZP_19400309.1| hypothetical protein O7K_01232 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429940082|ref|ZP_19405956.1| hypothetical protein O7M_01783 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429947720|ref|ZP_19413575.1| hypothetical protein O7O_04321 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429950358|ref|ZP_19416206.1| hypothetical protein S7Y_01779 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|341918097|gb|EGT67711.1| hypothetical protein C22711_1740 [Escherichia coli O104:H4 str.
C227-11]
gi|354871955|gb|EHF32352.1| hypothetical protein EUDG_00363 [Escherichia coli O104:H4 str.
04-8351]
gi|354875456|gb|EHF35822.1| hypothetical protein EUEG_01707 [Escherichia coli O104:H4 str.
09-7901]
gi|354876003|gb|EHF36365.1| hypothetical protein EUHG_01747 [Escherichia coli O104:H4 str.
11-4404]
gi|354882198|gb|EHF42524.1| hypothetical protein EUIG_01746 [Escherichia coli O104:H4 str.
11-4522]
gi|354898669|gb|EHF58821.1| hypothetical protein EUKG_01724 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354900803|gb|EHF60936.1| hypothetical protein EUJG_00367 [Escherichia coli O104:H4 str.
11-4623]
gi|354902580|gb|EHF62697.1| hypothetical protein EULG_01737 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354914820|gb|EHF74801.1| hypothetical protein EUOG_01742 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|406778101|gb|AFS57525.1| prophage protein [Escherichia coli O104:H4 str. 2009EL-2050]
gi|407054683|gb|AFS74734.1| prophage protein [Escherichia coli O104:H4 str. 2011C-3493]
gi|407064983|gb|AFS86030.1| prophage protein [Escherichia coli O104:H4 str. 2009EL-2071]
gi|429350839|gb|EKY87563.1| hypothetical protein C212_00303 [Escherichia coli O104:H4 str.
11-02030]
gi|429358040|gb|EKY94710.1| hypothetical protein C213_00300 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429359443|gb|EKY96108.1| hypothetical protein C214_00302 [Escherichia coli O104:H4 str.
11-02092]
gi|429369188|gb|EKZ05769.1| hypothetical protein C215_00301 [Escherichia coli O104:H4 str.
11-02093]
gi|429371897|gb|EKZ08447.1| hypothetical protein C216_00301 [Escherichia coli O104:H4 str.
11-02281]
gi|429373848|gb|EKZ10388.1| hypothetical protein C217_00302 [Escherichia coli O104:H4 str.
11-02318]
gi|429383952|gb|EKZ20409.1| hypothetical protein C219_00302 [Escherichia coli O104:H4 str.
11-03439]
gi|429389242|gb|EKZ25663.1| hypothetical protein C221_00301 [Escherichia coli O104:H4 str.
11-03943]
gi|429390182|gb|EKZ26598.1| hypothetical protein C218_00300 [Escherichia coli O104:H4 str.
11-02913]
gi|429394789|gb|EKZ31162.1| hypothetical protein MO3_01710 [Escherichia coli O104:H4 str.
Ec11-9450]
gi|429400474|gb|EKZ36789.1| hypothetical protein C220_00301 [Escherichia coli O104:H4 str.
11-04080]
gi|429401578|gb|EKZ37878.1| hypothetical protein MO5_04503 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429408318|gb|EKZ44557.1| hypothetical protein O7C_05012 [Escherichia coli O104:H4 str.
Ec11-4984]
gi|429417186|gb|EKZ53337.1| hypothetical protein O7I_04696 [Escherichia coli O104:H4 str.
Ec11-4987]
gi|429417261|gb|EKZ53411.1| hypothetical protein O7G_00782 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429422014|gb|EKZ58135.1| hypothetical protein O7K_01232 [Escherichia coli O104:H4 str.
Ec11-4988]
gi|429425827|gb|EKZ61916.1| hypothetical protein O7M_01783 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429431916|gb|EKZ67957.1| hypothetical protein O7E_05015 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429442228|gb|EKZ78187.1| hypothetical protein O7O_04321 [Escherichia coli O104:H4 str.
Ec11-6006]
gi|429451570|gb|EKZ87459.1| hypothetical protein S7Y_01779 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|429454417|gb|EKZ90277.1| hypothetical protein MO7_02626 [Escherichia coli O104:H4 str.
Ec11-9941]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|417124924|ref|ZP_11973314.1| PF03629 domain protein [Escherichia coli 97.0246]
gi|386145961|gb|EIG92413.1| PF03629 domain protein [Escherichia coli 97.0246]
Length = 488
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 9 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 68
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 69 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 124
Query: 182 -FTDLRSDL 189
T R DL
Sbjct: 125 MLTQFRVDL 133
>gi|419135793|ref|ZP_13680599.1| hypothetical protein ECDEC5E_1286 [Escherichia coli DEC5E]
gi|377986942|gb|EHV50132.1| hypothetical protein ECDEC5E_1286 [Escherichia coli DEC5E]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419134429|ref|ZP_13679246.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|419248191|ref|ZP_13790795.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
gi|377969287|gb|EHV32666.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC5D]
gi|378099190|gb|EHW60909.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|300937236|ref|ZP_07152084.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|432411431|ref|ZP_19654104.1| hypothetical protein WG9_01913 [Escherichia coli KTE39]
gi|300457711|gb|EFK21204.1| conserved hypothetical protein [Escherichia coli MS 21-1]
gi|430936151|gb|ELC56442.1| hypothetical protein WG9_01913 [Escherichia coli KTE39]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|386623823|ref|YP_006143551.1| hypothetical protein CE10_1459 [Escherichia coli O7:K1 str. CE10]
gi|432391481|ref|ZP_19634329.1| hypothetical protein WE9_01799 [Escherichia coli KTE21]
gi|432543406|ref|ZP_19780253.1| hypothetical protein A197_01985 [Escherichia coli KTE236]
gi|432548896|ref|ZP_19785668.1| hypothetical protein A199_02355 [Escherichia coli KTE237]
gi|432630975|ref|ZP_19866910.1| hypothetical protein A1UW_01349 [Escherichia coli KTE80]
gi|433004778|ref|ZP_20193212.1| hypothetical protein A17S_02347 [Escherichia coli KTE227]
gi|433153397|ref|ZP_20338359.1| hypothetical protein WKS_01330 [Escherichia coli KTE176]
gi|349737561|gb|AEQ12267.1| unknown protein encoded within prophage [Escherichia coli O7:K1
str. CE10]
gi|430920791|gb|ELC41667.1| hypothetical protein WE9_01799 [Escherichia coli KTE21]
gi|431074629|gb|ELD82177.1| hypothetical protein A197_01985 [Escherichia coli KTE236]
gi|431080191|gb|ELD86996.1| hypothetical protein A199_02355 [Escherichia coli KTE237]
gi|431171826|gb|ELE71981.1| hypothetical protein A1UW_01349 [Escherichia coli KTE80]
gi|431516238|gb|ELH93851.1| hypothetical protein A17S_02347 [Escherichia coli KTE227]
gi|431676711|gb|ELJ42795.1| hypothetical protein WKS_01330 [Escherichia coli KTE176]
Length = 617
Score = 46.2 bits (108), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|432352417|ref|ZP_19595713.1| hypothetical protein WCA_01400, partial [Escherichia coli KTE2]
gi|430879346|gb|ELC02695.1| hypothetical protein WCA_01400, partial [Escherichia coli KTE2]
Length = 563
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 83 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 142
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 143 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 191
>gi|424108368|ref|ZP_17842921.1| hypothetical protein EC93001_1294, partial [Escherichia coli
93-001]
gi|429006694|ref|ZP_19074568.1| hypothetical protein EC951288_1137, partial [Escherichia coli
95.1288]
gi|390668757|gb|EIN45512.1| hypothetical protein EC93001_1294, partial [Escherichia coli
93-001]
gi|427273013|gb|EKW37714.1| hypothetical protein EC951288_1137, partial [Escherichia coli
95.1288]
Length = 116
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 43/100 (43%), Gaps = 20/100 (20%)
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ---------------- 128
D+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 1 DLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQDS 60
Query: 129 --WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 61 ARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFD 100
>gi|419933687|ref|ZP_14450870.1| unknown protein encoded within prophage, partial [Escherichia coli
576-1]
gi|388411465|gb|EIL71640.1| unknown protein encoded within prophage, partial [Escherichia coli
576-1]
Length = 579
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 99 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 158
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 159 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 207
>gi|419142692|ref|ZP_13687436.1| hypothetical protein ECDEC6A_2334, partial [Escherichia coli DEC6A]
gi|377995334|gb|EHV58451.1| hypothetical protein ECDEC6A_2334, partial [Escherichia coli DEC6A]
Length = 424
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 82/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +PN I LVPC GG+
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTRGAE 184
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
+ ++W G LY+ +I R + AL+ + AV W QGE D A
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
Y ++ +F R+DL
Sbjct: 241 ATYAQQPALFTAMLKQFRADL 261
>gi|419145949|ref|ZP_13690651.1| hypothetical protein ECDEC6A_5656 [Escherichia coli DEC6A]
gi|377984680|gb|EHV47910.1| hypothetical protein ECDEC6A_5656 [Escherichia coli DEC6A]
Length = 617
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|419017755|ref|ZP_13565073.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1E]
gi|377864713|gb|EHU29506.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1E]
Length = 616
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|119026531|ref|YP_910376.1| putative sialic acid-specific acetylesterase [Bifidobacterium
adolescentis ATCC 15703]
gi|118766115|dbj|BAF40294.1| putative sialic acid-specific acetylesterase [Bifidobacterium
adolescentis ATCC 15703]
Length = 551
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 97/235 (41%), Gaps = 35/235 (14%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTW-DGIVPPQCQPNPSILRLTAKLKWVLA-HEPLH 83
+ + AGQSNM T N + +GI+ P P + + +K+V+A H+ +
Sbjct: 150 VFVAAGQSNM--ELNYTQYYPENSANFGNGIIKETDLPKPLVDK---NVKFVIADHDVKN 204
Query: 84 AD-------------IDVNKTNGVGPGL---PFANAVLTKVPNFGVIGLVPCAIGGTNIS 127
D ++ + TN + FA + K PN V G++ A GGT I
Sbjct: 205 TDFPLANVNLNAGAWLNADSTNSLHLSYLTQQFALQLRAKHPNVPV-GIIQTAWGGTPIR 263
Query: 128 QWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRS 187
+ +G +Y I A + VLWYQG D +N A Y+ + R+
Sbjct: 264 RHVRGGDIYANHI-----APLKDFHVAGVLWYQGCDDAMNFATATEYESQMTALINQYRT 318
Query: 188 DLQSPLLPIIRVALAS-GEGPFIEIVRKAQLSSDLPNVRCVD----AMGLPLEPD 237
LP + V LA + + VR+AQ ++ L N D AM + L+ D
Sbjct: 319 VFGRKNLPFLYVQLARWTNYQYTQNVREAQRTT-LDNANLQDRSNVAMTVSLDTD 372
>gi|215486362|ref|YP_002328793.1| hypothetical protein E2348C_1246 [Escherichia coli O127:H6 str.
E2348/69]
gi|312968765|ref|ZP_07782972.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|417755087|ref|ZP_12403177.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2B]
gi|418997186|ref|ZP_13544784.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1A]
gi|419006951|ref|ZP_13554403.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1C]
gi|419023394|ref|ZP_13570632.1| hypothetical protein ECDEC2A_1525 [Escherichia coli DEC2A]
gi|419038999|ref|ZP_13586050.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
gi|215264434|emb|CAS08794.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|312286167|gb|EFR14080.1| conserved hypothetical protein [Escherichia coli 2362-75]
gi|377844850|gb|EHU09882.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1A]
gi|377849278|gb|EHU14253.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC1C]
gi|377867360|gb|EHU32122.1| hypothetical protein ECDEC2A_1525 [Escherichia coli DEC2A]
gi|377877652|gb|EHU42245.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2B]
gi|377896729|gb|EHU61120.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2E]
Length = 616
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|425299867|ref|ZP_18689855.1| hypothetical protein EC07798_1761, partial [Escherichia coli 07798]
gi|408219068|gb|EKI43243.1| hypothetical protein EC07798_1761, partial [Escherichia coli 07798]
Length = 572
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|224540303|ref|ZP_03680842.1| hypothetical protein BACCELL_05216, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224518095|gb|EEF87200.1| hypothetical protein BACCELL_05216 [Bacteroides cellulosilyticus
DSM 14838]
Length = 157
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 69/154 (44%), Gaps = 9/154 (5%)
Query: 113 VIGLVPCAIGGTNISQWRKGSS--LYEQMIQRAQVALRGGGTIR--AVLWYQGESDTVNL 168
I +V A GGT++ ++ K S YE I R + AL+ + A++W+QGES N
Sbjct: 6 TIFIVVNARGGTSLERFMKNDSTGYYESTISRIKQALKKYPDLELGAIIWHQGES---NR 62
Query: 169 EDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK-AQLSSDLPNVRCV 227
+ K Y D R+DL P LP I + + IV++ A + + +
Sbjct: 63 DYYKDYIVHLRTLIKDYRADLNLPDLPFIAGEMGRWNPTYTNIVKQIAMIPDSIDKAYLI 122
Query: 228 DAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRVN 261
+ GL D H + +Q N ++ + + ++
Sbjct: 123 SSEGLG-NIDEFHFDSNSQEILGNRYAEKYIEIS 155
>gi|422994087|ref|ZP_16984851.1| hypothetical protein EUBG_01738 [Escherichia coli O104:H4 str.
C236-11]
gi|354865162|gb|EHF25591.1| hypothetical protein EUBG_01738 [Escherichia coli O104:H4 str.
C236-11]
Length = 617
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|422992139|ref|ZP_16982910.1| hypothetical protein EUAG_01732 [Escherichia coli O104:H4 str.
C227-11]
gi|354857372|gb|EHF17828.1| hypothetical protein EUAG_01732 [Escherichia coli O104:H4 str.
C227-11]
Length = 617
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|419033968|ref|ZP_13581063.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2D]
gi|377882587|gb|EHU47126.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC2D]
Length = 616
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 245
>gi|330998042|ref|ZP_08321873.1| hypothetical protein HMPREF9442_02977 [Paraprevotella xylaniphila
YIT 11841]
gi|329569343|gb|EGG51123.1| hypothetical protein HMPREF9442_02977 [Paraprevotella xylaniphila
YIT 11841]
Length = 546
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 12/104 (11%)
Query: 147 LRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL-------QSPLLPIIRV 199
L GG ++A++W+QGESD D Y+ +M T +R + + LP I
Sbjct: 175 LPGGYDVKAIMWHQGESDRTKAGD--YYRNFKEM-ITFMRERIYAVTGKEKDKTLPFIFG 231
Query: 200 ALASGEGPFIEIVRKAQL--SSDLPNVRCVDAMGLPLEPDGLHL 241
+ + +V AQL + +LPNV +D L+ DGLH
Sbjct: 232 TVPHASRQYDPLVEAAQLQVARELPNVHVIDLSDAGLQADGLHF 275
>gi|194430589|ref|ZP_03063049.1| YjhS [Escherichia coli B171]
gi|194411370|gb|EDX27732.1| YjhS [Escherichia coli B171]
Length = 620
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 140 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 199
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 200 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 248
>gi|309793311|ref|ZP_07687738.1| conserved hypothetical protein [Escherichia coli MS 145-7]
gi|308122898|gb|EFO60160.1| conserved hypothetical protein [Escherichia coli MS 145-7]
Length = 574
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 94 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 153
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 154 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 202
>gi|420279490|ref|ZP_14781754.1| hypothetical protein ECTW06591_1025 [Escherichia coli TW06591]
gi|390784665|gb|EIO52226.1| hypothetical protein ECTW06591_1025 [Escherichia coli TW06591]
Length = 616
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|420380629|ref|ZP_14880091.1| hypothetical protein SD22575_2519 [Shigella dysenteriae 225-75]
gi|391301775|gb|EIQ59656.1| hypothetical protein SD22575_2519 [Shigella dysenteriae 225-75]
Length = 542
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 62 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 121
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 122 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 170
>gi|225388891|ref|ZP_03758615.1| hypothetical protein CLOSTASPAR_02631 [Clostridium asparagiforme
DSM 15981]
gi|225045046|gb|EEG55292.1| hypothetical protein CLOSTASPAR_02631 [Clostridium asparagiforme
DSM 15981]
Length = 260
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 99/234 (42%), Gaps = 56/234 (23%)
Query: 22 QQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEP 81
++ L + GQSNMAGR G+ PQ + L A ++ +P
Sbjct: 7 KEHDLFLFLGQSNMAGR---------------GVTSPQWPESAPALTPGAGYEYRAISDP 51
Query: 82 --LH---ADIDVNKTNGVG---PGLP-------FANAVL--TKVPNFGVIGLVPCAIGGT 124
LH VN+ N G PG+ F NA TK+P VIG V + GG+
Sbjct: 52 GRLHPASEPFGVNENNPDGICEPGMKTGSMVTAFINAYYARTKIP---VIG-VSASKGGS 107
Query: 125 NISQWR-KGSSLYE---------QMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLY 174
I QW+ G L + + ++ ++ +R R +LW QGE+D + Y
Sbjct: 108 AIGQWQGDGDYLSDALMRLKRTGKFLKEQEITVR----HRYMLWCQGETDGDLGTSPEDY 163
Query: 175 KERSDMFFTDLRSD-LQSPLLPIIRVALASGEGPF-IEIVRKAQLS--SDLPNV 224
K R F+ LR +++ L I + +G F +R+AQL +LP+V
Sbjct: 164 KARFTNMFSQLREKGIETCFL--IAIGEYNGRKGFDYSEIRRAQLELPKELPDV 215
>gi|425174117|ref|ZP_18572299.1| hypothetical protein ECFDA504_2432, partial [Escherichia coli
FDA504]
gi|408093678|gb|EKH26740.1| hypothetical protein ECFDA504_2432, partial [Escherichia coli
FDA504]
Length = 117
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 43/100 (43%), Gaps = 20/100 (20%)
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ---------------- 128
D+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 13 DLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGIFSESTGASQDS 72
Query: 129 --WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
W G LY+ +I R + AL+ + AV W QGE D
Sbjct: 73 ARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFD 112
>gi|419333774|ref|ZP_13875321.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
gi|378187063|gb|EHX47679.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12D]
Length = 620
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 140 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 199
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 200 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 248
>gi|415803407|ref|ZP_11500505.1| hypothetical protein ECE128010_4246 [Escherichia coli E128010]
gi|419316463|ref|ZP_13858279.1| hypothetical protein ECDEC12A_1764 [Escherichia coli DEC12A]
gi|419321870|ref|ZP_13863601.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
gi|419328643|ref|ZP_13870261.1| hypothetical protein ECDEC12C_1845 [Escherichia coli DEC12C]
gi|419338833|ref|ZP_13880318.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
gi|323159467|gb|EFZ45448.1| hypothetical protein ECE128010_4246 [Escherichia coli E128010]
gi|378171965|gb|EHX32826.1| hypothetical protein ECDEC12A_1764 [Escherichia coli DEC12A]
gi|378172709|gb|EHX33558.1| hypothetical protein ECDEC12C_1845 [Escherichia coli DEC12C]
gi|378172805|gb|EHX33653.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12B]
gi|378193356|gb|EHX53897.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC12E]
Length = 620
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 140 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 199
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 200 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNASYAQ 248
>gi|419080483|ref|ZP_13625946.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
gi|377929396|gb|EHU93293.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4A]
Length = 616
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|420309130|ref|ZP_14811084.1| hypothetical protein ECEC1738_1957 [Escherichia coli EC1738]
gi|390902108|gb|EIP61241.1| hypothetical protein ECEC1738_1957 [Escherichia coli EC1738]
Length = 616
Score = 45.8 bits (107), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADAGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|419158668|ref|ZP_13703181.1| hypothetical protein ECDEC6D_1475 [Escherichia coli DEC6D]
gi|419163760|ref|ZP_13708222.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6E]
gi|378010125|gb|EHV73071.1| hypothetical protein ECDEC6D_1475 [Escherichia coli DEC6D]
gi|378012563|gb|EHV75491.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC6E]
Length = 458
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
W G LY+ ++ R + AL+ + A+ W QGE D N Y ++ F
Sbjct: 197 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMTNAS----YAQQPAAFLA 252
Query: 184 ---DLRSDL 189
R+DL
Sbjct: 253 MVQQFRADL 261
>gi|419067589|ref|ZP_13614002.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3E]
gi|377919025|gb|EHU83069.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3E]
Length = 616
Score = 45.4 bits (106), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|317022254|gb|ADU86928.1| putative acetyl xylan esterase [uncultured bacterium]
Length = 292
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 61/267 (22%), Positives = 98/267 (36%), Gaps = 72/267 (26%)
Query: 18 KCQYQQQQLIILA-GQSNMAGRG------------------GVTNDTRTNKL-TWDGIVP 57
+C+ I L GQSNM G V + R K+ W VP
Sbjct: 27 ECKKDSNFYIFLCFGQSNMEGAAKPEAQDLVSPGPRFLLMPAVDDAERGRKMGEWCEAVP 86
Query: 58 PQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLV 117
P C+PN G+ P F ++ +P IG++
Sbjct: 87 PLCRPN----------------------------TGLTPADWFGRTMVASLPENIKIGVI 118
Query: 118 PCAIGGTNIS----------------QWRKG------SSLYEQMIQRAQVALRGGGTIRA 155
AIGG I W KG + YE+++ A+ A + G ++
Sbjct: 119 HVAIGGIKIEGFMKDKIGDYVKTEAPDWMKGMLKSYDDNPYERLVMLAKKAQKEG-VVKG 177
Query: 156 VLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK- 214
+L +QGES+T + E AK ++ + DL+ ++ L + A G+G I ++
Sbjct: 178 ILMHQGESNTGDPEWAKKVQQVYNALCKDLKLKPKNVPLFAGNIVQAGGQGVCIGCKKQI 237
Query: 215 AQLSSDLPNVRCVDAMGLPLEPDGLHL 241
+L +P + + G PD LH
Sbjct: 238 DELPLTIPTAHIISSDGCSNGPDRLHF 264
>gi|260867169|ref|YP_003233571.1| hypothetical protein ECO111_1070 [Escherichia coli O111:H- str.
11128]
gi|415824531|ref|ZP_11512820.1| hypothetical protein ECOK1180_5652 [Escherichia coli OK1180]
gi|417192917|ref|ZP_12014764.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|417590733|ref|ZP_12241447.1| hypothetical protein EC253486_1333 [Escherichia coli 2534-86]
gi|257763525|dbj|BAI35020.1| hypothetical protein ECO111_1070 [Escherichia coli O111:H- str.
11128]
gi|323175909|gb|EFZ61503.1| hypothetical protein ECOK1180_5652 [Escherichia coli OK1180]
gi|345344172|gb|EGW76547.1| hypothetical protein EC253486_1333 [Escherichia coli 2534-86]
gi|386190098|gb|EIH78846.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 616
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 54/129 (41%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 137 ADLSKGQYGCVGQGLYIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSADTGASQD 196
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
W G LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 197 SARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 252
Query: 182 -FTDLRSDL 189
R+DL
Sbjct: 253 MLAQFRADL 261
>gi|419254614|ref|ZP_13797141.1| hypothetical protein ECDEC10A_2125, partial [Escherichia coli
DEC10A]
gi|378102653|gb|EHW64327.1| hypothetical protein ECDEC10A_2125, partial [Escherichia coli
DEC10A]
Length = 235
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 52/171 (30%), Positives = 70/171 (40%), Gaps = 35/171 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGE 162
W G LY+ +I R + AL+ + AV W QGE
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQGE 235
>gi|420305514|ref|ZP_14807505.1| hypothetical protein ECTW10119_4215, partial [Escherichia coli
TW10119]
gi|390815213|gb|EIO81757.1| hypothetical protein ECTW10119_4215, partial [Escherichia coli
TW10119]
Length = 405
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 27/129 (20%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS---------- 133
AD+ + VG GL A +L +PN I LVPC GG+ +Q +G+
Sbjct: 28 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAEGTFSESTGASQD 87
Query: 134 --------SLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF-- 181
LY+ +I R + AL+ + AV W QGE D A + ++ +F
Sbjct: 88 SARWGWAKPLYQDLISRTKAALQKNPKNVLLAVCWMQGEFDM----SAATHAQQPALFTA 143
Query: 182 -FTDLRSDL 189
T R+DL
Sbjct: 144 MLTQFRADL 152
>gi|417160170|ref|ZP_11997089.1| PF03629 domain protein [Escherichia coli 99.0741]
gi|386174661|gb|EIH46654.1| PF03629 domain protein [Escherichia coli 99.0741]
Length = 670
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 106 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 152
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 153 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 212
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ N ++W LY+ +I R + AL R AV+W
Sbjct: 213 CRGGSAFTAGADGTYSDSAGASENSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 272
Query: 160 QGES--DTVNLEDAKLYKERSDMFFTDL 185
QGE D E + L+ + F TDL
Sbjct: 273 QGEFDIDAKPTEHSALFLAMVEKFRTDL 300
>gi|419136763|ref|ZP_13681562.1| hypothetical protein ECDEC5E_2255 [Escherichia coli DEC5E]
gi|377985097|gb|EHV48319.1| hypothetical protein ECDEC5E_2255 [Escherichia coli DEC5E]
Length = 625
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 55/130 (42%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R +VAL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKVALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFA 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|406836172|ref|ZP_11095766.1| hypothetical protein SpalD1_31174 [Schlesneria paludicola DSM
18645]
Length = 370
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 24/150 (16%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+L ++AGQS AG ND + PQ + + +W LA++P
Sbjct: 142 ELFVVAGQSYAAG----ANDELQK------VADPQGRVSAYDWHTK---RWQLANDP--- 185
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQ 144
+V + P L +VP +G V A+GGT+ QW L+++++
Sbjct: 186 QPNVGDGGTIWPALGDLLVPTLRVP----VGFVNVAVGGTSTKQWMPDGELHKRLVAVGN 241
Query: 145 VALRGGGTIRAVLWYQGESDTVNLEDAKLY 174
G RAVLW QGESD + +Y
Sbjct: 242 DV----GAFRAVLWQQGESDVIEKTPTDVY 267
>gi|154488189|ref|ZP_02029306.1| hypothetical protein BIFADO_01761 [Bifidobacterium adolescentis
L2-32]
gi|154083662|gb|EDN82707.1| hypothetical protein BIFADO_01761 [Bifidobacterium adolescentis
L2-32]
Length = 491
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 62/143 (43%), Gaps = 12/143 (8%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
FA + K PN IG++ A GGT I + +G +Y I A G + VLWY
Sbjct: 177 FAMQLRAKHPNVP-IGIIQTAWGGTPIRRHVQGGDIYANHI-----APLKGFHVAGVLWY 230
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS-GEGPFIEIVRKAQLS 218
QG D N A Y+ + R+ LP + V LA + + VR+AQ +
Sbjct: 231 QGCDDANNYGTALQYESQMTALINQYRNVFGRKDLPFLYVQLARWTNYQYTQNVREAQRT 290
Query: 219 SDLPNVRCVD----AMGLPLEPD 237
+ L N D AM + L+ D
Sbjct: 291 T-LDNANLQDRSNVAMTVSLDTD 312
>gi|218694476|ref|YP_002402143.1| phage protein [Escherichia coli 55989]
gi|218351208|emb|CAU96912.1| conserved hypothetical protein from phage origin [Escherichia coli
55989]
Length = 620
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 140 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 199
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 200 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMKNASYAQ 248
>gi|419395616|ref|ZP_13936398.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15B]
gi|419400970|ref|ZP_13941701.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|419406182|ref|ZP_13946881.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
gi|419412334|ref|ZP_13952996.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
gi|378250228|gb|EHY10136.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15B]
gi|378251275|gb|EHY11176.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|378257023|gb|EHY16868.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
gi|378260011|gb|EHY19817.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
Length = 620
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 140 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 199
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 200 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMKNASYAQ 248
>gi|423221840|ref|ZP_17208310.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645258|gb|EIY38987.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1074
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 64/253 (25%), Positives = 100/253 (39%), Gaps = 58/253 (22%)
Query: 25 QLIILAGQSNMAGRGGVTN-DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + GQSNM G V DT + + C P++ R K W A PL
Sbjct: 24 HIYLCLGQSNMEGNAKVEEQDTVAVDSRFQVLAAVDC---PNLGR--TKGNWYKAVPPL- 77
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------ 131
G+ PG F A++ +P+ IG++ A+GG I + K
Sbjct: 78 ----ARCYTGLTPGDYFGRAMVANLPSNVRIGIINVAVGGCRIELFDKDNYQSYVATSPD 133
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G + Y ++++ A++A + G I+ VL +QGES+T N +D L + +
Sbjct: 134 WLKNMVKEYGGNPYARLVEMAKLAQK-DGVIKGVLLHQGESNT-NDKDWPL---KVKGVY 188
Query: 183 TDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQ-------------LSSDLPNVRCVDA 229
+L +DL L V L +G E+V Q L +P + +
Sbjct: 189 DNLLNDLG---LSAANVPLLAG-----EVVHADQNGVCASMNTIIDSLPQVIPTAHVISS 240
Query: 230 MGLPLEPDGLHLT 242
G P D LH T
Sbjct: 241 AGCPAAFDNLHFT 253
>gi|295132889|ref|YP_003583565.1| esterase [Zunongwangia profunda SM-A87]
gi|294980904|gb|ADF51369.1| putative esterase [Zunongwangia profunda SM-A87]
Length = 896
Score = 45.1 bits (105), Expect = 0.041, Method: Composition-based stats.
Identities = 62/262 (23%), Positives = 103/262 (39%), Gaps = 36/262 (13%)
Query: 5 LLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVT-NDTRTNKLTWDGIVPPQCQPN 63
LL ++ A+ + Q + + GQSNM G + DT + + +C
Sbjct: 6 LLLFSMLLFAFSARAQDPNFHIYLAFGQSNMEGHAKIEPQDTVAISERFKVLSAVEC--- 62
Query: 64 PSILRLTAKL-KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIG 122
L L K +W A PL G+ P F ++ +P+ +G++ A+G
Sbjct: 63 ---LNLDRKKGEWYTAKPPL-----CRCNTGLTPTDYFGREMVENLPDSIKVGIINVAVG 114
Query: 123 GTNIS---------------QWRKG------SSLYEQMIQRAQVALRGGGTIRAVLWYQG 161
G I W K + Y+++++ A++ + G I+ +L +QG
Sbjct: 115 GCKIELFDKENYESYVASAPGWLKNMVKEYDGNPYKRLVEMAKIGQKRG-VIKGILLHQG 173
Query: 162 ESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPLLPIIRVALASGEGPFIEIVRKAQLSSD 220
ES+T + + K D DL+ D ++PLL V+ G A+L
Sbjct: 174 ESNTGDTLWPQKVKGVYDNLIKDLKLDPKKTPLLAGEMVSKEEGGACASMNTIIAKLPEV 233
Query: 221 LPNVRCVDAMGLPLEPDGLHLT 242
LPN V + G D LH T
Sbjct: 234 LPNAYVVSSEGCTAVNDHLHFT 255
>gi|116221995|ref|YP_794050.1| hypothetical protein Stx2-86_gp03 [Stx2-converting phage 86]
gi|115500805|dbj|BAF34035.1| hypothetical protein [Stx2-converting phage 86]
Length = 631
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 47/109 (43%), Gaps = 20/109 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--------------- 128
AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 151 ADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGTEGTFSESTGASQD 210
Query: 129 ---WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAK 172
W G LY+ ++ R + AL+ + A+ W QGE D N A+
Sbjct: 211 SARWGVGKPLYQDLLFRTKAALQKNPKNVLLAICWMQGEFDMKNASYAQ 259
>gi|317022264|gb|ADU86933.1| putative acetyl xylan esterase [uncultured bacterium]
Length = 270
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 95/258 (36%), Gaps = 71/258 (27%)
Query: 26 LIILAGQSNMAGRG------------------GVTNDTRTNKL-TWDGIVPPQCQPNPSI 66
+ + GQSNM G V + R K+ W VPP C+PN
Sbjct: 14 IFLCFGQSNMEGAAKPEAQDLVSPGPRFLLMPAVDDAERGRKMGEWCEAVPPLCRPN--- 70
Query: 67 LRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI 126
G+ P F ++ +P IG++ AIGG I
Sbjct: 71 -------------------------TGLTPADWFGRTMVASLPENIKIGVIHVAIGGIKI 105
Query: 127 S----------------QWRKG------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESD 164
W KG + YE+++ A+ A + G ++ +L +QGES+
Sbjct: 106 EGFMKDKIGDYVKTEAPDWMKGMLKSYDDNPYERLVMLAKKAQKEG-VVKGILMHQGESN 164
Query: 165 TVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRK-AQLSSDLPN 223
T + E AK ++ + DL+ ++ L + A G+G I ++ +L +P
Sbjct: 165 TGDPEWAKKVQQVYNALCKDLKLKPKNVPLFAGNIVQAGGQGVCIGCKKQIDELPLTIPT 224
Query: 224 VRCVDAMGLPLEPDGLHL 241
+ + G PD LH
Sbjct: 225 AHIISSDGCSNGPDRLHF 242
>gi|189467819|ref|ZP_03016604.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
gi|189436083|gb|EDV05068.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
Length = 1061
Score = 44.7 bits (104), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 96/243 (39%), Gaps = 38/243 (15%)
Query: 25 QLIILAGQSNMAGRGGVTN-DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + GQSNM G V DT + + C P++ R K W A PL
Sbjct: 11 HIYLCLGQSNMEGNAKVEEQDTVAVDSRFQVLAAVDC---PNLGR--TKGNWYKAVPPL- 64
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------ 131
G+ PG F A++ +P+ +G++ A+GG I + K
Sbjct: 65 ----ARCYTGLTPGDYFGRAMVANLPSNVQVGIINVAVGGCKIELFDKDNYQSYVETSPD 120
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G + Y ++++ A++A + G I+ +L +QGES+T + + K D
Sbjct: 121 WLKNMVKEYGGNPYARLVEMAKLAQK-DGVIKGILLHQGESNTNDKDWPSKVKGVYDNLL 179
Query: 183 TDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSS---DLPNVRCVDAMGLPLEPDGL 239
DL L + +P++ + + I + S +P + + G P D L
Sbjct: 180 KDL--GLSAADVPLLAGEVVHADQNGICASMNTIIDSLPQVIPTAHVISSAGCPAAFDNL 237
Query: 240 HLT 242
H T
Sbjct: 238 HFT 240
>gi|81239397|gb|ABB60215.1| hypothetical protein [Escherichia coli]
gi|81239404|gb|ABB60221.1| hypothetical protein [Phage 258-320]
gi|81239411|gb|ABB60227.1| hypothetical protein [Phage 258-320]
gi|81239418|gb|ABB60233.1| hypothetical protein [Phage 258-320]
Length = 631
Score = 44.7 bits (104), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +PN I LVPC GG+
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTTGAD 184
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 185 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 237
>gi|419869826|ref|ZP_14392000.1| hypothetical protein ECO9450_11276, partial [Escherichia coli
O103:H2 str. CVM9450]
gi|388341270|gb|EIL07401.1| hypothetical protein ECO9450_11276, partial [Escherichia coli
O103:H2 str. CVM9450]
Length = 234
Score = 44.7 bits (104), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 69/170 (40%), Gaps = 35/170 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQG 161
W G LY+ +I R + AL+ + AV W QG
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQG 234
>gi|255033937|ref|YP_003084558.1| hypothetical protein Dfer_0122 [Dyadobacter fermentans DSM 18053]
gi|254946693|gb|ACT91393.1| protein of unknown function DUF303 acetylesterase putative
[Dyadobacter fermentans DSM 18053]
Length = 618
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 96/258 (37%), Gaps = 49/258 (18%)
Query: 3 AWLLCLILVSEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQP 62
W LC ILV+ ++ + I+AGQSN G ++ KL +P
Sbjct: 110 GWYLCEILVNGIVYTANKFGVGDVFIIAGQSNAQGI-----KDQSYKLPSGAGIPEWVVG 164
Query: 63 NPSILRLTAKLKWVLAH-EPLHADIDVNKTNGVGP--GLPFANAVLTKV---PNFGV-IG 115
T KL + PL+ D+ K +GP +A VL K+ N G+ +
Sbjct: 165 ASEDKTCTRKLPESFTNLFPLNTADDMKKHGPLGPTGNSVWAYGVLGKLISDANGGMPVA 224
Query: 116 LVPCAIGGTNISQWRKGSS--------------------------LYEQMIQRAQVALRG 149
A G+++++W++G+ Y Q + AL
Sbjct: 225 FFNAATAGSSVTEWKQGADGVEAKHPYTGAQVCLGYMGGSVIPKDYYGQPYTALKTALNY 284
Query: 150 GGT---IRAVLWYQGESDT-------VNLEDAKLYKERSDMFFTDLRSDLQSPLLP-IIR 198
G+ +RAVLW+QGE+D A Y+ + RSD +P L I
Sbjct: 285 YGSLYGVRAVLWHQGEADADPNVNAIYKASSAADYQSKLQAVIAKSRSDFAAPNLTWYIC 344
Query: 199 VALASGEGPFIEIVRKAQ 216
A S GP +R Q
Sbjct: 345 KATISKFGPVNATIRTGQ 362
>gi|417662134|ref|ZP_12311715.1| hypothetical protein ECAA86_01707 [Escherichia coli AA86]
gi|330911352|gb|EGH39862.1| hypothetical protein ECAA86_01707 [Escherichia coli AA86]
Length = 609
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 54/124 (43%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ +
Sbjct: 105 ADLAKGQYGTVGQGLHIAKKLLPYIPQNAGILLVPCCRGGSAFTTGDDGSFSEASGASAD 164
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R + AL + AV+W QGE+D + + + L+ F
Sbjct: 165 SSRWGAGKPLYQDLVSRTRAALAKNPKNKLLAVVWMQGEADLASGSQQHNGLFTTMVQQF 224
Query: 182 FTDL 185
TDL
Sbjct: 225 RTDL 228
>gi|115345639|ref|YP_771820.1| hypothetical protein RD1_B0003 [Roseobacter denitrificans OCh 114]
gi|115292960|gb|ABI93412.1| conserved domain protein [Roseobacter denitrificans OCh 114]
Length = 617
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 74/205 (36%), Gaps = 34/205 (16%)
Query: 17 VKCQYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTW--------DGIVPPQCQP--NPSI 66
Q ++ + L GQSNM GR + T DG + P P P+
Sbjct: 58 AAAQPRETHVFALMGQSNMIGRAAFDGGAKWPDGTLQIGRGGDEDGAIIPARNPADGPAT 117
Query: 67 LRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI 126
R A L + L + FA L+ P+ ++ +PCA G T
Sbjct: 118 SRPLAHTGARLGNMGLD--------------IQFAIDYLSDKPDVTLL-FIPCAQGATGF 162
Query: 127 SQ--WRKGSSLYEQMIQRAQVALRGGGTI--RAVLWYQGESDTVNLEDAKLYKERSDMFF 182
S W G LY + R A+ + LW+QGE+DT Y D
Sbjct: 163 SNGAWNPGDWLYNRETARINAAMNANPEFLFQGFLWHQGETDT---GIPGTYGGLLDNLI 219
Query: 183 TDLRSDLQ--SPLLPIIRVALASGE 205
LR D+ +P P I LA+G
Sbjct: 220 AGLRRDVTAATPTTPFILGGLAAGN 244
>gi|425150007|ref|ZP_18549697.1| hypothetical protein EC880221_2323, partial [Escherichia coli
88.0221]
gi|408599011|gb|EKK72941.1| hypothetical protein EC880221_2323, partial [Escherichia coli
88.0221]
Length = 224
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 69/170 (40%), Gaps = 35/170 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 56 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 114
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 115 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 174
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQG 161
W G LY+ +I R + AL+ + AV W QG
Sbjct: 175 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCWMQG 224
>gi|425379189|ref|ZP_18763332.1| hypothetical protein ECEC1865_2284, partial [Escherichia coli
EC1865]
gi|408299149|gb|EKJ16980.1| hypothetical protein ECEC1865_2284, partial [Escherichia coli
EC1865]
Length = 417
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 57/137 (41%), Gaps = 29/137 (21%)
Query: 76 VLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ------- 128
VL H +AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 140 VLNHP--NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFS 197
Query: 129 -----------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYK 175
W G LY+ +I R + AL+ + AV W QGE D A Y
Sbjct: 198 ESTGASQDSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYS 253
Query: 176 ERSDMF---FTDLRSDL 189
++ +F R+D+
Sbjct: 254 QQPPLFAAMLKQFRADI 270
>gi|408671715|ref|YP_006875523.1| protein of unknown function DUF303 acetylesterase [Emticicia
oligotrophica DSM 17448]
gi|387857564|gb|AFK05659.1| protein of unknown function DUF303 acetylesterase [Emticicia
oligotrophica DSM 17448]
Length = 278
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 100/251 (39%), Gaps = 49/251 (19%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKL-KWVLAHEPLH 83
+ + GQSNM G + N + I+ P +L K+ +W LA PL
Sbjct: 25 HIYLCIGQSNMEGAARIEEQDTINIDSRFKILEALDCP-----QLGRKMGQWYLAKPPL- 78
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------ 131
+ P F +L + +GLV A+ G+ I + K
Sbjct: 79 ----CRCNTRLSPADYFGRTLLQNMSPKQSLGLVHVAVAGSKIEIFDKIKYKTYLDSSAK 134
Query: 132 ------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
G + YE++I+ A++A + G I+ +L +QGES+T + K + +
Sbjct: 135 EKPWMINMANSYGGNPYERLIEMAKIAQKSG-VIKGILLHQGESNTGD----KTWPAQVK 189
Query: 180 MFFTDLRSDL-QSP-LLPIIRVALASGE-----GPFIEIVRKAQLSSDLPNVRCVDAMGL 232
+ D+ +DL +P +P+I L S E I+ A L +P V + GL
Sbjct: 190 KIYDDILADLGMAPNSIPLIAGELVSAEQGGKCASHNTII--ATLPQAIPKAIVVSSNGL 247
Query: 233 PLEPDGLHLTT 243
DGLH +
Sbjct: 248 TAAKDGLHFDS 258
>gi|366086963|ref|ZP_09453448.1| hypothetical protein LzeaK3_07061 [Lactobacillus zeae KCTC 3804]
Length = 269
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 68/156 (43%), Gaps = 23/156 (14%)
Query: 95 GPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS------------------LY 136
G L + +L + P G+ ++ C+ GGT+I G +
Sbjct: 68 GFDLVTYHNILQRTPYQGLY-VIKCSEGGTSIDPTGDGDRHWTTHFDELASPDDSLLLAF 126
Query: 137 EQMIQRAQVALRGGGTIRAVLWYQGESD--TVNLEDAKLYKERSDMFFTDLRSDLQSPLL 194
+I++ A R I+A+LW+QGE+D + + A Y + FT R + + L
Sbjct: 127 THLIKQCLAASRQHLDIKAMLWHQGEADRGSYSQAAADHYYDNLKAVFTYCRQLVDNATL 186
Query: 195 PIIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVD 228
PII ++ + V K+ QL+S+ PN+ +D
Sbjct: 187 PIICGTVSHHSEQYDPQVEKSMIQLASEDPNIHMID 222
>gi|296123387|ref|YP_003631165.1| hypothetical protein Plim_3151 [Planctomyces limnophilus DSM 3776]
gi|296015727|gb|ADG68966.1| protein of unknown function DUF303 acetylesterase putative
[Planctomyces limnophilus DSM 3776]
Length = 1077
Score = 44.3 bits (103), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 97/242 (40%), Gaps = 50/242 (20%)
Query: 12 SEAWPVKCQYQQQQLIILAGQSNMAGRGGVTNDTRTN------KLTWDGIVPPQCQPNPS 65
SEA P+K + ILAGQSNM G G V+ D + + L W Q
Sbjct: 786 SEAKPLK-------VFILAGQSNMEGHGVVSMDGKRDYNGGKGNLVWS---MKHSQSAEK 835
Query: 66 ILRL-TAKLKWVLAHE---PLHADIDVNK------------TNGVGPGLPFANAVLTKVP 109
+ RL K +WV+ + D V K ++ +GP L F V+
Sbjct: 836 LKRLKNEKGEWVIRDDVQISFKVDDKVRKGGLTIGYTGYGGSSHIGPELGFG-FVMGDYL 894
Query: 110 NFGVIGLVPCAIGGTNI-SQWRKGSS------LYEQMIQRAQVALRGGG----TIRAVLW 158
+ V+ L+ A GG ++ +R SS Y +M++ + AL G I +W
Sbjct: 895 DEPVL-LIKTAWGGKSLFVDFRPPSSGGQVGPYYTKMVEEVRAALAELGDQKYEIAGFVW 953
Query: 159 YQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASG----EGPFIEIVRK 214
QG +D Y + DLR + SP LP++ L +G G E RK
Sbjct: 954 QQGWNDMCEKPAIAEYAQNLVNLVKDLRKEFDSPNLPVVVGELGNGGPVTSGDMFEF-RK 1012
Query: 215 AQ 216
AQ
Sbjct: 1013 AQ 1014
>gi|336415342|ref|ZP_08595682.1| hypothetical protein HMPREF1017_02790 [Bacteroides ovatus
3_8_47FAA]
gi|335940938|gb|EGN02800.1| hypothetical protein HMPREF1017_02790 [Bacteroides ovatus
3_8_47FAA]
Length = 643
Score = 44.3 bits (103), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 58/135 (42%), Gaps = 25/135 (18%)
Query: 117 VPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKE 176
+P IG N + + LY MI LR G IR ++WYQGESDT E +K Y+
Sbjct: 396 IPSTIGFQN-----EPTGLYNSMIH----PLRNYG-IRGIIWYQGESDT-GPEGSKHYER 444
Query: 177 RSDMFFTDLRSDLQSPLLPIIRVALA-----------SGEGPFIEIVRKAQLSSDLPNVR 225
D R+ + LP + V LA SG E RKA L L NV
Sbjct: 445 HLIDLVNDWRTQWNNKNLPFVIVQLANYQQRSKVPVESGNAQVREAQRKASLQ--LKNVG 502
Query: 226 CVDAMGLPLEPDGLH 240
A+ L E + +H
Sbjct: 503 LATAIDLG-ESNDIH 516
>gi|260855873|ref|YP_003229764.1| hypothetical protein ECO26_2785 [Escherichia coli O26:H11 str.
11368]
gi|415792095|ref|ZP_11495738.1| hypothetical protein ECEPECA14_5385 [Escherichia coli EPECa14]
gi|417298020|ref|ZP_12085262.1| PF03629 domain protein [Escherichia coli 900105 (10e)]
gi|419267505|ref|ZP_13809862.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|419272924|ref|ZP_13815225.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
gi|419905780|ref|ZP_14424730.1| hypothetical protein ECO10026_13793 [Escherichia coli O26:H11 str.
CVM10026]
gi|420114099|ref|ZP_14623787.1| hypothetical protein ECO10021_24171 [Escherichia coli O26:H11 str.
CVM10021]
gi|420123759|ref|ZP_14632640.1| hypothetical protein ECO10030_13494 [Escherichia coli O26:H11 str.
CVM10030]
gi|420124979|ref|ZP_14633816.1| hypothetical protein ECO10224_20610 [Escherichia coli O26:H11 str.
CVM10224]
gi|420134644|ref|ZP_14642748.1| hypothetical protein ECO9952_00999 [Escherichia coli O26:H11 str.
CVM9952]
gi|424753342|ref|ZP_18181299.1| hypothetical protein CFSAN001629_23431 [Escherichia coli O26:H11
str. CFSAN001629]
gi|257754522|dbj|BAI26024.1| hypothetical protein ECO26_2785 [Escherichia coli O26:H11 str.
11368]
gi|323152778|gb|EFZ39050.1| hypothetical protein ECEPECA14_5385 [Escherichia coli EPECa14]
gi|378112277|gb|EHW73857.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|378117641|gb|EHW79155.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10D]
gi|386258288|gb|EIJ13767.1| PF03629 domain protein [Escherichia coli 900105 (10e)]
gi|388380633|gb|EIL43227.1| hypothetical protein ECO10026_13793 [Escherichia coli O26:H11 str.
CVM10026]
gi|394396330|gb|EJE72704.1| hypothetical protein ECO10224_20610 [Escherichia coli O26:H11 str.
CVM10224]
gi|394410299|gb|EJE84709.1| hypothetical protein ECO10021_24171 [Escherichia coli O26:H11 str.
CVM10021]
gi|394416414|gb|EJE90210.1| hypothetical protein ECO10030_13494 [Escherichia coli O26:H11 str.
CVM10030]
gi|394421226|gb|EJE94707.1| hypothetical protein ECO9952_00999 [Escherichia coli O26:H11 str.
CVM9952]
gi|421935564|gb|EKT93252.1| hypothetical protein CFSAN001629_23431 [Escherichia coli O26:H11
str. CFSAN001629]
Length = 625
Score = 44.3 bits (103), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 57/137 (41%), Gaps = 29/137 (21%)
Query: 76 VLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ------- 128
VL H +AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 140 VLNHP--NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFS 197
Query: 129 -----------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYK 175
W G LY+ +I R + AL+ + AV W QGE D A Y
Sbjct: 198 ESTGASQDSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYS 253
Query: 176 ERSDMF---FTDLRSDL 189
++ +F R+D+
Sbjct: 254 QQPPLFAAMLKQFRADI 270
>gi|160885450|ref|ZP_02066453.1| hypothetical protein BACOVA_03450 [Bacteroides ovatus ATCC 8483]
gi|423290377|ref|ZP_17269226.1| hypothetical protein HMPREF1069_04269 [Bacteroides ovatus
CL02T12C04]
gi|423294320|ref|ZP_17272447.1| hypothetical protein HMPREF1070_01112 [Bacteroides ovatus
CL03T12C18]
gi|156109072|gb|EDO10817.1| glycosyl hydrolase family 2, sugar binding domain protein
[Bacteroides ovatus ATCC 8483]
gi|392665764|gb|EIY59287.1| hypothetical protein HMPREF1069_04269 [Bacteroides ovatus
CL02T12C04]
gi|392675511|gb|EIY68952.1| hypothetical protein HMPREF1070_01112 [Bacteroides ovatus
CL03T12C18]
Length = 643
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 58/135 (42%), Gaps = 25/135 (18%)
Query: 117 VPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKE 176
+P IG N + + LY MI LR G IR ++WYQGESDT E +K Y+
Sbjct: 396 IPSTIGFQN-----EPTGLYNSMIH----PLRNYG-IRGIIWYQGESDT-GPEGSKHYER 444
Query: 177 RSDMFFTDLRSDLQSPLLPIIRVALA-----------SGEGPFIEIVRKAQLSSDLPNVR 225
D R+ + LP + V LA SG E RKA L L NV
Sbjct: 445 HLIDLVNDWRTQWNNKNLPFVIVQLANYQQRSKVPVESGNAQVREAQRKASLQ--LKNVG 502
Query: 226 CVDAMGLPLEPDGLH 240
A+ L E + +H
Sbjct: 503 LATAIDLG-ESNDIH 516
>gi|419883598|ref|ZP_14404688.1| hypothetical protein ECO9545_28688, partial [Escherichia coli
O111:H11 str. CVM9545]
gi|420105307|ref|ZP_14615843.1| hypothetical protein ECO9455_08219, partial [Escherichia coli
O111:H11 str. CVM9455]
gi|388357965|gb|EIL22460.1| hypothetical protein ECO9545_28688, partial [Escherichia coli
O111:H11 str. CVM9545]
gi|394398929|gb|EJE75045.1| hypothetical protein ECO9455_08219, partial [Escherichia coli
O111:H11 str. CVM9455]
Length = 393
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFA 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|224538524|ref|ZP_03679063.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519862|gb|EEF88967.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1061
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 100/253 (39%), Gaps = 58/253 (22%)
Query: 25 QLIILAGQSNMAGRGGVTN-DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + GQSNM G V DT + + C P++ R K W A PL
Sbjct: 11 HIYLCLGQSNMEGNAKVEEQDTVAVDSRFQVLAAVDC---PNLGR--TKGNWYKAVPPL- 64
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------ 131
G+ PG F A++ +P+ +G++ A+GG I + K
Sbjct: 65 ----ARCYTGLTPGDYFGRAMVANLPSNVRVGIINVAVGGCRIELFDKDNYQSYVETSPD 120
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G + Y ++++ A++A + G I+ +L +QGES+T N +D L + +
Sbjct: 121 WLKNMVKEYGGNPYARLVEMAKLAQK-DGVIKGILLHQGESNT-NDKDWPL---KVKGVY 175
Query: 183 TDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQ-------------LSSDLPNVRCVDA 229
+L +DL L V L +G E+V Q L +P + +
Sbjct: 176 DNLLNDLG---LSAANVPLLAG-----EVVHADQNGVCASMNTIIDSLPQVIPTAHVISS 227
Query: 230 MGLPLEPDGLHLT 242
G P D LH T
Sbjct: 228 AGCPAAFDNLHFT 240
>gi|162457597|ref|YP_001619964.1| hypothetical protein sce9311 [Sorangium cellulosum So ce56]
gi|161168179|emb|CAN99484.1| hypothetical protein predicted by Glimmer/Critica [Sorangium
cellulosum So ce56]
Length = 346
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 64/152 (42%), Gaps = 28/152 (18%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTA-------KLKWVL 77
+ +L GQSNMAG + Q S RL +W L
Sbjct: 118 HIFMLMGQSNMAG-----------------VAAKQASDQNSDQRLKVLGGCNQPAGQWNL 160
Query: 78 AHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSS 134
A+ PL + +N + V PG+ F +L K+ IGL+ A G +I+ + G S
Sbjct: 161 ANPPLSDCPGESRINLSTSVDPGIWFGKTLLGKLREGDTIGLIGTAESGESINTFISGGS 220
Query: 135 LYEQMIQR-AQVALRGGGTIRAVLWYQGESDT 165
++ ++ + A+ ++++QGE+DT
Sbjct: 221 HHQTILNKIAKAKTAENARFAGIIFHQGETDT 252
>gi|432449289|ref|ZP_19691570.1| hypothetical protein A13W_00242 [Escherichia coli KTE193]
gi|433032604|ref|ZP_20220373.1| hypothetical protein WIC_01210 [Escherichia coli KTE112]
gi|430982421|gb|ELC99111.1| hypothetical protein A13W_00242 [Escherichia coli KTE193]
gi|431558108|gb|ELI31787.1| hypothetical protein WIC_01210 [Escherichia coli KTE112]
Length = 693
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 80/192 (41%), Gaps = 39/192 (20%)
Query: 26 LIILAGQSNMA--GRGGVTNDT------RTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
++++AGQSN + G G DT R +L V P +C N I+ L
Sbjct: 119 VVVIAGQSNASSFGEGLPLPDTYDRPDPRIKQLARRNTVTPGGVECAYN-DIIPADHCLH 177
Query: 75 WVLA---HEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI----- 126
VL H AD+ + VG GL A +L +P I LVPCA GG+
Sbjct: 178 DVLDMSNHNHPKADLSKGQYGCVGQGLHIAKKLLPFIPEEAGILLVPCARGGSAFTDGAD 237
Query: 127 -------------SQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
S+W L+ ++ R + AL + +V+W QGE+D L+
Sbjct: 238 GEFTEASGATSASSRWGVNKPLFSDLVNRTKAALSSNPRNILLSVVWMQGEND---LKTG 294
Query: 172 KLYKERSDMFFT 183
K + E S +F T
Sbjct: 295 K-HAEHSGLFVT 305
>gi|293433588|ref|ZP_06662016.1| transposase [Escherichia coli B088]
gi|291324407|gb|EFE63829.1| transposase [Escherichia coli B088]
Length = 646
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 22/126 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + +
Sbjct: 154 ADLSKGQYGCVGQGLHIAKKLLPYIPQNAGILLVPCCRGASAFTTGDDGSFSEVSGASAD 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R + AL + AV+W QGE+D + + + L+ F
Sbjct: 214 SSRWGAGKPLYQDLLSRTRAALAKNPKNKLLAVVWMQGEADLASGSQQHNGLFTAMVQQF 273
Query: 182 FTDLRS 187
TDL S
Sbjct: 274 RTDLSS 279
>gi|420107835|ref|ZP_14618154.1| hypothetical protein ECO9553_16783 [Escherichia coli O111:H11 str.
CVM9553]
gi|424760725|ref|ZP_18188330.1| hypothetical protein CFSAN001630_14097 [Escherichia coli O111:H11
str. CFSAN001630]
gi|394411814|gb|EJE86005.1| hypothetical protein ECO9553_16783 [Escherichia coli O111:H11 str.
CVM9553]
gi|421945396|gb|EKU02612.1| hypothetical protein CFSAN001630_14097 [Escherichia coli O111:H11
str. CFSAN001630]
Length = 625
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFA 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|419209857|ref|ZP_13752944.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
gi|378055088|gb|EHW17356.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
Length = 625
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFA 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|419298471|ref|ZP_13840491.1| hypothetical protein ECDEC11C_0340 [Escherichia coli DEC11C]
gi|378157339|gb|EHX18377.1| hypothetical protein ECDEC11C_0340 [Escherichia coli DEC11C]
Length = 616
Score = 43.9 bits (102), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 81/201 (40%), Gaps = 42/201 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDA 171
W G LY+ +I R + AL+ + AV QGE D A
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQKNPKNVLLAVCRMQGEFDM----SA 240
Query: 172 KLYKERSDMF---FTDLRSDL 189
+ ++ +F T R+DL
Sbjct: 241 ATHAQQPALFTAMLTQFRADL 261
>gi|427384532|ref|ZP_18881037.1| hypothetical protein HMPREF9447_02070 [Bacteroides oleiciplenus YIT
12058]
gi|425727793|gb|EKU90652.1| hypothetical protein HMPREF9447_02070 [Bacteroides oleiciplenus YIT
12058]
Length = 643
Score = 43.9 bits (102), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
IR LWYQGE ++ E LYK+ TD R + LP + V L + G +
Sbjct: 433 IRGFLWYQGEGNSGQPE---LYKQLQPTMITDWRIRFEQGYLPFLFVQLPNISGGSCQYF 489
Query: 213 RKAQLSS-DLPNV 224
R+AQ S +LPNV
Sbjct: 490 REAQAESLELPNV 502
>gi|423000428|ref|ZP_16991182.1| hypothetical protein EUEG_02845 [Escherichia coli O104:H4 str.
09-7901]
gi|423004097|ref|ZP_16994843.1| hypothetical protein EUDG_01581 [Escherichia coli O104:H4 str.
04-8351]
gi|354869544|gb|EHF29954.1| hypothetical protein EUDG_01581 [Escherichia coli O104:H4 str.
04-8351]
gi|354873399|gb|EHF33776.1| hypothetical protein EUEG_02845 [Escherichia coli O104:H4 str.
09-7901]
Length = 625
Score = 43.9 bits (102), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFA 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|432615868|ref|ZP_19851993.1| hypothetical protein A1UM_01299 [Escherichia coli KTE75]
gi|431156286|gb|ELE57022.1| hypothetical protein A1UM_01299 [Escherichia coli KTE75]
Length = 655
Score = 43.5 bits (101), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 52/125 (41%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + + L+ +
Sbjct: 211 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDVAVGTHAQHSGLFSAMVNQ 270
Query: 181 FFTDL 185
F TDL
Sbjct: 271 FRTDL 275
>gi|419148325|ref|ZP_13693001.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC6B]
gi|377995696|gb|EHV58811.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC6B]
Length = 465
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFT 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|365852288|ref|ZP_09392678.1| hypothetical protein HMPREF9103_01459 [Lactobacillus parafarraginis
F0439]
gi|363715094|gb|EHL98565.1| hypothetical protein HMPREF9103_01459 [Lactobacillus parafarraginis
F0439]
Length = 276
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 15/132 (11%)
Query: 112 GVIGLVPCAIGGTNISQWRK--------GSSL---YEQMIQRAQVALRGGGTIRAVLWYQ 160
G I + P GG + S W SL ++Q+I+ Q A I+A+LW+Q
Sbjct: 98 GGISIAPSGEGGVDDSHWSTHIDQLKDPSHSLLLQFKQLIESCQAAQNNQLVIKAMLWHQ 157
Query: 161 GESDTVNLED--AKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKA--Q 216
GE D + A Y + F R + +P LPI ++ F V +
Sbjct: 158 GEGDRADFSSSAAANYYDNLKAVFAYCRRVVGNPQLPIFCGTVSHHSDQFDSQVEAGVIR 217
Query: 217 LSSDLPNVRCVD 228
L+++ P++ VD
Sbjct: 218 LATEDPHIYLVD 229
>gi|189468355|ref|ZP_03017140.1| hypothetical protein BACINT_04752 [Bacteroides intestinalis DSM
17393]
gi|189436619|gb|EDV05604.1| hypothetical protein BACINT_04752 [Bacteroides intestinalis DSM
17393]
Length = 484
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 62/148 (41%), Gaps = 32/148 (21%)
Query: 114 IGLVPCAIGGTNISQW--RKGSSLYEQM--------------------IQRAQVALRGGG 151
+G++ +GG+ + W R+ S ++ + + A++A
Sbjct: 206 VGIIISTLGGSKVEAWMSREAISPFKSIDLSILDNDEKIKNLTNTPCVLYNAKIAPFLNF 265
Query: 152 TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA-----SGEG 206
I+ LWYQGES N ++A LYK+ F DLRS P V +A +G
Sbjct: 266 AIKGFLWYQGES---NRDNADLYKDLMPAFVKDLRSKWNRGEFPFYFVEIAPFNYEGADG 322
Query: 207 PFIEIVRKAQLSS--DLPNVRCVDAMGL 232
+R+ QL + D+PN V + +
Sbjct: 323 TSAARMREVQLQNMKDIPNSGMVTTLDI 350
>gi|283788278|ref|YP_003368143.1| hypothetical protein ROD_47411 [Citrobacter rodentium ICC168]
gi|282951732|emb|CBG91434.1| hypothetical prophage protein [Citrobacter rodentium ICC168]
Length = 683
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 53/124 (42%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + +
Sbjct: 188 ADLSKGQYGCVGQGLHIAKKLLPYIPQNAGILLVPCCRGASAFTTGDDGSFSEVSGASAD 247
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESD--TVNLEDAKLYKERSDMF 181
S+W G LY+ ++ R + AL R AV+W QGE+D + + + L+ F
Sbjct: 248 SSRWGAGKPLYQDLLSRTRAALEKNPKNRLLAVVWMQGEADLASGSQQHNGLFTAMVQQF 307
Query: 182 FTDL 185
TDL
Sbjct: 308 RTDL 311
>gi|417172492|ref|ZP_12002525.1| PF03629 domain protein [Escherichia coli 3.2608]
gi|432557892|ref|ZP_19794580.1| hypothetical protein A1S7_01542 [Escherichia coli KTE49]
gi|386180190|gb|EIH57664.1| PF03629 domain protein [Escherichia coli 3.2608]
gi|431093398|gb|ELD99063.1| hypothetical protein A1S7_01542 [Escherichia coli KTE49]
Length = 625
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 54/130 (41%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
+AD+ + VG GL A +L +P I LVPC GG+ +Q
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTQGAEGTFSESTGASQ 204
Query: 129 ----WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFT 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|423227456|ref|ZP_17213917.1| hypothetical protein HMPREF1062_06103 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392623086|gb|EIY17192.1| hypothetical protein HMPREF1062_06103 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 491
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 72/176 (40%), Gaps = 38/176 (21%)
Query: 88 VNKTNGVGPGLPFANAV--LTKVPNFGVIGLVPCAIGGTNISQW--RKGSSLYEQM---- 139
VN N FA + + +VP +G++ +GG+ + W R+ S ++ +
Sbjct: 189 VNVANTSAAAYYFARYIQEVLEVP----VGIIVSTLGGSKVEAWMSREAISPFKSINLSI 244
Query: 140 ----------------IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
+ A+VA I+ LWYQGES N ++A LY+ F
Sbjct: 245 LDNDEQIKNITATPCVLYNAKVAPFTNFAIKGFLWYQGES---NRDNADLYQSLMPAFVK 301
Query: 184 DLRSDLQSPLLPIIRVALA-----SGEGPFIEIVRKAQLSS--DLPNVRCVDAMGL 232
DLR+ LP V +A +G +R+ QL + D+PN V + +
Sbjct: 302 DLRNKWNRGELPFYFVEIAPFNYEGADGTSAARMREVQLQNMKDIPNSGMVSTLDI 357
>gi|224535245|ref|ZP_03675784.1| hypothetical protein BACCELL_00106 [Bacteroides cellulosilyticus
DSM 14838]
gi|224523143|gb|EEF92248.1| hypothetical protein BACCELL_00106 [Bacteroides cellulosilyticus
DSM 14838]
Length = 491
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 72/176 (40%), Gaps = 38/176 (21%)
Query: 88 VNKTNGVGPGLPFANAV--LTKVPNFGVIGLVPCAIGGTNISQW--RKGSSLYEQM---- 139
VN N FA + + +VP +G++ +GG+ + W R+ S ++ +
Sbjct: 189 VNVANTSAAAYYFARYIQEVLEVP----VGIIVSTLGGSKVEAWMSREAISPFKSINLSI 244
Query: 140 ----------------IQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFT 183
+ A+VA I+ LWYQGES N ++A LY+ F
Sbjct: 245 LDNDEQIKNITATPCVLYNAKVAPFTNFAIKGFLWYQGES---NRDNADLYQSLMPAFVK 301
Query: 184 DLRSDLQSPLLPIIRVALA-----SGEGPFIEIVRKAQLSS--DLPNVRCVDAMGL 232
DLR+ LP V +A +G +R+ QL + D+PN V + +
Sbjct: 302 DLRNKWNRGELPFYFVEIAPFNYEGADGTSAARMREVQLQNMKDIPNSGMVSTLDI 357
>gi|338209453|ref|YP_004646424.1| acetylcholinesterase [Runella slithyformis DSM 19594]
gi|336308916|gb|AEI52017.1| Acetylcholinesterase [Runella slithyformis DSM 19594]
Length = 786
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 99/258 (38%), Gaps = 48/258 (18%)
Query: 17 VKCQYQQQQLIILAGQSNMAGRGGVT--NDTRTNKLTWDGIVPPQCQPNPSILRLTAKLK 74
K Q + + GQSNM G + + T N+L V C P + R K
Sbjct: 18 AKAQDPNFHIYLCIGQSNMEGPARIEPQDTTVDNRLRLLASV--DC---PELGR--TKGN 70
Query: 75 WVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK--- 131
W A PL + P F ++ +P +G + A+ G+ I + K
Sbjct: 71 WYTAKPPL-----CRCNTRLSPADYFGRTLVANLPPNVKLGFLHVAVAGSKIEIFDKKDY 125
Query: 132 ---------------------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED 170
G + YE++++ A++A + G I+ +L +QGES+T +
Sbjct: 126 KMYLDTSAKERPWMINMANQYGGNPYERLVEMARLAQKAG-VIKGILLHQGESNTGD--- 181
Query: 171 AKLYKERSDMFFTDLRSDLQ-----SPLLPIIRVALASGEGPFIEIVRKAQLSSDLPNVR 225
K + + + DL +DLQ PLL V G A L +P
Sbjct: 182 -KAWPMKVKKIYDDLLADLQLAPNSIPLLAGELVNADQGGKCASMNTIIATLPQVIPQAI 240
Query: 226 CVDAMGLPLEPDGLHLTT 243
V + GLP PD LH ++
Sbjct: 241 IVSSKGLPAVPDKLHFSS 258
>gi|427387357|ref|ZP_18883413.1| hypothetical protein HMPREF9447_04446 [Bacteroides oleiciplenus YIT
12058]
gi|425725518|gb|EKU88389.1| hypothetical protein HMPREF9447_04446 [Bacteroides oleiciplenus YIT
12058]
Length = 465
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 93/233 (39%), Gaps = 47/233 (20%)
Query: 9 ILVSEAWPVKCQYQQQQLIILAGQSNMAGR-GGVTNDTRTNKLTWDGIVPPQCQPNPSIL 67
+L+ E W + +GQSNM R GG +D L D IV NP I
Sbjct: 105 VLIGEVW------------LCSGQSNMDMRVGGRYSDPVIGSL--DAIV---TSGNPDIR 147
Query: 68 RLTAKLKWVLAHEPLHADID---VNKTNGVGPGLPFANAVLTKVPN--FGV-IGLVPCAI 121
T K + EPL D + ++ PG A + N G+ +G++ +
Sbjct: 148 MFTVGSK--MTSEPL-TDCEGEWQEASSETVPGFSAAGYFFARKLNQVLGIPVGIIHASY 204
Query: 122 GGTNISQW--RKGSSLYEQM--IQRAQVALRG------GGTIRAVLWYQGESDTVNLEDA 171
GG+ + W ++G + Y+ + + A + G G IR LWYQGE+ N++
Sbjct: 205 GGSRVEAWMSKEGVAPYKDLPDVHNASILYNGMLSPVIGYGIRGCLWYQGEA---NVDAP 261
Query: 172 KLYKERSDMFFTDLRSDLQSPLLPIIRVALA-------SGEGPFIEIVRKAQL 217
LY + +D R P +A G+G +R+AQ+
Sbjct: 262 DLYTQLFPSLVSDWRQQWGIGEFPFYYAQIAPFNYNKGEGKGKNSAYLREAQV 314
>gi|150006324|ref|YP_001301068.1| sialic acid-specific 9-O-acetylesterase [Bacteroides vulgatus ATCC
8482]
gi|149934748|gb|ABR41446.1| sialic acid-specific 9-O-acetylesterase [Bacteroides vulgatus ATCC
8482]
Length = 638
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 13/132 (9%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
++ V+WYQG+S NLE + Y + D R Q P +P V L P E
Sbjct: 424 LQGVIWYQGKS---NLESSDEYADLFMSLIADWRDKWQKPQMPFYFVQL-----PNHEKK 475
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT---PAQGSTLNSWSNEALRVNLSLLVFRI 269
+AQ SD +R A L L G+ +TT + +T S LR LS L +
Sbjct: 476 EEAQDDSDWAAMREAQAQALHLNHTGMVITTDIGKEKSNTFQSTLETGLR--LSQLALKQ 533
Query: 270 LEGSCRISKQAV 281
G ++ + V
Sbjct: 534 TYGKRKMPQYPV 545
>gi|373458721|ref|ZP_09550488.1| protein of unknown function DUF303 acetylesterase [Caldithrix
abyssi DSM 13497]
gi|371720385|gb|EHO42156.1| protein of unknown function DUF303 acetylesterase [Caldithrix
abyssi DSM 13497]
Length = 577
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 21/143 (14%)
Query: 133 SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSP 192
S+ Y +++ RA A ++A+ W+QGESD+ N DA Y R D + R D Q P
Sbjct: 223 STTYGRLLYRATKA-HVQNAVKAIFWHQGESDS-NTPDADYYAARFDTLYNAWRQDYQ-P 279
Query: 193 LLPIIRVALASGE--GPFIEIVRKAQ--LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGS 248
L + L G G VR+ Q NV + GL + DG H
Sbjct: 280 LTKVYVFQLHPGTCGGDRQSDVREIQRNFKKTYGNVHVMATCGL-VGHDGCHY------- 331
Query: 249 TLNSWSNEALRVNLSLLVFRILE 271
N+ + ++ +FR++E
Sbjct: 332 ------NDDGYLQMAEWIFRLVE 348
>gi|218663496|ref|ZP_03519426.1| hypothetical protein RetlI_31510 [Rhizobium etli IE4771]
Length = 312
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 8/126 (6%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
AN ++ N VI L P A GG+ +++W G ++ + G + +V W
Sbjct: 133 LANKLIASGQNDNVI-LAPLAYGGSEVARWAAGGDFNPLLVDTVKQLHDSGYRVTSVHWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIE-----IV 212
QGE+D V A+ Y+ER LR +++P+ + I L G F E ++
Sbjct: 192 QGEADLVFGTTAEAYQERFLSMVGTLRQHGVEAPVYISIASKCLEPSNGGFKEHIPDNVI 251
Query: 213 RKAQLS 218
+AQL+
Sbjct: 252 VQAQLA 257
>gi|219116657|ref|XP_002179123.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409014|gb|EEC48946.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 396
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 41/93 (44%), Gaps = 5/93 (5%)
Query: 118 PCAIGGTNISQWRKGSSLYEQMIQRAQV----ALRGGGTIRAVLWYQGESDTVNLEDAKL 173
P A G T +R + + Q A + I ++W+ G +D N +A
Sbjct: 150 PSATGETGFQWYRMQTGIANTFAQIANILGEEYKHADIDIGGIVWWHGYTDLWNQANAAE 209
Query: 174 YKERSDMFFTDLRSDLQSPLLPIIRVALASGEG 206
Y+ + F DLRS L PLLPI+ +A G G
Sbjct: 210 YESNLEHFVRDLRSTLHRPLLPIV-IAELGGSG 241
>gi|345514966|ref|ZP_08794472.1| polysaccharide deacetylase [Bacteroides dorei 5_1_36/D4]
gi|345455823|gb|EEO44678.2| polysaccharide deacetylase [Bacteroides dorei 5_1_36/D4]
Length = 503
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDA----KLYKERSDMFFTDLRSDLQSPLLP 195
I + L+ G I A LW+QGESD +D K M T+ ++ LP
Sbjct: 169 IDKTLSRLKDGYQIDAFLWHQGESDYAKSKDYYRNLKTMVAYVRMHLTE-KTGKDYSRLP 227
Query: 196 IIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I +A F V A QL+++ PN+ +D G L D LH T +
Sbjct: 228 FIFGTVARSNKYFSREVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHS 279
>gi|345521349|ref|ZP_08800678.1| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 4_3_47FAA]
gi|345456583|gb|EET18251.2| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 4_3_47FAA]
Length = 634
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 13/132 (9%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
++ V+WYQG+S NLE + Y + D R Q P +P V L P E
Sbjct: 420 LQGVIWYQGKS---NLESSDEYADLFMSLIADWRDKWQKPQMPFYFVQL-----PNHEKK 471
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT---PAQGSTLNSWSNEALRVNLSLLVFRI 269
+AQ SD +R A L L G+ +TT + +T S LR LS L +
Sbjct: 472 EEAQDDSDWAAMREAQAQALHLNHTGMVVTTDIGKEKSNTFQSTLETGLR--LSQLALKQ 529
Query: 270 LEGSCRISKQAV 281
G ++ + V
Sbjct: 530 TYGKRKMPQYPV 541
>gi|212694116|ref|ZP_03302244.1| hypothetical protein BACDOR_03642 [Bacteroides dorei DSM 17855]
gi|423228401|ref|ZP_17214807.1| hypothetical protein HMPREF1063_00627 [Bacteroides dorei
CL02T00C15]
gi|423239506|ref|ZP_17220622.1| hypothetical protein HMPREF1065_01245 [Bacteroides dorei
CL03T12C01]
gi|423243664|ref|ZP_17224740.1| hypothetical protein HMPREF1064_00946 [Bacteroides dorei
CL02T12C06]
gi|212663336|gb|EEB23910.1| GDSL-like protein [Bacteroides dorei DSM 17855]
gi|392636147|gb|EIY30031.1| hypothetical protein HMPREF1063_00627 [Bacteroides dorei
CL02T00C15]
gi|392644554|gb|EIY38292.1| hypothetical protein HMPREF1064_00946 [Bacteroides dorei
CL02T12C06]
gi|392646240|gb|EIY39957.1| hypothetical protein HMPREF1065_01245 [Bacteroides dorei
CL03T12C01]
Length = 503
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDA----KLYKERSDMFFTDLRSDLQSPLLP 195
I + L+ G I A LW+QGESD +D K M T+ ++ LP
Sbjct: 169 IDKTLSRLKDGYQIDAFLWHQGESDYAKSKDYYRNLKTMVAYVRMHLTE-KTGKDYSRLP 227
Query: 196 IIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I +A F V A QL+++ PN+ +D G L D LH T +
Sbjct: 228 FIFGTVARSNKYFSREVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHS 279
>gi|294778434|ref|ZP_06743857.1| GDSL-like protein [Bacteroides vulgatus PC510]
gi|319642040|ref|ZP_07996706.1| hypothetical protein HMPREF9011_02306 [Bacteroides sp. 3_1_40A]
gi|345521204|ref|ZP_08800535.1| polysaccharide deacetylase [Bacteroides sp. 4_3_47FAA]
gi|423312199|ref|ZP_17290136.1| hypothetical protein HMPREF1058_00748 [Bacteroides vulgatus
CL09T03C04]
gi|254835413|gb|EET15722.1| polysaccharide deacetylase [Bacteroides sp. 4_3_47FAA]
gi|294447696|gb|EFG16273.1| GDSL-like protein [Bacteroides vulgatus PC510]
gi|317386306|gb|EFV67219.1| hypothetical protein HMPREF9011_02306 [Bacteroides sp. 3_1_40A]
gi|392688683|gb|EIY81967.1| hypothetical protein HMPREF1058_00748 [Bacteroides vulgatus
CL09T03C04]
Length = 503
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDA----KLYKERSDMFFTDLRSDLQSPLLP 195
I + L+ G I A LW+QGESD +D K M T+ ++ LP
Sbjct: 169 IDKTLSRLKDGYQIDAFLWHQGESDYAKSKDYYRNLKTMVAYVRMHLTE-KTGKDYSRLP 227
Query: 196 IIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I +A F V A QL+++ PN+ +D G L D LH T +
Sbjct: 228 FIFGTVARSNKYFSREVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHS 279
>gi|150004869|ref|YP_001299613.1| hypothetical protein BVU_2332 [Bacteroides vulgatus ATCC 8482]
gi|149933293|gb|ABR39991.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 500
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDA----KLYKERSDMFFTDLRSDLQSPLLP 195
I + L+ G I A LW+QGESD +D K M T+ ++ LP
Sbjct: 166 IDKTLSRLKDGYQIDAFLWHQGESDYAKSKDYYRNLKTMVAYVRMHLTE-KTGKDYSRLP 224
Query: 196 IIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I +A F V A QL+++ PN+ +D G L D LH T +
Sbjct: 225 FIFGTVARSNKYFSREVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHS 276
>gi|319641218|ref|ZP_07995918.1| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 3_1_40A]
gi|317387151|gb|EFV68030.1| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 3_1_40A]
Length = 426
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 56/132 (42%), Gaps = 13/132 (9%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
++ V+WYQG+S NLE + Y + D R Q P +P V L P E
Sbjct: 212 LQGVIWYQGKS---NLESSDEYADLFMSLIADWRDKWQKPQMPFYFVQL-----PNHEKK 263
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLTT---PAQGSTLNSWSNEALRVNLSLLVFRI 269
+AQ SD +R A L L G+ +TT + +T S LR LS L +
Sbjct: 264 EEAQDDSDWAAMREAQAQALHLNHTGMVVTTDIGKEKSNTFQSTLETGLR--LSQLALKQ 321
Query: 270 LEGSCRISKQAV 281
G ++ + V
Sbjct: 322 TYGKRKMPQYPV 333
>gi|456357048|dbj|BAM91493.1| hypothetical protein S58_55160 [Agromonas oligotrophica S58]
Length = 342
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 70/180 (38%), Gaps = 31/180 (17%)
Query: 10 LVSEAWPVKCQYQQQ---QLIILAGQSNMA--GRGGVTNDTRTNKLTWDGIVPPQCQPNP 64
L ++A V C+ Q +I++ GQSN G G N+ + + G QC
Sbjct: 93 LFAKAMKVDCRTFAQPRSAVILILGQSNAGNYGEGRSPNNHGADVANYFG---QQC---- 145
Query: 65 SILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT 124
+A EPL + NG P + AN L + F + LVP +GGT
Sbjct: 146 -----------AVAAEPLMG----SDGNGGSPWMALANTTL-EAKVFDRVLLVPLTLGGT 189
Query: 125 NISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTD 184
+++W G LY + R G V W QGE++ D Y+ + D
Sbjct: 190 GMTRWNAGGDLYMLAESTLRRLARSGIPPTHVFWVQGEAERF---DGSRYRRNGGADYFD 246
>gi|359769306|ref|ZP_09273068.1| hypothetical protein GOPIP_088_00110 [Gordonia polyisoprenivorans
NBRC 16320]
gi|359313212|dbj|GAB25901.1| hypothetical protein GOPIP_088_00110 [Gordonia polyisoprenivorans
NBRC 16320]
Length = 296
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 76/176 (43%), Gaps = 27/176 (15%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
Q++ + GQSN G G + + + + P+ P R ++ +LA +PL
Sbjct: 51 QVVAVLGQSNAHGAGRLLDPSAAP------VTDPRVHQWPGCGRRRGQI--LLAEDPL-- 100
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI----------SQWRKGSS 134
+ GVG G F + + G + LVP A G T+ Q +
Sbjct: 101 -LHGTPGAGVGFGTTFGRLLAEDID--GSVLLVPAARGDTSFHPKNGFSWDPDQRSVRVN 157
Query: 135 LYEQMIQRAQVALRGGG---TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRS 187
L+++ + + ALR G + AVLW+QGESD V L + Y++R D LR
Sbjct: 158 LFDRAVAQIAGALRAAGPESELVAVLWHQGESD-VPLTAPETYRDRLDTLIRRLRD 212
>gi|343926022|ref|ZP_08765537.1| hypothetical protein GOALK_050_03180 [Gordonia alkanivorans NBRC
16433]
gi|343764373|dbj|GAA12463.1| hypothetical protein GOALK_050_03180 [Gordonia alkanivorans NBRC
16433]
Length = 298
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 73/176 (41%), Gaps = 25/176 (14%)
Query: 116 LVPCAIGGTNISQ-----WRKGS-----SLYE---QMIQRAQVALRGGGTIRAVLWYQGE 162
LVP A G T+ Q W + +LY+ + I A A G + A+LW+QGE
Sbjct: 130 LVPSARGDTSFHQKNGYSWDPANRTARVNLYDLAVRQIGNALAAASTGSRLAAILWHQGE 189
Query: 163 SDTVNLEDAKLYKERSDMFFTDLRSDL-QSPLL--PIIRVALASGEGPFIEIVRKAQLSS 219
SD V L +Y++R D T LR + + P + ++ +A+G + I +
Sbjct: 190 SD-VPLTPPDVYRDRLDALITGLRDNFGEVPFILGQMVPEEIATGHPKYPGIAAVHATTP 248
Query: 220 DLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNEALRVNLSLLVFRILEGSCR 275
D + C G PDG+H P + NS + +R L G R
Sbjct: 249 DRHSA-CAHVSG----PDGMH--NPGETIHYNSAGQREFGRAM-FEAYRDLAGPSR 296
>gi|237710246|ref|ZP_04540727.1| polysaccharide deacetylase [Bacteroides sp. 9_1_42FAA]
gi|265751054|ref|ZP_06087117.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|229455708|gb|EEO61429.1| polysaccharide deacetylase [Bacteroides sp. 9_1_42FAA]
gi|263237950|gb|EEZ23400.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 483
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 48/112 (42%), Gaps = 7/112 (6%)
Query: 140 IQRAQVALRGGGTIRAVLWYQGESDTVNLEDA----KLYKERSDMFFTDLRSDLQSPLLP 195
I + L+ G I A LW+QGESD +D K M T+ ++ LP
Sbjct: 149 IDKTLSRLKDGYQIDAFLWHQGESDYAKSKDYYRNLKTMVAYVRMHLTE-KTGKDYSRLP 207
Query: 196 IIRVALASGEGPFIEIVRKA--QLSSDLPNVRCVDAMGLPLEPDGLHLTTPA 245
I +A F V A QL+++ PN+ +D G L D LH T +
Sbjct: 208 FIFGTVARSNKYFSREVENAMKQLAAEDPNMHLIDMSGAELLNDRLHFTAHS 259
>gi|432617736|ref|ZP_19853847.1| hypothetical protein A1UM_03177 [Escherichia coli KTE75]
gi|431152874|gb|ELE53794.1| hypothetical protein A1UM_03177 [Escherichia coli KTE75]
Length = 658
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 80/209 (38%), Gaps = 62/209 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
G + N +W G LY+ ++ R + AL R AV+W
Sbjct: 190 CRGASAFTTGADGTYSESAGASENSLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWM 249
Query: 160 QGESDT---VNLEDAKLYKERSDMFFTDL 185
QGE D + + L+ + F TDL
Sbjct: 250 QGEGDAAVGTHAQHPGLFSAMVNQFRTDL 278
>gi|417150832|ref|ZP_11990571.1| PF08410 domain protein [Escherichia coli 1.2264]
gi|386160326|gb|EIH22137.1| PF08410 domain protein [Escherichia coli 1.2264]
Length = 630
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 214 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 273
Query: 181 FFTDL 185
F TDL
Sbjct: 274 FRTDL 278
>gi|424087464|ref|ZP_17823804.1| yjhS, partial [Escherichia coli FRIK1996]
gi|390653204|gb|EIN31364.1| yjhS, partial [Escherichia coli FRIK1996]
Length = 277
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|445021210|ref|ZP_21337147.1| hypothetical protein EC71982_5146, partial [Escherichia coli
7.1982]
gi|444649452|gb|ELW22339.1| hypothetical protein EC71982_5146, partial [Escherichia coli
7.1982]
Length = 579
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|425424612|ref|ZP_18805760.1| hypothetical protein EC01288_3966 [Escherichia coli 0.1288]
gi|408340737|gb|EKJ55217.1| hypothetical protein EC01288_3966 [Escherichia coli 0.1288]
Length = 615
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKVCQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSMGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|15834247|ref|NP_313020.1| hypothetical protein ECs4993 [Escherichia coli O157:H7 str. Sakai]
gi|13364469|dbj|BAB38416.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
Length = 679
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 116 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 162
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 163 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 222
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 223 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 282
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 283 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 310
>gi|423223545|ref|ZP_17210014.1| hypothetical protein HMPREF1062_02200 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638302|gb|EIY32146.1| hypothetical protein HMPREF1062_02200 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 644
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
IR LWYQGE ++ E LYK+ TD R + LP + V L + G +
Sbjct: 433 IRGFLWYQGEGNSGQPE---LYKQLQPTMITDWRIRFEQGYLPFLLVQLPNISGGSCQYF 489
Query: 213 RKAQLSS-DLPNV 224
R+AQ S LPNV
Sbjct: 490 REAQAESLQLPNV 502
>gi|419255186|ref|ZP_13797707.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
gi|378100939|gb|EHW62629.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
Length = 458
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|347732490|ref|ZP_08865570.1| hypothetical protein DA2_1861 [Desulfovibrio sp. A2]
gi|347518773|gb|EGY25938.1| hypothetical protein DA2_1861 [Desulfovibrio sp. A2]
Length = 296
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 63/145 (43%), Gaps = 9/145 (6%)
Query: 120 AIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSD 179
A GG+ +S W G + ++ R + + V W+ GESD +N LYK
Sbjct: 148 AEGGSPLSYWLPGGPVRPKLEDRLRAIQQLPIRPDYVFWFHGESDALNSLPRLLYKHDFL 207
Query: 180 MFFTDLRS-DLQSPLLPIIRVALASGEGPFIEIVRKAQ--LSSDLPNVRC---VDAMGLP 233
LR+ + +P+L + + +L G E VR+AQ L+ +PNV D +GLP
Sbjct: 208 DLVGTLRTFGIDNPVL-VSQTSLCRRLG--TESVRQAQQELARQVPNVTLGPDTDEVGLP 264
Query: 234 LEPDGLHLTTPAQGSTLNSWSNEAL 258
DG H T W + L
Sbjct: 265 FRRDGCHFTDEGGDIVAGLWMDAML 289
>gi|325860096|ref|ZP_08173222.1| GDSL-like protein [Prevotella denticola CRIS 18C-A]
gi|325482381|gb|EGC85388.1| GDSL-like protein [Prevotella denticola CRIS 18C-A]
Length = 717
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 6/88 (6%)
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
M + A + L G G IR V+WYQGES+ N+E L++ + RS P LP +
Sbjct: 486 MFETALLPLEGYG-IRGVVWYQGESNAHNME---LHERLFPLLLKSWRSFFHHPDLPFLF 541
Query: 199 VALASGEGPFIEIVRKAQ--LSSDLPNV 224
L+S P R +Q ++S L N+
Sbjct: 542 AQLSSLNRPSWPRFRDSQCRMASALHNI 569
>gi|427383711|ref|ZP_18880431.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
gi|425728416|gb|EKU91274.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
Length = 1074
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 99/253 (39%), Gaps = 58/253 (22%)
Query: 25 QLIILAGQSNMAGRGGVTN-DTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
+ + GQSNM G V DT + + C P++ R K W A PL
Sbjct: 24 HIYLCLGQSNMEGNAKVEEQDTVAIDSRFQVLAAVDC---PNLGR--TKGNWYKAVPPL- 77
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------ 131
G+ PG F A++ +P+ +G++ A+GG I + K
Sbjct: 78 ----ARCYTGLTPGDYFGRAMVANLPSNVRVGIINVAVGGCRIELFDKDNYQSYVETSPD 133
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G + Y ++++ A++A + G I+ +L +QGES+T N +D L + +
Sbjct: 134 WLKNMVKEYGGNPYARLVELAKLAQK-DGVIKGILLHQGESNT-NDKDWPL---KVKGVY 188
Query: 183 TDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQ-------------LSSDLPNVRCVDA 229
+L DL L V L +G E+V Q L +P + +
Sbjct: 189 DNLLKDLG---LSAANVPLLAG-----EVVHADQNGICASMNTIIDSLPQVIPTAHVISS 240
Query: 230 MGLPLEPDGLHLT 242
G P D LH T
Sbjct: 241 AGCPAAFDKLHFT 253
>gi|419899999|ref|ZP_14419472.1| hypothetical protein ECO9942_01687 [Escherichia coli O26:H11 str.
CVM9942]
gi|419906153|ref|ZP_14425079.1| hypothetical protein ECO10026_01034 [Escherichia coli O26:H11 str.
CVM10026]
gi|425126688|ref|ZP_18527884.1| hypothetical protein EC80586_3463 [Escherichia coli 8.0586]
gi|428969139|ref|ZP_19039792.1| hypothetical protein EC900039_0282 [Escherichia coli 90.0039]
gi|388378863|gb|EIL41570.1| hypothetical protein ECO9942_01687 [Escherichia coli O26:H11 str.
CVM9942]
gi|388379782|gb|EIL42424.1| hypothetical protein ECO10026_01034 [Escherichia coli O26:H11 str.
CVM10026]
gi|408570213|gb|EKK46193.1| hypothetical protein EC80586_3463 [Escherichia coli 8.0586]
gi|427234934|gb|EKW02601.1| hypothetical protein EC900039_0282 [Escherichia coli 90.0039]
Length = 614
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|224536591|ref|ZP_03677130.1| hypothetical protein BACCELL_01466 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521847|gb|EEF90952.1| hypothetical protein BACCELL_01466 [Bacteroides cellulosilyticus
DSM 14838]
Length = 614
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
IR LWYQGE ++ E LYK+ TD R + LP + V L + G +
Sbjct: 403 IRGFLWYQGEGNSGQPE---LYKQLQPTMITDWRIRFEQGYLPFLLVQLPNISGGSCQYF 459
Query: 213 RKAQLSS-DLPNV 224
R+AQ S LPNV
Sbjct: 460 REAQAESLQLPNV 472
>gi|421829656|ref|ZP_16264978.1| hypothetical protein ECPA7_1814 [Escherichia coli PA7]
gi|408070515|gb|EKH04872.1| hypothetical protein ECPA7_1814 [Escherichia coli PA7]
Length = 614
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|417246271|ref|ZP_12039611.1| PF03629 domain protein [Escherichia coli 9.0111]
gi|425414354|ref|ZP_18796057.1| hypothetical protein ECFRIK523_5934 [Escherichia coli FRIK523]
gi|429823498|ref|ZP_19355056.1| hypothetical protein EC960109_6001 [Escherichia coli 96.0109]
gi|386209893|gb|EII20378.1| PF03629 domain protein [Escherichia coli 9.0111]
gi|408351707|gb|EKJ65428.1| hypothetical protein ECFRIK523_5934 [Escherichia coli FRIK523]
gi|429260899|gb|EKY44427.1| hypothetical protein EC960109_6001 [Escherichia coli 96.0109]
Length = 614
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|444968946|ref|ZP_21286370.1| hypothetical protein EC991793_1890 [Escherichia coli 99.1793]
gi|445044798|ref|ZP_21360098.1| hypothetical protein EC34880_1761 [Escherichia coli 3.4880]
gi|444583009|gb|ELV58765.1| hypothetical protein EC991793_1890 [Escherichia coli 99.1793]
gi|444663755|gb|ELW35964.1| hypothetical protein EC34880_1761 [Escherichia coli 3.4880]
Length = 614
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|405950672|gb|EKC18644.1| Sialate O-acetylesterase [Crassostrea gigas]
Length = 465
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 56/142 (39%), Gaps = 22/142 (15%)
Query: 114 IGLVPCAIGGTNISQW--------------RKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
IGLV GGT I W RK + YE + A + TI+ +WY
Sbjct: 163 IGLVETNWGGTRIEAWSSPDALKRCAGFSGRKRNQYYESHLYNAMINPLLRNTIKGAIWY 222
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV--RKAQL 217
QGES+ + K + +M F D R+ + L + G F+++ R+ +
Sbjct: 223 QGESNAAHA--YKYTCQFQEMIF-DWRTKFSTASLGTTSSSFPFG---FVQLAPWREGES 276
Query: 218 SSDLPNVRCVDAMGLPLEPDGL 239
+ P VR + P+ L
Sbjct: 277 NLGFPQVRWAQTSNVGYVPNSL 298
>gi|372210212|ref|ZP_09498014.1| carbohydrate esterase [Flavobacteriaceae bacterium S85]
Length = 265
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 99/264 (37%), Gaps = 61/264 (23%)
Query: 29 LAGQSNMAGRGGVT--NDTRTNKLTWDGI-------VPPQCQPNPSILRLTAKLKWVLAH 79
+AGQSNMAG G ++ +++ GI P + +P P L W
Sbjct: 1 MAGQSNMAGHGNFDALDEKALDRVKKAGIRVKLATREPQKKEPVP--------LTWYNGG 52
Query: 80 EPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNIS-----QW----- 129
++ N GP L F+ VL++ L+ A+GGT++ W
Sbjct: 53 ----SNKKYNFKKHFGPEL-FSGVVLSETYPEDDFLLIKTAVGGTSLYGAWNPNWTQEKA 107
Query: 130 --------RKGSSLYEQMIQRAQ---VALRGGG---TIRAVLWYQGESDTVNLEDAKLYK 175
R+ LY++ I+ + L G I VLW QGE+DT N A Y+
Sbjct: 108 KIAERGAARQSMQLYQKHIKNIKSNLAVLESKGIPYKIVGVLWMQGEADTNNELKATAYQ 167
Query: 176 ERSDMFFTDLRSDLQSPLLPIIRVAL-----ASGEGPFIEIVRKA--QLSSDLPNVRCVD 228
+ + R + LP + + +GP +VRKA Q+ +D NV V
Sbjct: 168 QNLENLIAAYRKEFGIEKLPFVIGQINIPPRKFKQGP--TLVRKAMEQVVADNKNVALVK 225
Query: 229 A------MGLPLEPDGLHLTTPAQ 246
P D H T Q
Sbjct: 226 TSTDVSWTDYPKHSDDTHYNTEGQ 249
>gi|284037147|ref|YP_003387077.1| hypothetical protein Slin_2257 [Spirosoma linguale DSM 74]
gi|283816440|gb|ADB38278.1| conserved repeat domain protein [Spirosoma linguale DSM 74]
Length = 831
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 51/116 (43%), Gaps = 20/116 (17%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
IRAVL GE+D N ED+ YK + +R++ P L I VA++S + V
Sbjct: 278 IRAVLVQHGENDRRNPEDST-YKYYHKVI-EKVRTEFLMPKLGFI-VAISSFVDTRFDNV 334
Query: 213 RKAQ------------LSSDLPNVRCVDAMGLPLEPDGLHLTTPAQGSTLNSWSNE 256
R AQ + DL N+ D PDG+H +T Q SW+N
Sbjct: 335 RSAQFRIIGQPNFDTYIGPDLDNINSQDD-----RPDGIHFSTAGQVKAAESWANS 385
>gi|189404003|ref|ZP_02786506.2| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|420291386|ref|ZP_14793544.1| hypothetical protein ECTW11039_1527 [Escherichia coli TW11039]
gi|424103101|ref|ZP_17837978.1| hypothetical protein ECFRIK1990_2571 [Escherichia coli FRIK1990]
gi|424141599|ref|ZP_17873525.1| hypothetical protein ECPA14_3218 [Escherichia coli PA14]
gi|189368015|gb|EDU86431.1| YjhS [Escherichia coli O157:H7 str. EC4501]
gi|390666133|gb|EIN43329.1| hypothetical protein ECFRIK1990_2571 [Escherichia coli FRIK1990]
gi|390702464|gb|EIN76629.1| hypothetical protein ECPA14_3218 [Escherichia coli PA14]
gi|390800402|gb|EIO67493.1| hypothetical protein ECTW11039_1527 [Escherichia coli TW11039]
Length = 669
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 106 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 152
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 153 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 212
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 213 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 272
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 273 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 300
>gi|261258701|ref|ZP_05951234.1| hypothetical protein EscherichiacoliO157EcO_23241 [Escherichia coli
O157:H7 str. FRIK966]
gi|420092609|ref|ZP_14604311.1| hypothetical protein ECO9634_30263 [Escherichia coli O111:H8 str.
CVM9634]
gi|425205130|ref|ZP_18601174.1| hypothetical protein ECFRIK2001_2055 [Escherichia coli FRIK2001]
gi|394400627|gb|EJE76541.1| hypothetical protein ECO9634_30263 [Escherichia coli O111:H8 str.
CVM9634]
gi|408128690|gb|EKH58976.1| hypothetical protein ECFRIK2001_2055 [Escherichia coli FRIK2001]
Length = 640
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 77 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 123
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 124 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 183
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 184 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 243
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 244 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 271
>gi|198275802|ref|ZP_03208333.1| hypothetical protein BACPLE_01977 [Bacteroides plebeius DSM 17135]
gi|198271431|gb|EDY95701.1| hypothetical protein BACPLE_01977 [Bacteroides plebeius DSM 17135]
Length = 289
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 108/279 (38%), Gaps = 55/279 (19%)
Query: 1 MFAWLLCLILVSEAWPVKCQYQ--------QQQLIILAGQSNMAGRGGVTNDTRTNKLTW 52
MF L L+ +S PV Q Q + + + GQSNM G + N
Sbjct: 6 MFVTLTSLMALSLG-PVSAQAQTGTEKVNEKFHIYLCLGQSNMEGNAKIEACDTVNVTPR 64
Query: 53 DGIVPPQCQPNPSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFG 112
++ Q P + R K KW A PL G+ P F + +P
Sbjct: 65 FKVL--QAVDCPDLGR--EKGKWYTAVPPL-----ARCGTGLTPADYFGRTLADSLPADV 115
Query: 113 VIGLVPCAIGGTNISQWRKGSSL---------------------YEQMIQRAQVALRGGG 151
IG++ A+GG I + K + Y ++I+ A+ A R G
Sbjct: 116 EIGVINVAVGGCRIELFDKDNYASYVAGSPDWLKNMVAEYDGNPYARLIELAKQASRCG- 174
Query: 152 TIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALAS-GEG---- 206
I+ +L +QGES+T + + K+ D +DL LQ LP++ L S G+G
Sbjct: 175 VIKGILLHQGESNTGDSDWPMKVKKVYDNILSDL--GLQPNSLPLLVGELVSEGQGGACA 232
Query: 207 ---PFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLT 242
P I+ +L +P V V + G D LH +
Sbjct: 233 SMNPVIQ-----KLPETIPVVHVVSSEGCEAVSDRLHFS 266
>gi|217325788|ref|ZP_03441872.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|261226609|ref|ZP_05940890.1| hypothetical protein EscherichiacoliO157_18762 [Escherichia coli
O157:H7 str. FRIK2000]
gi|416307248|ref|ZP_11654492.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|417212563|ref|ZP_12022180.1| PF03629 domain protein [Escherichia coli JB1-95]
gi|419095132|ref|ZP_13640405.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
gi|419206184|ref|ZP_13749334.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
gi|419889201|ref|ZP_14409620.1| hypothetical protein ECO9570_18841 [Escherichia coli O111:H8 str.
CVM9570]
gi|423724093|ref|ZP_17698241.1| hypothetical protein ECPA31_3030 [Escherichia coli PA31]
gi|424106963|ref|ZP_17841595.1| hypothetical protein EC93001_5809 [Escherichia coli 93-001]
gi|424148026|ref|ZP_17879438.1| hypothetical protein ECPA15_3350 [Escherichia coli PA15]
gi|424471847|ref|ZP_17921610.1| hypothetical protein ECPA41_5719 [Escherichia coli PA41]
gi|424472134|ref|ZP_17921853.1| hypothetical protein ECPA42_5836 [Escherichia coli PA42]
gi|424497495|ref|ZP_17944846.1| hypothetical protein ECTW09195_6116 [Escherichia coli TW09195]
gi|425177806|ref|ZP_18575881.1| hypothetical protein ECFRIK1999_0506 [Escherichia coli FRIK1999]
gi|425190089|ref|ZP_18587256.1| hypothetical protein ECFRIK1997_6230 [Escherichia coli FRIK1997]
gi|425217080|ref|ZP_18612260.1| hypothetical protein ECPA23_1720 [Escherichia coli PA23]
gi|425226822|ref|ZP_18621280.1| hypothetical protein ECPA49_4881 [Escherichia coli PA49]
gi|425230977|ref|ZP_18625105.1| hypothetical protein ECPA45_2883 [Escherichia coli PA45]
gi|428962500|ref|ZP_19033727.1| hypothetical protein EC900091_6024 [Escherichia coli 90.0091]
gi|428986992|ref|ZP_19056349.1| hypothetical protein EC930055_5755 [Escherichia coli 93.0055]
gi|428987128|ref|ZP_19056466.1| hypothetical protein EC930056_5604 [Escherichia coli 93.0056]
gi|429834622|ref|ZP_19364925.1| hypothetical protein EC970010_4288 [Escherichia coli 97.0010]
gi|444987395|ref|ZP_21304169.1| hypothetical protein ECPA11_4008 [Escherichia coli PA11]
gi|217322009|gb|EEC30433.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
TW14588]
gi|326348017|gb|EGD71727.1| YjhS [Escherichia coli O157:H7 str. 1044]
gi|377937676|gb|EHV01452.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
gi|378042815|gb|EHW05260.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
gi|386194803|gb|EIH89046.1| PF03629 domain protein [Escherichia coli JB1-95]
gi|388358017|gb|EIL22505.1| hypothetical protein ECO9570_18841 [Escherichia coli O111:H8 str.
CVM9570]
gi|390671318|gb|EIN47765.1| hypothetical protein EC93001_5809 [Escherichia coli 93-001]
gi|390701701|gb|EIN75920.1| hypothetical protein ECPA15_3350 [Escherichia coli PA15]
gi|390743664|gb|EIO14620.1| hypothetical protein ECPA31_3030 [Escherichia coli PA31]
gi|390760319|gb|EIO29653.1| hypothetical protein ECPA41_5719 [Escherichia coli PA41]
gi|390782239|gb|EIO49891.1| hypothetical protein ECPA42_5836 [Escherichia coli PA42]
gi|390813863|gb|EIO80463.1| hypothetical protein ECTW09195_6116 [Escherichia coli TW09195]
gi|408098264|gb|EKH31067.1| hypothetical protein ECFRIK1997_6230 [Escherichia coli FRIK1997]
gi|408110490|gb|EKH42290.1| hypothetical protein ECFRIK1999_0506 [Escherichia coli FRIK1999]
gi|408137939|gb|EKH67631.1| hypothetical protein ECPA49_4881 [Escherichia coli PA49]
gi|408146706|gb|EKH75782.1| hypothetical protein ECPA23_1720 [Escherichia coli PA23]
gi|408147880|gb|EKH76789.1| hypothetical protein ECPA45_2883 [Escherichia coli PA45]
gi|427236399|gb|EKW03977.1| hypothetical protein EC930055_5755 [Escherichia coli 93.0055]
gi|427238727|gb|EKW06232.1| hypothetical protein EC900091_6024 [Escherichia coli 90.0091]
gi|427252964|gb|EKW19419.1| hypothetical protein EC930056_5604 [Escherichia coli 93.0056]
gi|429253514|gb|EKY37998.1| hypothetical protein EC970010_4288 [Escherichia coli 97.0010]
gi|444590860|gb|ELV66159.1| hypothetical protein ECPA11_4008 [Escherichia coli PA11]
Length = 614
Score = 42.4 bits (98), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|417240975|ref|ZP_12037088.1| PF03629 domain protein [Escherichia coli 9.0111]
gi|386212289|gb|EII22735.1| PF03629 domain protein [Escherichia coli 9.0111]
Length = 695
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 119 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 165
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 166 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 225
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 226 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 285
Query: 160 QGESD 164
QGE D
Sbjct: 286 QGEGD 290
>gi|419085790|ref|ZP_13631173.1| hypothetical protein ECDEC4B_1714, partial [Escherichia coli DEC4B]
gi|377935118|gb|EHU98935.1| hypothetical protein ECDEC4B_1714, partial [Escherichia coli DEC4B]
Length = 234
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAHGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GIFSESTGASQDSARWGVGKPLYQDLIARTKAALQ 219
>gi|419396271|ref|ZP_13937049.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC15B]
gi|378247605|gb|EHY07521.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein,
partial [Escherichia coli DEC15B]
Length = 576
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------ 124
+AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTLGAEGTFSESTGASQ 204
Query: 125 NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
+ ++W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFT 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|421821525|ref|ZP_16256971.1| hypothetical protein ECFRIK920_6138 [Escherichia coli FRIK920]
gi|408077439|gb|EKH11644.1| hypothetical protein ECFRIK920_6138 [Escherichia coli FRIK920]
Length = 601
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|420110955|ref|ZP_14620844.1| hypothetical protein ECO9553_29323 [Escherichia coli O111:H11 str.
CVM9553]
gi|429076934|ref|ZP_19140154.1| hypothetical protein EC990713_0789 [Escherichia coli 99.0713]
gi|394400053|gb|EJE76009.1| hypothetical protein ECO9553_29323 [Escherichia coli O111:H11 str.
CVM9553]
gi|427334576|gb|EKW95645.1| hypothetical protein EC990713_0789 [Escherichia coli 99.0713]
Length = 643
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 77 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 123
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 124 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 183
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 184 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 243
Query: 160 QGESD 164
QGE D
Sbjct: 244 QGEGD 248
>gi|431798015|ref|YP_007224919.1| hypothetical protein Echvi_2669 [Echinicola vietnamensis DSM 17526]
gi|430788780|gb|AGA78909.1| protein of unknown function (DUF303) [Echinicola vietnamensis DSM
17526]
Length = 278
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 59/266 (22%), Positives = 101/266 (37%), Gaps = 43/266 (16%)
Query: 6 LCLILVSEAWPVKCQYQQQQLIILA--GQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPN 63
+ L+LV V Q Q + I GQSNM G R + + C
Sbjct: 8 ISLVLVIMTLGVSAQAQDKNFYIFLAFGQSNMEGAAKFEEQDREVNPRFQVLQSIDC--- 64
Query: 64 PSILRLTAKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGG 123
P + R K +W A PL G+ P F ++ +P+ +G++ ++GG
Sbjct: 65 PDLGR--EKGQWYPAVPPL-----TRCHTGLTPADYFGRTLVKNLPDSIRVGVINVSVGG 117
Query: 124 TNISQWRKGS---------------------SLYEQMIQRAQVALRGGGTIRAVLWYQGE 162
I+ + K + Y +++ A+ A + G I+ +L +QGE
Sbjct: 118 CKIALFEKDTYSSYVDTAPDWMLNMIKVYDGDPYGHLVELARKA-QEDGVIKGILLHQGE 176
Query: 163 SDTVNLEDAKLYKERSDMFFTDLRSDL-----QSPLLPIIRVALASGEGPFIEIVRKAQL 217
S+T +++ + + + +L SDL + PLL V+ G A L
Sbjct: 177 SNTGDVQ----WPNKVKGVYENLLSDLGLVPEEVPLLAGEMVSAEQGGKCASMNAIIATL 232
Query: 218 SSDLPNVRCVDAMGLPLEPDGLHLTT 243
+PN + + DGLH +
Sbjct: 233 PEVIPNAHVISSQDCEAVSDGLHFSA 258
>gi|420131650|ref|ZP_14640075.1| hypothetical protein ECO9952_02169 [Escherichia coli O26:H11 str.
CVM9952]
gi|394431499|gb|EJF03699.1| hypothetical protein ECO9952_02169 [Escherichia coli O26:H11 str.
CVM9952]
Length = 643
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 77 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 123
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 124 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 183
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 184 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 243
Query: 160 QGESD 164
QGE D
Sbjct: 244 QGEGD 248
>gi|419391929|ref|ZP_13932743.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
gi|419402342|ref|ZP_13943066.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|419407455|ref|ZP_13948145.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
gi|419413028|ref|ZP_13953683.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
gi|378238050|gb|EHX98063.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15A]
gi|378246876|gb|EHY06795.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15C]
gi|378254866|gb|EHY14728.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15D]
gi|378259413|gb|EHY19226.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC15E]
Length = 625
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 27/130 (20%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------ 124
+AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 145 NADLSKGQYGCVGQGLHIAKRLLPYIPQNAGILLVPCCRGGSAFTLGAEGTFSESTGASQ 204
Query: 125 NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNLEDAKLYKERSDMF- 181
+ ++W G LY+ +I R + AL+ + AV W QGE D A Y ++ +F
Sbjct: 205 DSARWGVGKPLYQDLILRTKAALQKNPKNMLLAVCWMQGEFDM----SAATYSQQPPLFT 260
Query: 182 --FTDLRSDL 189
R+D+
Sbjct: 261 AMLKQFRADI 270
>gi|419255314|ref|ZP_13797835.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
gi|378101067|gb|EHW62757.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10A]
Length = 502
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 82/213 (38%), Gaps = 67/213 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAKL---------- 73
+++LAGQSN G +G+ P+ +P P I++L +
Sbjct: 51 VVVLAGQSNGMAYG-------------EGLPLPETYDRPEPRIMQLARRSTVTPGGKACQ 97
Query: 74 --KWVLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+LA LH AD+ + VG GL A +L +P I LVPC
Sbjct: 98 YNDIILADHCLHDVQDMSGKNHPKADVAKGQYGTVGQGLHIAKKLLPFIPADAGILLVPC 157
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
GG+ + ++W LY+ +I R + AL R AV+W
Sbjct: 158 CRGGSAFTAGADGTYSDSTGASEDSARWGVDKPLYKDLISRTKAALAKNPKNRLLAVVWM 217
Query: 160 QGESDTVNLEDAKLYKERSDMFFT---DLRSDL 189
QGE D DAK E S +F R+DL
Sbjct: 218 QGEFDI----DAKP-TEHSALFLAMVEKFRADL 245
>gi|419862493|ref|ZP_14385097.1| hypothetical protein ECO9340_23221, partial [Escherichia coli
O103:H25 str. CVM9340]
gi|388345087|gb|EIL10881.1| hypothetical protein ECO9340_23221, partial [Escherichia coli
O103:H25 str. CVM9340]
Length = 330
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|424562615|ref|ZP_18003735.1| hypothetical protein ECEC4437_2032, partial [Escherichia coli
EC4437]
gi|425242368|ref|ZP_18635831.1| hypothetical protein ECMA6_2175, partial [Escherichia coli MA6]
gi|390900647|gb|EIP59863.1| hypothetical protein ECEC4437_2032, partial [Escherichia coli
EC4437]
gi|408166047|gb|EKH93679.1| hypothetical protein ECMA6_2175, partial [Escherichia coli MA6]
Length = 126
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|294635253|ref|ZP_06713755.1| conserved hypothetical YjhS family protein encoded by [Edwardsiella
tarda ATCC 23685]
gi|451967014|ref|ZP_21920261.1| putative 9-O-acetyl-N-acetylneuraminic acid deacetylase
[Edwardsiella tarda NBRC 105688]
gi|291091370|gb|EFE23931.1| conserved hypothetical YjhS family protein encoded by [Edwardsiella
tarda ATCC 23685]
gi|451314167|dbj|GAC65623.1| putative 9-O-acetyl-N-acetylneuraminic acid deacetylase
[Edwardsiella tarda NBRC 105688]
Length = 348
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 49/126 (38%), Gaps = 23/126 (18%)
Query: 83 HADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ-------------- 128
HAD + V L +L +P I +VPCA GG+ +Q
Sbjct: 94 HADARRGEYGCVAQALHIGKTLLPYLPAEAGILIVPCARGGSAFTQGNLGAYHPARGATA 153
Query: 129 ----WRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDTVNLEDAK---LYKERSD 179
W + LY+ + R + ALR R AV+W QGE D E A L++
Sbjct: 154 DACRWGVATPLYQDLRDRTRAALRHNPDNRLLAVIWIQGEFDLTTAEYAHQPALFQAMVA 213
Query: 180 MFFTDL 185
F D+
Sbjct: 214 RFRADM 219
>gi|429004994|ref|ZP_19073031.1| hypothetical protein EC950183_5428 [Escherichia coli 95.0183]
gi|429910620|ref|ZP_19376577.1| hypothetical protein MO7_02861 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|427255384|gb|EKW21652.1| hypothetical protein EC950183_5428 [Escherichia coli 95.0183]
gi|429457013|gb|EKZ92855.1| hypothetical protein MO7_02861 [Escherichia coli O104:H4 str.
Ec11-9941]
Length = 685
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 119 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 165
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 166 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 225
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 226 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 285
Query: 160 QGESD 164
QGE D
Sbjct: 286 QGEGD 290
>gi|424762104|ref|ZP_18189629.1| hypothetical protein CFSAN001630_18273, partial [Escherichia coli
O111:H11 str. CFSAN001630]
gi|421941573|gb|EKT98962.1| hypothetical protein CFSAN001630_18273, partial [Escherichia coli
O111:H11 str. CFSAN001630]
Length = 290
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 135 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 194
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 195 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 254
Query: 182 FTDL 185
DL
Sbjct: 255 RADL 258
>gi|420095907|ref|ZP_14607373.1| hypothetical protein ECO9634_25452, partial [Escherichia coli
O111:H8 str. CVM9634]
gi|394391194|gb|EJE68083.1| hypothetical protein ECO9634_25452, partial [Escherichia coli
O111:H8 str. CVM9634]
Length = 231
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 52 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 111
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 112 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 171
Query: 182 FTDL 185
DL
Sbjct: 172 RADL 175
>gi|260557965|ref|ZP_05830177.1| cellulosome enzyme [Acinetobacter baumannii ATCC 19606 = CIP 70.34]
gi|260408475|gb|EEX01781.1| cellulosome enzyme [Acinetobacter baumannii ATCC 19606 = CIP 70.34]
Length = 604
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 60/147 (40%), Gaps = 13/147 (8%)
Query: 109 PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALR--GGGT--IRAVLWYQGESD 164
P VI GG I Q KG++ YE ++ A R G T ++A+ W QGE+D
Sbjct: 326 PKDHVIFCSAAGHGGYRIDQLEKGTTWYEFLLHHVSEAKRLNSGKTYKVQAIAWVQGEND 385
Query: 165 TV--NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLP 222
+ +LY+++ + D D++ + + S + + AQ L
Sbjct: 386 AITGTQTSYELYRQKLEKLQRDANDDIKEITGQVDDIKFISYQLSYAARTWSAQALVQLH 445
Query: 223 NVRCVDAMGL-------PLEPDGLHLT 242
+ D+ L P PD +HLT
Sbjct: 446 LAQESDSFALSTPMYHMPYAPDNIHLT 472
>gi|241258862|ref|YP_002978746.1| hypothetical protein Rleg_6243 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240863332|gb|ACS60995.1| protein of unknown function DUF303 acetylesterase putative
[Rhizobium leguminosarum bv. trifolii WSM1325]
Length = 312
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 3/115 (2%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
N ++ N VI L P A G+ +++W G ++ + G I VLW
Sbjct: 133 LGNNLIASGQNDNVI-LAPLAYSGSEVARWAAGGDFNPVLVDTVKQLQGSGYRITNVLWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIEIV 212
QGE+D V AK Y+ER LR +++P+ + I L G F E +
Sbjct: 192 QGEADLVMGTTAKAYQERFMSMVDTLRQHGVEAPVYISIASKCLEPSNGGFKEHI 246
>gi|444986496|ref|ZP_21303284.1| hypothetical protein ECPA11_3101, partial [Escherichia coli PA11]
gi|444593209|gb|ELV68437.1| hypothetical protein ECPA11_3101, partial [Escherichia coli PA11]
Length = 631
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417226653|ref|ZP_12029033.1| PF08410 domain protein [Escherichia coli 5.0959]
gi|386208869|gb|EII13368.1| PF08410 domain protein [Escherichia coli 5.0959]
Length = 359
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|380692977|ref|ZP_09857836.1| sialic acid-specific acetylesterase [Bacteroides faecis MAJ27]
Length = 479
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
++ A++A ++ LWYQGES N ++A LY+ F TDLR+ LP
Sbjct: 248 VLYNAKIAPLTHFAVKGFLWYQGES---NRDNAGLYQSLMPAFVTDLRAKWGRGELPFYF 304
Query: 199 VALA-----SGEGPFIEIVRKAQLSS--DLPNVRCVDAMGL 232
V +A +G +R+ QL + D+PN V M +
Sbjct: 305 VQIAPFNYEGADGTSAARLREVQLQNMKDIPNSGMVTTMDV 345
>gi|7649865|dbj|BAA94143.1| hypothetical protein [Enterobacteria phage VT2-Sakai]
Length = 492
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 9 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 68
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 69 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 128
Query: 182 FTDL 185
DL
Sbjct: 129 RADL 132
>gi|55859419|emb|CAE53950.1| hypothetical protein [Enterobacteria phage 2851]
gi|209407411|emb|CAQ82027.1| conserved hypothetical protein [Enterobacteria phage 2851]
Length = 280
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 80 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 138
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 139 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 198
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 199 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|15830461|ref|NP_309234.1| hypothetical protein ECs1207 [Escherichia coli O157:H7 str. Sakai]
gi|302393159|ref|YP_003828989.1| hypothetical protein Stx2II_gp76 [Stx2 converting phage II]
gi|13360667|dbj|BAB34630.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
gi|32128326|dbj|BAC78129.1| hypothetical protein [Stx2 converting phage II]
Length = 634
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419269636|ref|ZP_13811976.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|421812058|ref|ZP_16247818.1| hypothetical protein EC80416_1850 [Escherichia coli 8.0416]
gi|424134281|ref|ZP_17866828.1| hypothetical protein ECPA10_2624 [Escherichia coli PA10]
gi|378106329|gb|EHW67958.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|390702047|gb|EIN76264.1| hypothetical protein ECPA10_2624 [Escherichia coli PA10]
gi|408603046|gb|EKK76715.1| hypothetical protein EC80416_1850 [Escherichia coli 8.0416]
Length = 672
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 106 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 152
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 153 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 212
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 213 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 272
Query: 160 QGESD 164
QGE D
Sbjct: 273 QGEGD 277
>gi|373852183|ref|ZP_09594983.1| Sialate O-acetylesterase [Opitutaceae bacterium TAV5]
gi|372474412|gb|EHP34422.1| Sialate O-acetylesterase [Opitutaceae bacterium TAV5]
Length = 485
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 58/147 (39%), Gaps = 25/147 (17%)
Query: 115 GLVPCAIGGTNISQW-------RKGSSLYEQMIQRAQVALRG----------GGTIRAVL 157
GLV A GGT + W R S Q + A G G +R +L
Sbjct: 209 GLVTSAWGGTTVEAWISEEAFDRHAISAVVQSGSENRRAPSGAFNAMIHPIIGVGLRGIL 268
Query: 158 WYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVAL---ASGEGPFIEIVRK 214
WYQGE+ N + Y D R +SP LP + V L S EG +R+
Sbjct: 269 WYQGEA---NAREPDGYGALFRALIADWRQRWESPALPFLFVQLPNYGSTEGINWAQIRQ 325
Query: 215 AQLSS-DLPNVRCVDAMGLPLEPDGLH 240
Q S+ DLP + L EP G+H
Sbjct: 326 GQASALDLPATAMAVTIDL-GEPRGIH 351
>gi|417121429|ref|ZP_11970857.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386148281|gb|EIG94718.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 328
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 73/185 (39%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWM 246
Query: 160 QGESD 164
QGE D
Sbjct: 247 QGEFD 251
>gi|417298019|ref|ZP_12085261.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386258287|gb|EIJ13766.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 237
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQ 219
>gi|417298954|ref|ZP_12086190.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386257590|gb|EIJ13075.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 237
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQ 219
>gi|419326312|ref|ZP_13867980.1| hypothetical protein ECDEC12C_5465 [Escherichia coli DEC12C]
gi|378179945|gb|EHX40649.1| hypothetical protein ECDEC12C_5465 [Escherichia coli DEC12C]
Length = 237
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 68/169 (40%), Gaps = 35/169 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C N I+ L
Sbjct: 66 VVVLAGQSNGMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGESCTYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQ 160
W G LY+ +I R + AL+ + AV W Q
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQKNPKNVLLAVCWMQ 233
>gi|420134805|ref|ZP_14642905.1| hypothetical protein ECO9952_03481, partial [Escherichia coli
O26:H11 str. CVM9952]
gi|394420926|gb|EJE94424.1| hypothetical protein ECO9952_03481, partial [Escherichia coli
O26:H11 str. CVM9952]
Length = 357
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 85 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 131
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 132 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 191
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 192 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 251
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 252 QGEFDFGGTPVNHAAQFGALVDKFRADL 279
>gi|419210971|ref|ZP_13754044.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
gi|419875272|ref|ZP_14397141.1| hypothetical protein ECO9534_05053 [Escherichia coli O111:H11 str.
CVM9534]
gi|419884487|ref|ZP_14405430.1| hypothetical protein ECO9545_02350 [Escherichia coli O111:H11 str.
CVM9545]
gi|420101878|ref|ZP_14612936.1| hypothetical protein ECO9455_02897 [Escherichia coli O111:H11 str.
CVM9455]
gi|424763385|ref|ZP_18190863.1| hypothetical protein CFSAN001630_20440 [Escherichia coli O111:H11
str. CFSAN001630]
gi|378051516|gb|EHW13832.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8C]
gi|388349314|gb|EIL14827.1| hypothetical protein ECO9534_05053 [Escherichia coli O111:H11 str.
CVM9534]
gi|388354386|gb|EIL19304.1| hypothetical protein ECO9545_02350 [Escherichia coli O111:H11 str.
CVM9545]
gi|394413787|gb|EJE87783.1| hypothetical protein ECO9455_02897 [Escherichia coli O111:H11 str.
CVM9455]
gi|421940114|gb|EKT97594.1| hypothetical protein CFSAN001630_20440 [Escherichia coli O111:H11
str. CFSAN001630]
Length = 617
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 74/185 (40%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLTAK---------LK 74
+I++AGQSN + G +G+ P +P+P I++L + K
Sbjct: 51 VIVIAGQSNASSYG-------------EGLPLPDSYDRPDPRIMQLARRNTQTPGGIPCK 97
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A VL +P I LVPC
Sbjct: 98 YNEIIPADHCLHDVQNMSLLNHPKADLKKGQYGCVGQGLHIAKKVLPVIPADAGILLVPC 157
Query: 120 AIGGTNIS------------------QWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
A GG+ + +W + LY+ ++ R + AL + +V+W
Sbjct: 158 ARGGSAFTTGAVGSFDPASGAAEASLRWGVDTPLYQDLVSRTKAALEANPKNVLLSVVWI 217
Query: 160 QGESD 164
QGE D
Sbjct: 218 QGEGD 222
>gi|260868597|ref|YP_003234999.1| hypothetical protein ECO111_2588 [Escherichia coli O111:H- str.
11128]
gi|257764953|dbj|BAI36448.1| hypothetical protein ECO111_2588 [Escherichia coli O111:H- str.
11128]
Length = 645
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|420114256|ref|ZP_14623937.1| hypothetical protein ECO10021_01858, partial [Escherichia coli
O26:H11 str. CVM10021]
gi|394409960|gb|EJE84399.1| hypothetical protein ECO10021_01858, partial [Escherichia coli
O26:H11 str. CVM10021]
Length = 425
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|423034210|ref|ZP_17024894.1| hypothetical protein EUJG_03269, partial [Escherichia coli O104:H4
str. 11-4623]
gi|354887537|gb|EHF47812.1| hypothetical protein EUJG_03269, partial [Escherichia coli O104:H4
str. 11-4623]
Length = 279
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419222083|ref|ZP_13765007.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
gi|378065643|gb|EHW27786.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
Length = 402
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|444991253|ref|ZP_21307922.1| hypothetical protein ECPA19_2527, partial [Escherichia coli PA19]
gi|444608550|gb|ELV83066.1| hypothetical protein ECPA19_2527, partial [Escherichia coli PA19]
Length = 252
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|419862546|ref|ZP_14385143.1| hypothetical protein ECO9340_08723, partial [Escherichia coli
O103:H25 str. CVM9340]
gi|388344846|gb|EIL10660.1| hypothetical protein ECO9340_08723, partial [Escherichia coli
O103:H25 str. CVM9340]
Length = 320
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 73/185 (39%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 85 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 131
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 132 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 191
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 192 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 251
Query: 160 QGESD 164
QGE D
Sbjct: 252 QGEFD 256
>gi|419284694|ref|ZP_13826870.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
gi|378131948|gb|EHW93301.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10F]
Length = 462
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|419262433|ref|ZP_13804844.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
gi|378104395|gb|EHW66053.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
Length = 439
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|419221846|ref|ZP_13764772.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
gi|378066112|gb|EHW28250.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8E]
Length = 645
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419260327|ref|ZP_13802762.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
gi|378111013|gb|EHW72603.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
Length = 526
Score = 41.6 bits (96), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|402488641|ref|ZP_10835450.1| hypothetical protein RCCGE510_12995 [Rhizobium sp. CCGE 510]
gi|401812406|gb|EJT04759.1| hypothetical protein RCCGE510_12995 [Rhizobium sp. CCGE 510]
Length = 311
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 78/190 (41%), Gaps = 26/190 (13%)
Query: 16 PVKC--QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKL 73
PV C Q + +++L GQSN A GG + + + +C
Sbjct: 66 PVPCPTQTDRTAVLLLLGQSNAANDGGQRHRSNYGARVVNAF-DKRC------------- 111
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+A PL D T G L N ++ N VI L P A G+ +++W G
Sbjct: 112 --FIAASPLLGSTD---TKGEYWTL-LGNELIASGQNDSVI-LAPLAYSGSEVARWAAGG 164
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPL 193
L +++ + G I +VLW QGE D V A+ Y+ D F + + + Q +
Sbjct: 165 DLNPVLVETMKQLQDSGYRITSVLWVQGEKDLVMGTTAEAYR---DYFLSMVDTLRQHGV 221
Query: 194 LPIIRVALAS 203
+ +++AS
Sbjct: 222 EAPVYISIAS 231
>gi|419104652|ref|ZP_13649781.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
gi|377947135|gb|EHV10802.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4E]
Length = 645
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|424223507|ref|ZP_17889275.1| hypothetical protein ECPA25_1759, partial [Escherichia coli PA25]
gi|390729155|gb|EIO01379.1| hypothetical protein ECPA25_1759, partial [Escherichia coli PA25]
Length = 130
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|419091710|ref|ZP_13637018.1| hypothetical protein ECDEC4C_1625, partial [Escherichia coli DEC4C]
gi|377946932|gb|EHV10604.1| hypothetical protein ECDEC4C_1625, partial [Escherichia coli DEC4C]
Length = 220
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGHGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQ 219
>gi|419061544|ref|ZP_13608311.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3D]
gi|377915957|gb|EHU80055.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3D]
Length = 329
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 152 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 211
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 212 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 271
Query: 182 FTDL 185
DL
Sbjct: 272 RADL 275
>gi|218886687|ref|YP_002436008.1| hypothetical protein DvMF_1592 [Desulfovibrio vulgaris str.
'Miyazaki F']
gi|218757641|gb|ACL08540.1| hypothetical protein DvMF_1592 [Desulfovibrio vulgaris str.
'Miyazaki F']
Length = 296
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 64/145 (44%), Gaps = 9/145 (6%)
Query: 120 AIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYK-ERS 178
A GG+ +S W G + ++ R + + V W+ GESD +N LYK +
Sbjct: 148 AEGGSPLSYWLPGGPVRPKLEDRLRAIQQLPIRPDYVFWFHGESDALNSLPRLLYKYDFL 207
Query: 179 DMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQ--LSSDLPNVRC---VDAMGLP 233
D+ T + +P+L + + +L G E VR+AQ L+ +PNV D +GLP
Sbjct: 208 DLVGTLRTFGIDNPVL-VSQTSLCRRFGS--ESVRQAQQELARQVPNVTLGPDTDEVGLP 264
Query: 234 LEPDGLHLTTPAQGSTLNSWSNEAL 258
DG H T W + L
Sbjct: 265 FRRDGCHFTDEGGDIVAGLWMDAML 289
>gi|115524376|ref|YP_781287.1| hypothetical protein RPE_2367 [Rhodopseudomonas palustris BisA53]
gi|115518323|gb|ABJ06307.1| protein of unknown function DUF303, acetylesterase putative
[Rhodopseudomonas palustris BisA53]
Length = 399
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 63/164 (38%), Gaps = 23/164 (14%)
Query: 116 LVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGG-TIRAVLWYQGESDTVNL-----E 169
+ P AI GT + +WR Y +++ A LR G +LW+QGE + + E
Sbjct: 223 IAPIAISGTYLEEWRARGGKYFEVVLSALAGLREHGLEPTGILWHQGEFNALAFTANTAE 282
Query: 170 DAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFI------------EIVRKAQL 217
DA + M S +++ L I + A P EI+R AQ+
Sbjct: 283 DATQLTVTTPMREAARLSYIRNYLEIIAGLRAADANAPIFVATATRCGGAQDEIIRSAQM 342
Query: 218 SSDLPNVRC-----VDAMGLPLEPDGLHLTTPAQGSTLNSWSNE 256
S P + D +G + DG H+T W++
Sbjct: 343 SIPNPTLGIYAGPDTDLIGPSMRSDGCHMTHAGTDQHARMWADR 386
>gi|417226914|ref|ZP_12029108.1| PF08410 domain protein [Escherichia coli 5.0959]
gi|386208692|gb|EII13193.1| PF08410 domain protein [Escherichia coli 5.0959]
Length = 237
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSADTGASQDSARWGVGKPLYQDLIARTKAALQ 219
>gi|218689480|ref|YP_002397692.1| hypothetical protein ECED1_1725 [Escherichia coli ED1a]
gi|218427044|emb|CAR07919.2| conserved hypothetical protein from phage origin [Escherichia coli
ED1a]
Length = 662
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 52/125 (41%), Gaps = 28/125 (22%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNI----------------- 126
AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFLEGDEGTFSESTGASET 210
Query: 127 -SQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--------TVNLEDAKLYK 175
++W LY+ ++ R Q AL+ + AV+W QGE D L D+ + K
Sbjct: 211 SARWGVDKPLYKDLLTRTQAALKANPKNILLAVVWMQGEFDLKQGAYATQPGLFDSMVEK 270
Query: 176 ERSDM 180
RSD+
Sbjct: 271 YRSDL 275
>gi|425199019|ref|ZP_18595448.1| hypothetical protein ECNE037_2275, partial [Escherichia coli NE037]
gi|408122161|gb|EKH53035.1| hypothetical protein ECNE037_2275, partial [Escherichia coli NE037]
Length = 191
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 56 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 115
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 116 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 156
>gi|417155150|ref|ZP_11993279.1| PF08410 domain protein [Escherichia coli 96.0497]
gi|386168239|gb|EIH34755.1| PF08410 domain protein [Escherichia coli 96.0497]
Length = 654
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESDTVNL--EDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKSPKNVLFAVVWMQGEFDFGGMPANHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419200921|ref|ZP_13744165.1| hypothetical protein ECDEC8A_5990 [Escherichia coli DEC8A]
gi|378037181|gb|EHV99715.1| hypothetical protein ECDEC8A_5990 [Escherichia coli DEC8A]
Length = 293
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|420310207|ref|ZP_14812143.1| yjhS [Escherichia coli EC1738]
gi|390900346|gb|EIP59566.1| yjhS [Escherichia coli EC1738]
Length = 330
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 80 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 138
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 139 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 198
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 199 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|419056011|ref|ZP_13602857.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3C]
gi|377911714|gb|EHU75882.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3C]
Length = 249
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 72 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 131
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 132 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 191
Query: 182 FTDL 185
DL
Sbjct: 192 RADL 195
>gi|190891675|ref|YP_001978217.1| hypothetical protein RHECIAT_CH0002080 [Rhizobium etli CIAT 652]
gi|190696954|gb|ACE91039.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 311
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 3/115 (2%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
N ++ N VI L P A G+ +++W G L +I + G + +VLW
Sbjct: 132 LGNELIASGQNDSVI-LAPLAYSGSEVARWAAGGDLNAVLIDTLKKLRDTGYRVTSVLWV 190
Query: 160 QGESDTVNLEDAKLYKERS-DMFFTDLRSDLQSPL-LPIIRVALASGEGPFIEIV 212
QGE+D V A+ Y+ER M T + +++P+ + I L G F E +
Sbjct: 191 QGEADFVLGTTAEAYQERFLSMVDTLHQHGVEAPVYISIASKCLEPSNGGFKEHI 245
>gi|420287679|ref|ZP_14789868.1| hypothetical protein ECTW10246_3550, partial [Escherichia coli
TW10246]
gi|390789816|gb|EIO57256.1| hypothetical protein ECTW10246_3550, partial [Escherichia coli
TW10246]
Length = 314
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|452949308|gb|EME54776.1| hypothetical protein G347_12953 [Acinetobacter baumannii MSP4-16]
Length = 804
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 60/147 (40%), Gaps = 13/147 (8%)
Query: 109 PNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALR--GGGT--IRAVLWYQGESD 164
P VI GG I Q KG++ YE ++ A R G T ++A+ W QGE+D
Sbjct: 500 PKDHVIFCSAAGHGGYRIDQLEKGTTWYEFLLHHVSEAKRLNSGKTYKVQAIAWVQGEND 559
Query: 165 TV--NLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIVRKAQLSSDLP 222
+ +LY+++ + D D++ + + S + + AQ L
Sbjct: 560 AITGTQTSYELYRQKLEKLQRDANDDIKEITGQVDDIKFISYQLSYAARTWSAQALVQLH 619
Query: 223 NVRCVDAMGL-------PLEPDGLHLT 242
+ D+ L P PD +HLT
Sbjct: 620 LAQESDSFALSTPMYHMPYAPDNIHLT 646
>gi|86360871|ref|YP_472758.1| hypothetical protein RHE_PF00140 [Rhizobium etli CFN 42]
gi|86284973|gb|ABC94031.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 312
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 3/115 (2%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
AN ++ N VI L P A GG+ +++W G L ++ + G I +VLW
Sbjct: 133 LANKLIGSGQNDSVI-LAPLAYGGSEVARWAAGGDLNPVLVDTMKQLQDSGYRITSVLWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIEIV 212
QGE+D V ++ Y++ LR +++P+ + I L G F E +
Sbjct: 192 QGEADLVMGTTSEAYQKHFMSMVDTLRQHGVEAPVYISIASKCLEPSNGGFKEHI 246
>gi|417266280|ref|ZP_12053648.1| PF08410 domain protein [Escherichia coli 3.3884]
gi|386231090|gb|EII58438.1| PF08410 domain protein [Escherichia coli 3.3884]
Length = 654
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419221155|ref|ZP_13764096.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
gi|378068971|gb|EHW31067.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8E]
Length = 237
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSADAGASQDSARWGVGKPLYQDLIARTKAALQ 219
>gi|372211265|ref|ZP_09499067.1| acetylxylan esterase [Flavobacteriaceae bacterium S85]
Length = 648
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 46/212 (21%), Positives = 83/212 (39%), Gaps = 33/212 (15%)
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG- 132
+W A PL + + G+ P F ++ ++P +GLVP A+GG +I + K
Sbjct: 442 RWYTAIPPL-----FHCSTGLSPADYFGRTLVEQLPEKIKVGLVPVAVGGCDIRIFDKDI 496
Query: 133 ---------------------SSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDA 171
+ Y +I A++A + G I+ +L +QGE++ +
Sbjct: 497 YQDYNATTKESWFVDKVRSYRGNPYGHLINLAKIAQK-SGVIKGILLHQGEANAGDKNWP 555
Query: 172 KLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPF---IEIVRKAQLSSDLPNVRCVD 228
K K DL D +S L V +G F +I+ L +P V
Sbjct: 556 KYVKSVYRNILKDLSLDAKSVPLIAGEVVHEDQKGMFGYMNQIIN--TLPQVIPTAHVVS 613
Query: 229 AMGLPLEPDGLHLTTPAQGSTLNSWSNEALRV 260
+ G ++ D LH + ++++ L +
Sbjct: 614 SKGCLVQEDNLHFNSEGVRKLGKRYADKILEI 645
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 99/247 (40%), Gaps = 44/247 (17%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKLKWVLAHEPLHA 84
+ + GQSNM G + +KL D + Q ++LR W A PL
Sbjct: 31 HIYLCFGQSNMEGSASIE---PKDKLVNDRFLAMQTTDCNNLLRTQGI--WYPAVPPLS- 84
Query: 85 DIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK------------- 131
G+ P F ++ +P+ +G++ AIGG++I + K
Sbjct: 85 ----QCYTGLSPADAFGKTMVKHLPDSIKVGVMNVAIGGSDIRLFDKEIYQNYLNTYPES 140
Query: 132 ---------GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
G + Y+++I+ A+ A + G I+ +L +QGE++T + + K+ +
Sbjct: 141 WFQDKINGYGGNPYQRLIELAKKAQKNG-VIKGILLHQGETNTGDKKWPLYVKKIYESML 199
Query: 183 TDLRSDL-QSPLLPIIRVALASGE-----GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEP 236
+DL + + PLL V G P I+ L + +P + + G +
Sbjct: 200 SDLSLNADEVPLLAGEVVGADQGGKCAAMNPIIQT-----LPNVIPTAHVISSKGCTVRD 254
Query: 237 DGLHLTT 243
D +H +
Sbjct: 255 DQVHFNS 261
>gi|291282464|ref|YP_003499282.1| YjhS [Escherichia coli O55:H7 str. CB9615]
gi|290762337|gb|ADD56298.1| YjhS [Escherichia coli O55:H7 str. CB9615]
Length = 648
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPGIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 190 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWM 249
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 250 QGEFDFGGTPVNHAAQFGALVDKFRADL 277
>gi|417270153|ref|ZP_12057513.1| PF08410 domain protein [Escherichia coli 3.3884]
gi|386228958|gb|EII56314.1| PF08410 domain protein [Escherichia coli 3.3884]
Length = 654
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|424756789|ref|ZP_18184584.1| hypothetical protein CFSAN001630_04393, partial [Escherichia coli
O111:H11 str. CFSAN001630]
gi|421949524|gb|EKU06466.1| hypothetical protein CFSAN001630_04393, partial [Escherichia coli
O111:H11 str. CFSAN001630]
Length = 306
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 80 VVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 138
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 139 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 198
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 199 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|86358257|ref|YP_470149.1| hypothetical protein RHE_CH02651 [Rhizobium etli CFN 42]
gi|86282359|gb|ABC91422.1| hypothetical protein RHE_CH02651 [Rhizobium etli CFN 42]
Length = 312
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 3/115 (2%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
AN ++ N VI L P A GG+ +++W G L ++ + G I +VLW
Sbjct: 133 LANKLIGSGQNDSVI-LAPLAYGGSEVARWAAGGDLNPVLVDTMKQLQDSGYRITSVLWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIEIV 212
QGE+D V ++ Y++ LR +++P+ + I L G F E +
Sbjct: 192 QGEADLVMGTTSEAYQKHFMSMVDTLRQHGVEAPVYISIASKCLEPSNGGFKEHI 246
>gi|419260343|ref|ZP_13802777.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
gi|378110918|gb|EHW72511.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10B]
Length = 438
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 62 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 121
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 122 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 181
Query: 182 FTDL 185
DL
Sbjct: 182 RADL 185
>gi|365121222|ref|ZP_09338213.1| hypothetical protein HMPREF1033_01559 [Tannerella sp.
6_1_58FAA_CT1]
gi|363645845|gb|EHL85098.1| hypothetical protein HMPREF1033_01559 [Tannerella sp.
6_1_58FAA_CT1]
Length = 468
Score = 41.6 bits (96), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 45/97 (46%), Gaps = 12/97 (12%)
Query: 130 RKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
R SSLY M VA G IR LWYQGES N+ D LY+ D R
Sbjct: 234 RAPSSLYNGM-----VAPIAGFGIRGFLWYQGES---NVGDPDLYRRLLPEMVKDWRKSW 285
Query: 190 QSPLLPIIRVALASGEGPFIE--IVRKAQLSS--DLP 222
+ LP V +A + P ++R+AQL + D+P
Sbjct: 286 NNDTLPFYYVQVAPYDYPNGNGALLREAQLKAYKDIP 322
>gi|417225108|ref|ZP_12028399.1| PF08410 domain protein [Escherichia coli 96.154]
gi|386200156|gb|EIH99147.1| PF08410 domain protein [Escherichia coli 96.154]
Length = 654
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|255532202|ref|YP_003092574.1| sialate O-acetylesterase [Pedobacter heparinus DSM 2366]
gi|255345186|gb|ACU04512.1| Sialate O-acetylesterase [Pedobacter heparinus DSM 2366]
Length = 485
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 8/90 (8%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEIV 212
I+ V+WYQGES N A Y+E + + R+ P LP + V LAS + +
Sbjct: 261 IKGVIWYQGES---NASRAYQYRELFPLMINNWRAKFNRPQLPFLFVQLAS-----FQAI 312
Query: 213 RKAQLSSDLPNVRCVDAMGLPLEPDGLHLT 242
+ +R AM L L+ G+ +T
Sbjct: 313 NPQPADAAWAELREAQAMALNLKNTGMAVT 342
>gi|417128402|ref|ZP_11975393.1| PF08410 domain protein [Escherichia coli 97.0246]
gi|386143863|gb|EIG90336.1| PF08410 domain protein [Escherichia coli 97.0246]
Length = 648
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 190 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKHLIGRTKAALKKNPKNVLLAVVWM 249
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 250 QGEFDFGGTPVNHAAQFGALVDKFRADL 277
>gi|419304373|ref|ZP_13846295.1| hypothetical protein ECDEC11D_5437 [Escherichia coli DEC11D]
gi|419314530|ref|ZP_13856377.1| hypothetical protein ECDEC11E_5132 [Escherichia coli DEC11E]
gi|378152717|gb|EHX13809.1| hypothetical protein ECDEC11E_5132 [Escherichia coli DEC11E]
gi|378154866|gb|EHX15931.1| hypothetical protein ECDEC11D_5437 [Escherichia coli DEC11D]
Length = 645
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 80 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 126
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 127 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 186
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 187 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 246
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 247 QGEFDFGGTPVNHAAQFGALVDKFRADL 274
>gi|417164781|ref|ZP_11999200.1| PF08410 domain protein [Escherichia coli 99.0741]
gi|386172517|gb|EIH44544.1| PF08410 domain protein [Escherichia coli 99.0741]
Length = 646
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 152 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 211
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + + A + D F
Sbjct: 212 STRWGVDKPLYKDLIGRTKAALKKSPKNVLFAVVWMQGEFDFGGMPVNHAAQFGALVDKF 271
Query: 182 FTDL 185
DL
Sbjct: 272 RADL 275
>gi|417143811|ref|ZP_11985773.1| PF08410 domain protein [Escherichia coli 1.2264]
gi|386164871|gb|EIH26656.1| PF08410 domain protein [Escherichia coli 1.2264]
Length = 539
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|218516445|ref|ZP_03513285.1| hypothetical protein Retl8_23721 [Rhizobium etli 8C-3]
Length = 312
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 86/216 (39%), Gaps = 38/216 (17%)
Query: 16 PVKC--QYQQQQLIILAGQSNMAGRGGVTNDTRTNKL---TWDGIVPPQCQPNPSILRLT 70
PV C Q + +++L GQSN A GG + + +DG
Sbjct: 67 PVACPAQTDRTAVLLLLGQSNAANDGGQRHRSEYGARVVNAFDG---------------- 110
Query: 71 AKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWR 130
+ +A PL D T G L N ++ N VI L P A G+ +++W
Sbjct: 111 ---RCFIAASPLLGSTD---TKGEYWTL-LGNELIASGQNDSVI-LAPLAYSGSEVARWA 162
Query: 131 KGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSD-L 189
G L +++ + G +VLW QGE D V A+ Y+E LR +
Sbjct: 163 AGGDLNAVLVETMKQLQASGYRATSVLWVQGEKDLVIGTTAEAYREYFLSMVDTLRQHGI 222
Query: 190 QSPL-LPIIRVALASGEGPFIE------IVRKAQLS 218
++P+ + I L G F E IVR AQLS
Sbjct: 223 EAPVYISIASKCLEPSNGGFKEHIPDNPIVR-AQLS 257
>gi|420309138|ref|ZP_14811091.1| yjhS [Escherichia coli EC1738]
gi|390902069|gb|EIP61206.1| yjhS [Escherichia coli EC1738]
Length = 331
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|424461558|ref|ZP_17912237.1| hypothetical protein ECPA39_1973, partial [Escherichia coli PA39]
gi|390773841|gb|EIO42161.1| hypothetical protein ECPA39_1973, partial [Escherichia coli PA39]
Length = 107
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 6 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 65
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 66 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 106
>gi|420276084|ref|ZP_14778373.1| yjhS [Escherichia coli PA40]
gi|390758437|gb|EIO27890.1| yjhS [Escherichia coli PA40]
Length = 331
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|417266124|ref|ZP_12053493.1| PF08410 domain protein [Escherichia coli 3.3884]
gi|386232117|gb|EII59464.1| PF08410 domain protein [Escherichia coli 3.3884]
Length = 646
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 152 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 211
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + + A + D F
Sbjct: 212 STRWGVDKPLYKDLIGRTKAALKKSPKNVLFAVVWMQGEFDFGGMPVNHAAQFGALVDKF 271
Query: 182 FTDL 185
DL
Sbjct: 272 RADL 275
>gi|417173198|ref|ZP_12003099.1| PF08410 domain protein, partial [Escherichia coli 3.2608]
gi|386179708|gb|EIH57186.1| PF08410 domain protein, partial [Escherichia coli 3.2608]
Length = 279
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|419204405|ref|ZP_13747586.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8B]
gi|378047840|gb|EHW10198.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC8B]
Length = 267
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|425098916|ref|ZP_18501664.1| hypothetical protein EC34870_3456, partial [Escherichia coli
3.4870]
gi|408550307|gb|EKK27638.1| hypothetical protein EC34870_3456, partial [Escherichia coli
3.4870]
Length = 403
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|332668360|ref|YP_004451148.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332337174|gb|AEE54275.1| protein of unknown function DUF303 acetylesterase
[Haliscomenobacter hydrossis DSM 1100]
Length = 647
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 78/180 (43%), Gaps = 38/180 (21%)
Query: 94 VGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK---------------------- 131
+ P F ++ +P +GLV A+ G+ I + K
Sbjct: 90 LSPADYFGRTMIQYLPEKISVGLVHVAVAGSKIEIFDKELYKTYLDTSAASRPWMIRMSD 149
Query: 132 --GSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDL 189
G + Y++++ A++A + G I+ +L +QGES+T + K + + + DL +DL
Sbjct: 150 AYGGNPYQRLVDMARIAQQNG-VIKGILLHQGESNTGD----KAWPAKVKKIYDDLLADL 204
Query: 190 Q-SP-LLPIIRVALASGE-----GPFIEIVRKAQLSSDLPNVRCVDAMGLPLEPDGLHLT 242
+ +P +P++ L + + EI+ A L LP + + GL PD LH +
Sbjct: 205 KLAPNSIPLLAGELVNADQGGKCASMNEII--ATLPQTLPRAMVIPSFGLEAVPDKLHFS 262
>gi|419080029|ref|ZP_13625498.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4A]
gi|419087023|ref|ZP_13632385.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4B]
gi|377930719|gb|EHU94598.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4A]
gi|377930847|gb|EHU94718.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC4B]
Length = 333
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|417616179|ref|ZP_12266621.1| hypothetical protein ECSTECEH250_5309 [Escherichia coli STEC_EH250]
gi|345356038|gb|EGW88246.1| hypothetical protein ECSTECEH250_5309 [Escherichia coli STEC_EH250]
Length = 658
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 80/209 (38%), Gaps = 62/209 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWY 159
G + N +W G LY+ ++ R + AL R AV+W
Sbjct: 190 CRGASAFTTGADGTYSESAGASENSLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWM 249
Query: 160 QGESDT---VNLEDAKLYKERSDMFFTDL 185
QGE D + + L+ + F T+L
Sbjct: 250 QGEGDAAVGTHAQHPGLFSAMVNQFRTEL 278
>gi|425294132|ref|ZP_18684499.1| hypothetical protein ECPA38_1941, partial [Escherichia coli PA38]
gi|408222804|gb|EKI46623.1| hypothetical protein ECPA38_1941, partial [Escherichia coli PA38]
Length = 206
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 71 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 130
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 131 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 171
>gi|424545353|ref|ZP_17987789.1| yjhS, partial [Escherichia coli EC4402]
gi|390870731|gb|EIP32218.1| yjhS, partial [Escherichia coli EC4402]
Length = 314
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 62 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 121
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 122 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 181
Query: 182 FTDL 185
DL
Sbjct: 182 RADL 185
>gi|424116730|ref|ZP_17850588.1| yjhS, partial [Escherichia coli PA3]
gi|425336820|ref|ZP_18724219.1| yjhS, partial [Escherichia coli EC1847]
gi|425367459|ref|ZP_18752646.1| yjhS, partial [Escherichia coli EC1862]
gi|390677491|gb|EIN53522.1| yjhS, partial [Escherichia coli PA3]
gi|408256087|gb|EKI77486.1| yjhS, partial [Escherichia coli EC1847]
gi|408286401|gb|EKJ05326.1| yjhS, partial [Escherichia coli EC1862]
Length = 404
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|260867673|ref|YP_003234075.1| hypothetical protein ECO111_1598 [Escherichia coli O111:H- str.
11128]
gi|415819851|ref|ZP_11509148.1| hypothetical protein ECOK1180_1879 [Escherichia coli OK1180]
gi|417591182|ref|ZP_12241891.1| hypothetical protein EC253486_1784 [Escherichia coli 2534-86]
gi|257764029|dbj|BAI35524.1| hypothetical protein ECO111_1598 [Escherichia coli O111:H- str.
11128]
gi|323179215|gb|EFZ64785.1| hypothetical protein ECOK1180_1879 [Escherichia coli OK1180]
gi|345343417|gb|EGW75805.1| hypothetical protein EC253486_1784 [Escherichia coli 2534-86]
Length = 648
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|209546000|ref|YP_002277890.1| hypothetical protein Rleg2_5615 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209538857|gb|ACI58790.1| protein of unknown function DUF303 acetylesterase putative
[Rhizobium leguminosarum bv. trifolii WSM2304]
Length = 312
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 67/173 (38%), Gaps = 23/173 (13%)
Query: 16 PVKC--QYQQQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPSILRLTAKL 73
PV C Q + ++++ GQSN A GG + + + QC
Sbjct: 67 PVTCPTQTDRTAVLLILGQSNAANDGGQRHRSNYGARVINAF-GKQC------------- 112
Query: 74 KWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGS 133
+A PL D T G L N ++ N VI L P A G+ +++W G
Sbjct: 113 --FIAASPLLGSTD---TKGEYWTL-LGNKLIASGQNDSVI-LAPLAFSGSEVARWAAGG 165
Query: 134 SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLR 186
L ++ + G I +VLW QGE D V A+ Y +R LR
Sbjct: 166 DLNPVLVDTMKQLQASGYRITSVLWVQGEKDLVIGNTAEAYGQRFMSMVDTLR 218
>gi|196234110|ref|ZP_03132944.1| protein of unknown function DUF303 acetylesterase putative
[Chthoniobacter flavus Ellin428]
gi|196221859|gb|EDY16395.1| protein of unknown function DUF303 acetylesterase putative
[Chthoniobacter flavus Ellin428]
Length = 384
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 76/198 (38%), Gaps = 61/198 (30%)
Query: 25 QLIILAGQSNMAGRGGVTNDTRTNKLT-WDGIVPPQCQPNPSILRLTAKLKWVLAHEPLH 83
++ ++AGQSN A G T+T ++T DG W LA++P
Sbjct: 125 EVFVVAGQSNSANYGEEKQTTQTGRVTALDG------------------RGWQLANDP-- 164
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGV-IGLVPCAIGGTNISQ-------------- 128
G +P L + F V IG V C +GGT++ +
Sbjct: 165 ---QPGAAGSRGSFMPPLGDALEE--RFHVPIGFVACGVGGTSVREWLPQGVVFPNPPTV 219
Query: 129 -----------WRKGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED------A 171
W LY +++ A + G RAVLW+QGESD N +D
Sbjct: 220 ESRVVRLAGGTWESKGQLYAKLL--ASMKAVGPHGFRAVLWHQGESD-ANQQDTSRTLPG 276
Query: 172 KLYKERSDMFFTDLRSDL 189
KLY+E + + R ++
Sbjct: 277 KLYREYLEKIIRESRREV 294
>gi|424874226|ref|ZP_18297888.1| protein of unknown function (DUF303) [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393169927|gb|EJC69974.1| protein of unknown function (DUF303) [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 312
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 56/126 (44%), Gaps = 8/126 (6%)
Query: 100 FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKGSSLYEQMIQRAQVALRGGGTIRAVLWY 159
AN ++ N VI L P A G+ +++W G ++ + G I VLW
Sbjct: 133 LANNLIASGQNDNVI-LAPLAYSGSEVARWAAGGDFNPVLVDTVKQLQDSGYRITNVLWV 191
Query: 160 QGESDTVNLEDAKLYKERSDMFFTDLRSD-LQSPL-LPIIRVALASGEGPFIE-----IV 212
QGE+D V A+ Y+ER LR +++P+ + I L G F E V
Sbjct: 192 QGEADLVIGTPAETYQERFMSMVDTLRQHGVEAPVYISIASKCLEPSNGGFKEHIPDNAV 251
Query: 213 RKAQLS 218
+AQL+
Sbjct: 252 VRAQLA 257
>gi|417189803|ref|ZP_12012941.1| PF08410 domain protein [Escherichia coli 4.0522]
gi|386192356|gb|EIH81085.1| PF08410 domain protein [Escherichia coli 4.0522]
Length = 331
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 VVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|327314642|ref|YP_004330079.1| GDSL-like protein [Prevotella denticola F0289]
gi|326944104|gb|AEA19989.1| GDSL-like protein [Prevotella denticola F0289]
Length = 717
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 6/88 (6%)
Query: 139 MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIR 198
M + A + L G G IR V+WYQGES+ N+E L++ + RS P LP +
Sbjct: 486 MFETALLPLEGYG-IRGVVWYQGESNAHNME---LHERLFPLLLKSWRSFFHHPDLPFLF 541
Query: 199 VALASGEGPFIEIVRKAQ--LSSDLPNV 224
L+S P R +Q ++S L N
Sbjct: 542 AQLSSLNRPSWPRFRDSQCRMASALHNT 569
>gi|424450629|ref|ZP_17902346.1| hypothetical protein ECPA32_3417, partial [Escherichia coli PA32]
gi|390742552|gb|EIO13553.1| hypothetical protein ECPA32_3417, partial [Escherichia coli PA32]
Length = 580
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417212826|ref|ZP_12022290.1| PF08410 domain protein [Escherichia coli JB1-95]
gi|386194728|gb|EIH88973.1| PF08410 domain protein [Escherichia coli JB1-95]
Length = 544
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 77/199 (38%), Gaps = 41/199 (20%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPKADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALRGGGTIRAVLW----YQGESD---TV 166
W G LY+ +I R + AL+ A W +QGE D
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQ-KNPKNAFCWPSAGWQGEFDMSAAT 243
Query: 167 NLEDAKLYKERSDMFFTDL 185
+ + L+ F DL
Sbjct: 244 HAQQPALFTAMLKQFHADL 262
>gi|419092855|ref|ZP_13638146.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
gi|377943405|gb|EHV07123.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4C]
Length = 648
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 190 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 249
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 250 QGEFDFGGTPVNHAAQFGALVDKFRADL 277
>gi|419070191|ref|ZP_13615816.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3E]
gi|377912582|gb|EHU76737.1| putative 9-O-acetyl-N-acetylneuraminate esterase [Escherichia coli
DEC3E]
Length = 331
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|424570257|ref|ZP_18010835.1| yjhS, partial [Escherichia coli EC4448]
gi|425330659|ref|ZP_18718540.1| yjhS, partial [Escherichia coli EC1846]
gi|390895856|gb|EIP55271.1| yjhS, partial [Escherichia coli EC4448]
gi|408246818|gb|EKI69058.1| yjhS, partial [Escherichia coli EC1846]
Length = 405
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|425361283|ref|ZP_18746951.1| yjhS, partial [Escherichia coli EC1856]
gi|408277068|gb|EKI96896.1| yjhS, partial [Escherichia coli EC1856]
Length = 408
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417203909|ref|ZP_12018601.1| PF08410 domain protein [Escherichia coli JB1-95]
gi|386198490|gb|EIH92664.1| PF08410 domain protein [Escherichia coli JB1-95]
Length = 648
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D A + + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPANHAAQFGAQVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|417601767|ref|ZP_12252341.1| hypothetical protein ECSTEC94C_1558 [Escherichia coli STEC_94C]
gi|345351527|gb|EGW83786.1| hypothetical protein ECSTEC94C_1558 [Escherichia coli STEC_94C]
Length = 654
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDAGGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 SARWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|416300326|ref|ZP_11652684.1| YjhS [Shigella flexneri CDC 796-83]
gi|420324820|ref|ZP_14826594.1| hypothetical protein SFCCH060_1148 [Shigella flexneri CCH060]
gi|320184647|gb|EFW59443.1| YjhS [Shigella flexneri CDC 796-83]
gi|391254933|gb|EIQ14088.1| hypothetical protein SFCCH060_1148 [Shigella flexneri CCH060]
Length = 359
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 50/173 (28%), Positives = 69/173 (39%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSNMAGRG------GVTN--DTRTNKLTWDGIVPP---QCQPNPSILR---LTA 71
+++LAGQSN G G + D R +L V P C+ N IL L
Sbjct: 65 VVVLAGQSNGMSYGEGLPLPGTYDRPDPRIKQLARRSTVTPGGAACKYNDIILADHCLHD 124
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
+ P AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 125 VQDMSRLNHP-KADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 183
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL + AV+W QGE D
Sbjct: 184 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALEKNPKNVLFAVVWMQGEFD 236
>gi|420120740|ref|ZP_14629923.1| hypothetical protein ECO10030_06642, partial [Escherichia coli
O26:H11 str. CVM10030]
gi|394428518|gb|EJF01066.1| hypothetical protein ECO10030_06642, partial [Escherichia coli
O26:H11 str. CVM10030]
Length = 413
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|424525626|ref|ZP_17969478.1| hypothetical protein ECEC4421_1945, partial [Escherichia coli
EC4421]
gi|424531803|ref|ZP_17975270.1| hypothetical protein ECEC4422_2077, partial [Escherichia coli
EC4422]
gi|425167502|ref|ZP_18566138.1| hypothetical protein ECFDA507_2013, partial [Escherichia coli
FDA507]
gi|428995168|ref|ZP_19063919.1| hypothetical protein EC940618_1872, partial [Escherichia coli
94.0618]
gi|390854132|gb|EIP17059.1| hypothetical protein ECEC4421_1945, partial [Escherichia coli
EC4421]
gi|390866591|gb|EIP28543.1| hypothetical protein ECEC4422_2077, partial [Escherichia coli
EC4422]
gi|408087030|gb|EKH20514.1| hypothetical protein ECFDA507_2013, partial [Escherichia coli
FDA507]
gi|427249357|gb|EKW16197.1| hypothetical protein EC940618_1872, partial [Escherichia coli
94.0618]
Length = 161
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 26 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 85
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 86 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 126
>gi|417581370|ref|ZP_12232172.1| hypothetical protein ECSTECB2F1_2023 [Escherichia coli STEC_B2F1]
gi|345337141|gb|EGW69573.1| hypothetical protein ECSTECB2F1_2023 [Escherichia coli STEC_B2F1]
Length = 657
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|22001100|gb|AAM88304.1|AF479828_3 unknown [Escherichia coli]
Length = 318
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 62 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 121
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 122 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 181
Query: 182 FTDL 185
DL
Sbjct: 182 RADL 185
>gi|429061953|ref|ZP_19125984.1| hypothetical protein EC970007_2798, partial [Escherichia coli
97.0007]
gi|427315383|gb|EKW77386.1| hypothetical protein EC970007_2798, partial [Escherichia coli
97.0007]
Length = 422
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|419069945|ref|ZP_13615575.1| hypothetical protein ECDEC3E_3023, partial [Escherichia coli DEC3E]
gi|429073619|ref|ZP_19136897.1| hypothetical protein EC990678_2719, partial [Escherichia coli
99.0678]
gi|377913307|gb|EHU77449.1| hypothetical protein ECDEC3E_3023, partial [Escherichia coli DEC3E]
gi|427329590|gb|EKW90912.1| hypothetical protein EC990678_2719, partial [Escherichia coli
99.0678]
Length = 303
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|24415903|gb|AAN59927.1| hypothetical protein [Enterobacteria phage SC370]
Length = 390
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|420103699|ref|ZP_14614523.1| hypothetical protein ECO9455_00376, partial [Escherichia coli
O111:H11 str. CVM9455]
gi|394406744|gb|EJE81695.1| hypothetical protein ECO9455_00376, partial [Escherichia coli
O111:H11 str. CVM9455]
Length = 372
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 98 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 157
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 158 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 217
Query: 182 FTDL 185
DL
Sbjct: 218 RADL 221
>gi|423725987|ref|ZP_17700089.1| yjhS, partial [Escherichia coli PA31]
gi|424501386|ref|ZP_17948304.1| yjhS, partial [Escherichia coli EC4203]
gi|390742365|gb|EIO13373.1| yjhS, partial [Escherichia coli PA31]
gi|390825952|gb|EIO91833.1| yjhS, partial [Escherichia coli EC4203]
Length = 406
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417189303|ref|ZP_12012767.1| PF08410 domain protein, partial [Escherichia coli 4.0522]
gi|386192464|gb|EIH81189.1| PF08410 domain protein, partial [Escherichia coli 4.0522]
Length = 259
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|417607245|ref|ZP_12257763.1| hypothetical protein ECSTECDG1313_1639 [Escherichia coli
STEC_DG131-3]
gi|345363078|gb|EGW95222.1| hypothetical protein ECSTECDG1313_1639 [Escherichia coli
STEC_DG131-3]
Length = 654
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417155349|ref|ZP_11993478.1| PF08410 domain protein [Escherichia coli 96.0497]
gi|386168438|gb|EIH34954.1| PF08410 domain protein [Escherichia coli 96.0497]
Length = 657
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 214 SARWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|425343226|ref|ZP_18730138.1| yjhS, partial [Escherichia coli EC1848]
gi|425355324|ref|ZP_18741409.1| yjhS, partial [Escherichia coli EC1850]
gi|429015412|ref|ZP_19082328.1| hypothetical protein EC950943_3417, partial [Escherichia coli
95.0943]
gi|408258989|gb|EKI80195.1| yjhS, partial [Escherichia coli EC1848]
gi|408274532|gb|EKI94531.1| yjhS, partial [Escherichia coli EC1850]
gi|427261613|gb|EKW27531.1| hypothetical protein EC950943_3417, partial [Escherichia coli
95.0943]
Length = 407
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|420108673|ref|ZP_14618902.1| hypothetical protein ECO9553_02799, partial [Escherichia coli
O111:H11 str. CVM9553]
gi|394409188|gb|EJE83751.1| hypothetical protein ECO9553_02799, partial [Escherichia coli
O111:H11 str. CVM9553]
Length = 620
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 149 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 208
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 209 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 268
Query: 182 FTDL 185
DL
Sbjct: 269 RADL 272
>gi|425131229|ref|ZP_18532179.1| hypothetical protein EC82524_1931A, partial [Escherichia coli
8.2524]
gi|408584507|gb|EKK59509.1| hypothetical protein EC82524_1931A, partial [Escherichia coli
8.2524]
Length = 204
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|424576422|ref|ZP_18016518.1| yjhS, partial [Escherichia coli EC1845]
gi|390920238|gb|EIP78528.1| yjhS, partial [Escherichia coli EC1845]
Length = 449
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|424148261|ref|ZP_17879656.1| hypothetical protein ECPA15_3572, partial [Escherichia coli PA15]
gi|390700958|gb|EIN75226.1| hypothetical protein ECPA15_3572, partial [Escherichia coli PA15]
Length = 256
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|167763609|ref|ZP_02435736.1| hypothetical protein BACSTE_01984 [Bacteroides stercoris ATCC
43183]
gi|167698903|gb|EDS15482.1| cyclically-permuted mutarotase family protein [Bacteroides
stercoris ATCC 43183]
Length = 903
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 153 IRAVLWYQGESDTVNLE-DAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEI 211
++ V+WYQGES+T N E KL+K + RS+ + P LP V L+S + P
Sbjct: 339 LKGVIWYQGESNTHNKEAHEKLFK----LLIDSWRSNWEQPNLPFYYVQLSSIDRPSWTW 394
Query: 212 VRKAQ--LSSDLPNVRCV 227
R +Q L +PN V
Sbjct: 395 FRDSQRRLMKSIPNTGMV 412
>gi|429001747|ref|ZP_19069998.1| hypothetical protein EC950183_2389, partial [Escherichia coli
95.0183]
gi|427264759|gb|EKW30414.1| hypothetical protein EC950183_2389, partial [Escherichia coli
95.0183]
Length = 225
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 26 LIILAGQSNMAGRG-GV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+I+LAGQSN G G+ D R +L V P C+ N I+ L
Sbjct: 66 VIVLAGQSNAMAYGEGLPLPDSYDAPDPRIKQLARRSTVTPGGAACRYN-DIIPADHCLH 124
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQ--- 128
V L+ AD+ + VG GL A +L +PN I LVPC GG+ +Q
Sbjct: 125 DVQDMSTLNHPRADLSKGQYGCVGQGLHIAKKLLPYIPNNAGILLVPCCRGGSAFTQGAE 184
Query: 129 ---------------WRKGSSLYEQMIQRAQVALR 148
W G LY+ +I R + AL+
Sbjct: 185 GTFSESTGASQDSARWGVGKPLYQDLISRTKAALQ 219
>gi|420298910|ref|ZP_14800960.1| yjhS [Escherichia coli TW09109]
gi|390807227|gb|EIO74125.1| yjhS [Escherichia coli TW09109]
Length = 239
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 62 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 121
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 122 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 162
>gi|419865890|ref|ZP_14388265.1| YjhS, partial [Escherichia coli O103:H25 str. CVM9340]
gi|388336672|gb|EIL03206.1| YjhS, partial [Escherichia coli O103:H25 str. CVM9340]
Length = 318
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 73/185 (39%), Gaps = 59/185 (31%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 190 CRGGSAFTTGADGTYSDAGGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWM 249
Query: 160 QGESD 164
QGE D
Sbjct: 250 QGEFD 254
>gi|417607638|ref|ZP_12258149.1| hypothetical protein ECSTECDG1313_2030 [Escherichia coli
STEC_DG131-3]
gi|345361006|gb|EGW93169.1| hypothetical protein ECSTECDG1313_2030 [Escherichia coli
STEC_DG131-3]
Length = 655
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 211 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 270
Query: 181 FFTDL 185
F T+L
Sbjct: 271 FRTEL 275
>gi|416826277|ref|ZP_11897118.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
gi|425248414|ref|ZP_18641465.1| hypothetical protein EC5905_2098 [Escherichia coli 5905]
gi|320659100|gb|EFX26703.1| YjhS [Escherichia coli O55:H7 str. USDA 5905]
gi|408167812|gb|EKH95293.1| hypothetical protein EC5905_2098 [Escherichia coli 5905]
Length = 648
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 80/208 (38%), Gaps = 61/208 (29%)
Query: 26 LIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQC--QPNPSILRLT---------AKLK 74
+++LAGQSN G +G+ P+ +P+P I +L A K
Sbjct: 83 VVVLAGQSNSMAYG-------------EGLPLPETYDRPDPRIKQLARRSTVTPGGAACK 129
Query: 75 W---VLAHEPLH------------ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPC 119
+ + A LH AD+ + VG GL A +L +P I LVPC
Sbjct: 130 YNDIIPADHCLHDVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPC 189
Query: 120 AIGGT------------------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWY 159
GG+ N ++W LY+ +I R + AL+ + AV+W
Sbjct: 190 CRGGSAFTTGADGTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWM 249
Query: 160 QGESD--TVNLEDAKLYKERSDMFFTDL 185
QGE D + A + D F DL
Sbjct: 250 QGEFDFGGTPVNHAAQFGALVDKFRADL 277
>gi|420094847|ref|ZP_14606406.1| hypothetical protein ECO9634_17177, partial [Escherichia coli
O111:H8 str. CVM9634]
gi|394395038|gb|EJE71547.1| hypothetical protein ECO9634_17177, partial [Escherichia coli
O111:H8 str. CVM9634]
Length = 380
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417244610|ref|ZP_12038553.1| PF03629 domain protein [Escherichia coli 9.0111]
gi|386210825|gb|EII21296.1| PF03629 domain protein [Escherichia coli 9.0111]
Length = 538
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 34 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 93
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 94 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 153
Query: 181 FFTDL 185
F T+L
Sbjct: 154 FRTEL 158
>gi|298385784|ref|ZP_06995341.1| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 1_1_14]
gi|298261012|gb|EFI03879.1| sialic acid-specific 9-O-acetylesterase [Bacteroides sp. 1_1_14]
Length = 464
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA-----SGEGP 207
+R LWYQGES N ++A LY+ F DLR+ LP V +A +G
Sbjct: 247 VRGFLWYQGES---NRDNADLYQSLMPAFVADLRAKWGRGELPFYFVQIAPFDYEGADGT 303
Query: 208 FIEIVRKAQLSS--DLPNVRCVDAMGL 232
+R+ QL + D+PN V M +
Sbjct: 304 SAARLREVQLQNMKDIPNSGMVTTMDV 330
>gi|445017560|ref|ZP_21333572.1| hypothetical protein ECPA8_1714, partial [Escherichia coli PA8]
gi|444633604|gb|ELW07115.1| hypothetical protein ECPA8_1714, partial [Escherichia coli PA8]
Length = 292
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 72/173 (41%), Gaps = 35/173 (20%)
Query: 26 LIILAGQSN-MAGRGGV-------TNDTRTNKLTWDGIVPP---QCQPNPSILRLTAKLK 74
+++LAGQSN MA G+ D R +L V P C+ N I+ L
Sbjct: 83 IVVLAGQSNSMAYGEGLPLPETYDRPDPRIKQLARRSTVTPGGVACKYN-DIIPADHCLH 141
Query: 75 WVLAHEPLH---ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------- 124
V L+ AD+ + VG GL A +L +P I LVPC GG+
Sbjct: 142 DVQDMSRLNHPKADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGAD 201
Query: 125 -----------NISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
N ++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 202 GTYSDASGASENSTRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 254
>gi|419893849|ref|ZP_14413803.1| hypothetical protein ECO9574_21263, partial [Escherichia coli
O111:H8 str. CVM9574]
gi|388365883|gb|EIL29653.1| hypothetical protein ECO9574_21263, partial [Escherichia coli
O111:H8 str. CVM9574]
Length = 380
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417253823|ref|ZP_12045579.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|386215750|gb|EII32242.1| PF08410 domain protein [Escherichia coli 4.0967]
Length = 533
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417132230|ref|ZP_11977015.1| PF08410 domain protein [Escherichia coli 5.0588]
gi|386150084|gb|EIH01373.1| PF08410 domain protein [Escherichia coli 5.0588]
Length = 640
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|416342779|ref|ZP_11676888.1| hypothetical protein ECoL_01824 [Escherichia coli EC4100B]
gi|320200915|gb|EFW75500.1| hypothetical protein ECoL_01824 [Escherichia coli EC4100B]
Length = 645
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|425398939|ref|ZP_18781697.1| hypothetical protein ECEC1869_3041, partial [Escherichia coli
EC1869]
gi|408321094|gb|EKJ37141.1| hypothetical protein ECEC1869_3041, partial [Escherichia coli
EC1869]
Length = 407
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|420106841|ref|ZP_14617227.1| hypothetical protein ECO9553_01864, partial [Escherichia coli
O111:H11 str. CVM9553]
gi|394414840|gb|EJE88750.1| hypothetical protein ECO9553_01864, partial [Escherichia coli
O111:H11 str. CVM9553]
Length = 315
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|424133639|ref|ZP_17866261.1| hypothetical protein ECPA10_2027, partial [Escherichia coli PA10]
gi|390704309|gb|EIN78251.1| hypothetical protein ECPA10_2027, partial [Escherichia coli PA10]
Length = 331
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 77 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 136
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 137 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 177
>gi|420124486|ref|ZP_14633340.1| hypothetical protein ECO10030_15699, partial [Escherichia coli
O26:H11 str. CVM10030]
gi|394414858|gb|EJE88767.1| hypothetical protein ECO10030_15699, partial [Escherichia coli
O26:H11 str. CVM10030]
Length = 413
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|424550025|ref|ZP_17992040.1| hypothetical protein ECEC4439_1916, partial [Escherichia coli
EC4439]
gi|390882472|gb|EIP42994.1| hypothetical protein ECEC4439_1916, partial [Escherichia coli
EC4439]
Length = 406
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|419226043|ref|ZP_13768916.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9A]
gi|419230470|ref|ZP_13773275.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|419237156|ref|ZP_13779894.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
gi|419241400|ref|ZP_13784059.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9D]
gi|419248286|ref|ZP_13790885.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
gi|378078323|gb|EHW40311.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9A]
gi|378084425|gb|EHW46336.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9B]
gi|378087114|gb|EHW48981.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9C]
gi|378096828|gb|EHW58597.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9D]
gi|378098626|gb|EHW60359.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC9E]
Length = 645
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|333381845|ref|ZP_08473524.1| hypothetical protein HMPREF9455_01690 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829774|gb|EGK02420.1| hypothetical protein HMPREF9455_01690 [Dysgonomonas gadei ATCC
BAA-286]
Length = 286
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 31/196 (15%)
Query: 72 KLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWRK 131
K KW A PL G+ P F ++ +P IG++ ++GG I + K
Sbjct: 79 KCKWRTAVPPL-----TRCRTGLSPADYFGRTMVANLPENIKIGIINVSVGGCRIELFDK 133
Query: 132 GS---------------------SLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLED 170
+ + Y ++++ A A + GG I+ +L +QGES++ + +
Sbjct: 134 DNYESYVETSPDWLKNMVKEYDGNPYRRLVELANQAQQNGGIIKGILLHQGESNSGDQDW 193
Query: 171 AKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGP---FIEIVRKAQLSSDLPNVRCV 227
+ K D D+ + S L + + A G EI+R L +PN +
Sbjct: 194 PQKVKGVYDNLLRDINLEANSIPLLVGELVNADQNGACSGMNEIIR--MLPDVIPNAYII 251
Query: 228 DAMGLPLEPDGLHLTT 243
+ G D LH +
Sbjct: 252 PSDGCEGVADRLHFSA 267
>gi|266620007|ref|ZP_06112942.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
gi|288868366|gb|EFD00665.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
Length = 254
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 75/185 (40%), Gaps = 41/185 (22%)
Query: 23 QQQLIILAGQSNMAGRGGVTNDTRTNKLTWDGIVPPQCQPNPS--ILRLTAKLKWVLAHE 80
+ +++ GQSNMAGRG D + P+ P + +T V E
Sbjct: 2 EADILLFMGQSNMAGRG-------------DYRLAPEVLPGAAYEYRAVTEPDTLVPLTE 48
Query: 81 PLHADIDVNKTNGV-GPGLP-------FANAVLTKVPNFGVIGLVPCAIGGTNISQWRKG 132
P ++ N+ GV PG+ F NA K I V C+ GG+ I +W+
Sbjct: 49 PF--GVNENREGGVFEPGMKTGSMAAAFVNACYRKTGR--PIIAVSCSKGGSRIQEWQPE 104
Query: 133 SSLYEQ----------MIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFF 182
+ ++ +Q Q+A+ G + W QG ++ + YKE++ FF
Sbjct: 105 TPYFKDAAARYQACLSFVQSRQIAVHSTGMV----WCQGCTNADDGMAKAEYKEKTKAFF 160
Query: 183 TDLRS 187
++S
Sbjct: 161 QAVKS 165
>gi|420270359|ref|ZP_14772717.1| hypothetical protein ECPA22_3274 [Escherichia coli PA22]
gi|390713871|gb|EIN86785.1| hypothetical protein ECPA22_3274 [Escherichia coli PA22]
Length = 526
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|419303481|ref|ZP_13845465.1| hypothetical protein ECDEC11C_5462 [Escherichia coli DEC11C]
gi|378144251|gb|EHX05425.1| hypothetical protein ECDEC11C_5462 [Escherichia coli DEC11C]
Length = 645
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|425422628|ref|ZP_18803805.1| hypothetical protein EC01288_1980 [Escherichia coli 0.1288]
gi|408344526|gb|EKJ58887.1| hypothetical protein EC01288_1980 [Escherichia coli 0.1288]
Length = 654
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKRNPKNVLFAVVWMQGEFD 251
>gi|419044285|ref|ZP_13591252.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
gi|377899003|gb|EHU63359.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC3A]
Length = 566
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 72 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 131
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 132 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 191
Query: 182 FTDL 185
DL
Sbjct: 192 RADL 195
>gi|24415898|gb|AAN59923.1| unknown [Enterobacteria phage LC159]
Length = 376
Score = 40.8 bits (94), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 158 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 217
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 218 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 258
>gi|427385775|ref|ZP_18882082.1| hypothetical protein HMPREF9447_03115 [Bacteroides oleiciplenus YIT
12058]
gi|425726814|gb|EKU89677.1| hypothetical protein HMPREF9447_03115 [Bacteroides oleiciplenus YIT
12058]
Length = 459
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 56/141 (39%), Gaps = 30/141 (21%)
Query: 114 IGLVPCAIGGTNISQW---------------RKGSSLYEQMIQRAQVALRGGGTIRAVLW 158
+GLV A GG+ I W S LY MI + TI+ LW
Sbjct: 193 VGLVVSAFGGSKIESWLSYKAVDDIPGALAHHSPSQLYNAMIHPFK-----NYTIKGFLW 247
Query: 159 YQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALASGEGPFIEI-----VR 213
YQGE++ V D +LY D R + LP V +A G +E +R
Sbjct: 248 YQGENNWV---DPELYARLFPELPKDFRRAWNAGELPFYYVQIAPGPYDGVEKTTSARIR 304
Query: 214 KAQLSSD--LPNVRCVDAMGL 232
+ Q+ ++ +PN V + L
Sbjct: 305 EVQMLNEKTIPNAGMVVTLDL 325
>gi|410483302|ref|YP_006770848.1| hypothetical protein O3M_16100 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|429949987|ref|ZP_19415835.1| hypothetical protein S7Y_01401 [Escherichia coli O104:H4 str.
Ec12-0465]
gi|406778464|gb|AFS57888.1| hypothetical protein O3M_16100 [Escherichia coli O104:H4 str.
2009EL-2050]
gi|429438260|gb|EKZ74254.1| hypothetical protein S7Y_01401 [Escherichia coli O104:H4 str.
Ec12-0465]
Length = 645
Score = 40.8 bits (94), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|445007601|ref|ZP_21323868.1| hypothetical protein ECPA47_2524, partial [Escherichia coli PA47]
gi|444625362|gb|ELV99215.1| hypothetical protein ECPA47_2524, partial [Escherichia coli PA47]
Length = 418
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|425336214|ref|ZP_18723675.1| hypothetical protein ECEC1847_2862, partial [Escherichia coli
EC1847]
gi|408258789|gb|EKI80020.1| hypothetical protein ECEC1847_2862, partial [Escherichia coli
EC1847]
Length = 404
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|425217370|ref|ZP_18612504.1| hypothetical protein ECPA23_1972, partial [Escherichia coli PA23]
gi|408144905|gb|EKH74118.1| hypothetical protein ECPA23_1972, partial [Escherichia coli PA23]
Length = 410
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 214 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
>gi|419203852|ref|ZP_13747043.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
gi|378049776|gb|EHW12113.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8B]
Length = 645
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|260844966|ref|YP_003222744.1| hypothetical protein ECO103_2843 [Escherichia coli O103:H2 str.
12009]
gi|257760113|dbj|BAI31610.1| hypothetical protein ECO103_2843 [Escherichia coli O103:H2 str.
12009]
Length = 645
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASDN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|383124724|ref|ZP_09945386.1| hypothetical protein BSIG_1527 [Bacteroides sp. 1_1_6]
gi|251841121|gb|EES69202.1| hypothetical protein BSIG_1527 [Bacteroides sp. 1_1_6]
Length = 479
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA-----SGEGP 207
+R LWYQGES N ++A LY+ F DLR+ LP V +A +G
Sbjct: 262 VRGFLWYQGES---NRDNADLYQSLMPAFVADLRAKWGRGELPFYFVQIAPFDYEGADGT 318
Query: 208 FIEIVRKAQLSS--DLPNVRCVDAMGL 232
+R+ QL + D+PN V M +
Sbjct: 319 SAARLREVQLQNMKDIPNSGMVTTMDV 345
>gi|424762012|ref|ZP_18189539.1| hypothetical protein CFSAN001630_17988, partial [Escherichia coli
O111:H11 str. CFSAN001630]
gi|421941624|gb|EKT99010.1| hypothetical protein CFSAN001630_17988, partial [Escherichia coli
O111:H11 str. CFSAN001630]
Length = 352
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|415822735|ref|ZP_11511254.1| hypothetical protein ECOK1180_4055 [Escherichia coli OK1180]
gi|323176690|gb|EFZ62280.1| hypothetical protein ECOK1180_4055 [Escherichia coli OK1180]
Length = 645
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|260844473|ref|YP_003222251.1| hypothetical protein ECO103_2330 [Escherichia coli O103:H2 str.
12009]
gi|419215355|ref|ZP_13758369.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8D]
gi|419265876|ref|ZP_13808254.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
gi|257759620|dbj|BAI31117.1| hypothetical protein ECO103_2330 [Escherichia coli O103:H2 str.
12009]
gi|378064869|gb|EHW27020.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC8D]
gi|378116561|gb|EHW78084.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC10C]
Length = 645
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|29348533|ref|NP_812036.1| sialic acid-specific acetylesterase [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340438|gb|AAO78230.1| putative sialic acid-specific acetylesterase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 479
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 41/87 (47%), Gaps = 10/87 (11%)
Query: 153 IRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSDLQSPLLPIIRVALA-----SGEGP 207
+R LWYQGES N ++A LY+ F DLR+ LP V +A +G
Sbjct: 262 VRGFLWYQGES---NRDNADLYQSLMPAFVADLRAKWGRGELPFYFVQIAPFDYEGADGT 318
Query: 208 FIEIVRKAQLSS--DLPNVRCVDAMGL 232
+R+ QL + D+PN V M +
Sbjct: 319 SAARLREVQLQNMKDIPNSGMVTTMDV 345
>gi|10799913|emb|CAC12889.1| hypothetical protein [Shigella phage 7888]
Length = 645
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|425103649|ref|ZP_18506095.1| hypothetical protein EC52239_2122, partial [Escherichia coli
5.2239]
gi|408554028|gb|EKK31035.1| hypothetical protein EC52239_2122, partial [Escherichia coli
5.2239]
Length = 169
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 34 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 93
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 94 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 134
>gi|407468520|ref|YP_006785038.1| hypothetical protein O3O_09175 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|407482750|ref|YP_006779899.1| hypothetical protein O3K_16125 [Escherichia coli O104:H4 str.
2011C-3493]
gi|410491642|ref|YP_006906864.1| hypothetical protein [Escherichia phage P13374]
gi|417865181|ref|ZP_12510226.1| hypothetical protein C22711_2113 [Escherichia coli O104:H4 str.
C227-11]
gi|422991775|ref|ZP_16982546.1| hypothetical protein EUAG_01368 [Escherichia coli O104:H4 str.
C227-11]
gi|422993718|ref|ZP_16984482.1| hypothetical protein EUBG_01369 [Escherichia coli O104:H4 str.
C236-11]
gi|423007445|ref|ZP_16998188.1| hypothetical protein EUDG_04444 [Escherichia coli O104:H4 str.
04-8351]
gi|423009036|ref|ZP_16999774.1| hypothetical protein EUFG_01373 [Escherichia coli O104:H4 str.
11-3677]
gi|423028376|ref|ZP_17019069.1| hypothetical protein EUIG_01380 [Escherichia coli O104:H4 str.
11-4522]
gi|423037076|ref|ZP_17027750.1| hypothetical protein EUKG_01353 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|423042196|ref|ZP_17032863.1| hypothetical protein EULG_01371 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|423048885|ref|ZP_17039542.1| hypothetical protein EUMG_01373 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|423052467|ref|ZP_17041275.1| hypothetical protein EUNG_00873 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|423059434|ref|ZP_17048230.1| hypothetical protein EUOG_01374 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|429775442|ref|ZP_19307439.1| hypothetical protein C212_05023 [Escherichia coli O104:H4 str.
11-02030]
gi|429780764|ref|ZP_19312710.1| hypothetical protein C213_05024 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429784681|ref|ZP_19316590.1| hypothetical protein C214_05012 [Escherichia coli O104:H4 str.
11-02092]
gi|429790018|ref|ZP_19321890.1| hypothetical protein C215_04993 [Escherichia coli O104:H4 str.
11-02093]
gi|429796248|ref|ZP_19328071.1| hypothetical protein C216_05028 [Escherichia coli O104:H4 str.
11-02281]
gi|429802173|ref|ZP_19333948.1| hypothetical protein C217_05020 [Escherichia coli O104:H4 str.
11-02318]
gi|429805805|ref|ZP_19337549.1| hypothetical protein C218_05026 [Escherichia coli O104:H4 str.
11-02913]
gi|429811401|ref|ZP_19343100.1| hypothetical protein C219_05029 [Escherichia coli O104:H4 str.
11-03439]
gi|429816752|ref|ZP_19348408.1| hypothetical protein C220_05019 [Escherichia coli O104:H4 str.
11-04080]
gi|429821962|ref|ZP_19353573.1| hypothetical protein C221_05020 [Escherichia coli O104:H4 str.
11-03943]
gi|429907629|ref|ZP_19373597.1| hypothetical protein MO5_02812 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429911831|ref|ZP_19377787.1| hypothetical protein MO7_02267 [Escherichia coli O104:H4 str.
Ec11-9941]
gi|429922705|ref|ZP_19388626.1| hypothetical protein O7E_04642 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429923555|ref|ZP_19389471.1| hypothetical protein O7G_00409 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429939713|ref|ZP_19405587.1| hypothetical protein O7M_01408 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429958264|ref|ZP_19424093.1| hypothetical protein S91_04731 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|341918470|gb|EGT68084.1| hypothetical protein C22711_2113 [Escherichia coli O104:H4 str.
C227-11]
gi|354856833|gb|EHF17291.1| hypothetical protein EUDG_04444 [Escherichia coli O104:H4 str.
04-8351]
gi|354858024|gb|EHF18477.1| hypothetical protein EUAG_01368 [Escherichia coli O104:H4 str.
C227-11]
gi|354864793|gb|EHF25222.1| hypothetical protein EUBG_01369 [Escherichia coli O104:H4 str.
C236-11]
gi|354882858|gb|EHF43180.1| hypothetical protein EUFG_01373 [Escherichia coli O104:H4 str.
11-3677]
gi|354884480|gb|EHF44793.1| hypothetical protein EUIG_01380 [Escherichia coli O104:H4 str.
11-4522]
gi|354900732|gb|EHF60866.1| hypothetical protein EUKG_01353 [Escherichia coli O104:H4 str.
11-4632 C1]
gi|354903120|gb|EHF63229.1| hypothetical protein EULG_01371 [Escherichia coli O104:H4 str.
11-4632 C2]
gi|354906240|gb|EHF66322.1| hypothetical protein EUMG_01373 [Escherichia coli O104:H4 str.
11-4632 C3]
gi|354916054|gb|EHF76028.1| hypothetical protein EUOG_01374 [Escherichia coli O104:H4 str.
11-4632 C5]
gi|354921218|gb|EHF81143.1| hypothetical protein EUNG_00873 [Escherichia coli O104:H4 str.
11-4632 C4]
gi|405109717|emb|CCG06191.1| hypothetical protein [Escherichia phage P13374]
gi|407055047|gb|AFS75098.1| hypothetical protein O3K_16125 [Escherichia coli O104:H4 str.
2011C-3493]
gi|407064555|gb|AFS85602.1| hypothetical protein O3O_09175 [Escherichia coli O104:H4 str.
2009EL-2071]
gi|429349598|gb|EKY86335.1| hypothetical protein C212_05023 [Escherichia coli O104:H4 str.
11-02030]
gi|429350176|gb|EKY86910.1| hypothetical protein C213_05024 [Escherichia coli O104:H4 str.
11-02033-1]
gi|429351266|gb|EKY87987.1| hypothetical protein C214_05012 [Escherichia coli O104:H4 str.
11-02092]
gi|429365544|gb|EKZ02157.1| hypothetical protein C215_04993 [Escherichia coli O104:H4 str.
11-02093]
gi|429366495|gb|EKZ03098.1| hypothetical protein C216_05028 [Escherichia coli O104:H4 str.
11-02281]
gi|429369058|gb|EKZ05641.1| hypothetical protein C217_05020 [Escherichia coli O104:H4 str.
11-02318]
gi|429381465|gb|EKZ17952.1| hypothetical protein C218_05026 [Escherichia coli O104:H4 str.
11-02913]
gi|429382433|gb|EKZ18898.1| hypothetical protein C219_05029 [Escherichia coli O104:H4 str.
11-03439]
gi|429383481|gb|EKZ19941.1| hypothetical protein C221_05020 [Escherichia coli O104:H4 str.
11-03943]
gi|429395699|gb|EKZ32065.1| hypothetical protein C220_05019 [Escherichia coli O104:H4 str.
11-04080]
gi|429397791|gb|EKZ34137.1| hypothetical protein MO5_02812 [Escherichia coli O104:H4 str.
Ec11-9990]
gi|429423387|gb|EKZ59495.1| hypothetical protein O7G_00409 [Escherichia coli O104:H4 str.
Ec11-4986]
gi|429425458|gb|EKZ61547.1| hypothetical protein O7M_01408 [Escherichia coli O104:H4 str.
Ec11-5603]
gi|429432941|gb|EKZ68976.1| hypothetical protein O7E_04642 [Escherichia coli O104:H4 str.
Ec11-5604]
gi|429449063|gb|EKZ84966.1| hypothetical protein S91_04731 [Escherichia coli O104:H4 str.
Ec12-0466]
gi|429455293|gb|EKZ91150.1| hypothetical protein MO7_02267 [Escherichia coli O104:H4 str.
Ec11-9941]
Length = 645
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417105828|ref|ZP_11961969.1| hypothetical protein RHECNPAF_4310062 [Rhizobium etli CNPAF512]
gi|327190339|gb|EGE57437.1| hypothetical protein RHECNPAF_4310062 [Rhizobium etli CNPAF512]
Length = 312
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 81/215 (37%), Gaps = 36/215 (16%)
Query: 16 PVKC--QYQQQQLIILAGQSNMAGRGGVTNDTRTNKL---TWDGIVPPQCQPNPSILRLT 70
PV C Q + +++L GQSN A GG + + +DG
Sbjct: 67 PVACPAQTDRTAVLLLLGQSNAANDGGQRHRSEYGARVVNAFDG---------------- 110
Query: 71 AKLKWVLAHEPLHADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGTNISQWR 130
+ +A PL D+ G L + + L P A G+ +++W
Sbjct: 111 ---RCFIAASPLLGSTDIK-----GEYWTLLGNELIASGQYDSVILAPLAYSGSEVARWA 162
Query: 131 KGSSLYEQMIQRAQVALRGGGTIRAVLWYQGESDTVNLEDAKLYKERSDMFFTDLRSD-L 189
G L +++ + G +VLW QGE D V A+ Y+E LR +
Sbjct: 163 AGGDLNAVLVETMKKLQASGYRATSVLWVQGEKDLVIGTTAEAYREYFLSMVDTLRQHGI 222
Query: 190 QSPL-LPIIRVALASGEGPFIEI-----VRKAQLS 218
++P+ + I L G F E V +AQLS
Sbjct: 223 EAPVYISIASKCLEPSNGGFKEHIPDNPVVRAQLS 257
>gi|9632508|ref|NP_049502.1| hypothetical protein 933Wp42 [Enterobacteria phage 933W]
gi|15800962|ref|NP_286978.1| hypothetical protein Z1466 [Escherichia coli O157:H7 str. EDL933]
gi|20065943|ref|NP_613026.1| hypothetical protein Stx2Ip148 [Stx2 converting phage I]
gi|168748245|ref|ZP_02773267.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|168768021|ref|ZP_02793028.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|168772877|ref|ZP_02797884.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|168780252|ref|ZP_02805259.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|170783652|ref|YP_001648934.1| hypothetical protein [Enterobacteria phage Min27]
gi|208808653|ref|ZP_03250990.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208812998|ref|ZP_03254327.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208821147|ref|ZP_03261467.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209398813|ref|YP_002271795.1| hypothetical protein ECH74115_3531 [Escherichia coli O157:H7 str.
EC4115]
gi|254794269|ref|YP_003079106.1| hypothetical protein ECSP_3251 [Escherichia coli O157:H7 str.
TW14359]
gi|387881725|ref|YP_006312027.1| hypothetical protein CDCO157_1156 [Escherichia coli Xuzhou21]
gi|417254373|ref|ZP_12046127.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|419087373|ref|ZP_13632730.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|420293411|ref|ZP_14795531.1| hypothetical protein ECTW11039_3541 [Escherichia coli TW11039]
gi|4585419|gb|AAD25447.1|AF125520_42 hypothetical protein [Enterobacteria phage 933W]
gi|12514318|gb|AAG55589.1|AE005296_11 unknown protein encoded by bacteriophage BP-933W [Escherichia coli
O157:H7 str. EDL933]
gi|19911735|dbj|BAB87995.1| hypothetical protein [Stx2 converting phage I]
gi|163955746|gb|ABY49896.1| hypothetical protein [Enterobacteria phage Min27]
gi|187771067|gb|EDU34911.1| YjhS [Escherichia coli O157:H7 str. EC4196]
gi|188017257|gb|EDU55379.1| YjhS [Escherichia coli O157:H7 str. EC4113]
gi|189001919|gb|EDU70905.1| YjhS [Escherichia coli O157:H7 str. EC4076]
gi|189362899|gb|EDU81318.1| YjhS [Escherichia coli O157:H7 str. EC4486]
gi|208728454|gb|EDZ78055.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4206]
gi|208734275|gb|EDZ82962.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4045]
gi|208741270|gb|EDZ88952.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4042]
gi|209160213|gb|ACI37646.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4115]
gi|254593669|gb|ACT73030.1| hypothetical protein ECSP_3251 [Escherichia coli O157:H7 str.
TW14359]
gi|307604125|gb|ADN68435.1| hypothetical protein vb_24B_21 [Stx2 converting phage vB_EcoP_24B]
gi|377930563|gb|EHU94446.1| putative 9-O-acetyl-N-acetylneuraminate esterase domain protein
[Escherichia coli DEC4B]
gi|386215317|gb|EII31811.1| PF08410 domain protein [Escherichia coli 4.0967]
gi|386795183|gb|AFJ28217.1| hypothetical protein CDCO157_1156 [Escherichia coli Xuzhou21]
gi|390796659|gb|EIO63928.1| hypothetical protein ECTW11039_3541 [Escherichia coli TW11039]
Length = 645
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|9955822|emb|CAC05625.1| hypothetical protein [Shigella dysenteriae]
Length = 536
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|421831265|ref|ZP_16266560.1| hypothetical protein ECPA7_3408 [Escherichia coli PA7]
gi|408066483|gb|EKH00939.1| hypothetical protein ECPA7_3408 [Escherichia coli PA7]
Length = 645
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|260854366|ref|YP_003228257.1| hypothetical protein ECO26_1205 [Escherichia coli O26:H11 str.
11368]
gi|257753015|dbj|BAI24517.1| hypothetical protein ECO26_1205 [Escherichia coli O26:H11 str.
11368]
Length = 645
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|429825770|ref|ZP_19357016.1| hypothetical protein EC960109_2058A, partial [Escherichia coli
96.0109]
gi|429256754|gb|EKY40888.1| hypothetical protein EC960109_2058A, partial [Escherichia coli
96.0109]
Length = 380
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|425341588|ref|ZP_18728640.1| hypothetical protein ECEC1848_2077, partial [Escherichia coli
EC1848]
gi|425353684|ref|ZP_18739899.1| hypothetical protein ECEC1850_2056, partial [Escherichia coli
EC1850]
gi|408265152|gb|EKI85905.1| hypothetical protein ECEC1848_2077, partial [Escherichia coli
EC1848]
gi|408280263|gb|EKI99827.1| hypothetical protein ECEC1850_2056, partial [Escherichia coli
EC1850]
Length = 281
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|418487095|ref|YP_007001458.1| hypothetical protein [Escherichia phage TL-2011c]
gi|363498337|gb|AEW24650.1| hypothetical protein [Escherichia phage TL-2011c]
Length = 645
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|420113039|ref|ZP_14622806.1| hypothetical protein ECO10021_11052, partial [Escherichia coli
O26:H11 str. CVM10021]
gi|394413106|gb|EJE87183.1| hypothetical protein ECO10021_11052, partial [Escherichia coli
O26:H11 str. CVM10021]
Length = 425
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|417160133|ref|ZP_11997052.1| PF03629 domain protein [Escherichia coli 99.0741]
gi|386174624|gb|EIH46617.1| PF03629 domain protein [Escherichia coli 99.0741]
Length = 538
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 34 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 93
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 94 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 153
Query: 181 FFTDL 185
F T+L
Sbjct: 154 FRTEL 158
>gi|424499751|ref|ZP_17946833.1| hypothetical protein ECEC4203_1952, partial [Escherichia coli
EC4203]
gi|390832666|gb|EIO97892.1| hypothetical protein ECEC4203_1952, partial [Escherichia coli
EC4203]
Length = 280
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|363498233|gb|AEW24548.1| hypothetical protein, partial [Escherichia phage TL-2011a]
Length = 547
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|425149604|ref|ZP_18549338.1| hypothetical protein EC880221_1943, partial [Escherichia coli
88.0221]
gi|425185858|ref|ZP_18583290.1| hypothetical protein ECFRIK1997_2178, partial [Escherichia coli
FRIK1997]
gi|429025856|ref|ZP_19092052.1| hypothetical protein EC960427_1964, partial [Escherichia coli
96.0427]
gi|408109803|gb|EKH41664.1| hypothetical protein ECFRIK1997_2178, partial [Escherichia coli
FRIK1997]
gi|408601556|gb|EKK75357.1| hypothetical protein EC880221_1943, partial [Escherichia coli
88.0221]
gi|427285526|gb|EKW49487.1| hypothetical protein EC960427_1964, partial [Escherichia coli
96.0427]
Length = 111
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 6 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 65
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 66 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 106
>gi|425359667|ref|ZP_18745471.1| hypothetical protein ECEC1856_1890, partial [Escherichia coli
EC1856]
gi|408281811|gb|EKJ01183.1| hypothetical protein ECEC1856_1890, partial [Escherichia coli
EC1856]
Length = 282
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|420305316|ref|ZP_14807310.1| hypothetical protein ECTW10119_3816, partial [Escherichia coli
TW10119]
gi|390815621|gb|EIO82149.1| hypothetical protein ECTW10119_3816, partial [Escherichia coli
TW10119]
Length = 641
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417294593|ref|ZP_12081862.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
gi|386261992|gb|EIJ17444.1| PF08410 domain protein [Escherichia coli 900105 (10e)]
Length = 645
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|417144127|ref|ZP_11985933.1| PF08410 domain protein [Escherichia coli 1.2264]
gi|386164010|gb|EIH25796.1| PF08410 domain protein [Escherichia coli 1.2264]
Length = 658
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 214 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 273
Query: 181 FFTDL 185
F T+L
Sbjct: 274 FRTEL 278
>gi|424749074|ref|ZP_18177193.1| hypothetical protein CFSAN001629_10603, partial [Escherichia coli
O26:H11 str. CFSAN001629]
gi|421943009|gb|EKU00314.1| hypothetical protein CFSAN001629_10603, partial [Escherichia coli
O26:H11 str. CFSAN001629]
Length = 627
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 133 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 192
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 193 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 233
>gi|424568666|ref|ZP_18009394.1| hypothetical protein ECEC4448_1923, partial [Escherichia coli
EC4448]
gi|390903797|gb|EIP62824.1| hypothetical protein ECEC4448_1923, partial [Escherichia coli
EC4448]
Length = 279
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 25 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 84
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 85 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 125
>gi|424488021|ref|ZP_17936601.1| yjhS, partial [Escherichia coli TW09098]
gi|390805865|gb|EIO72800.1| yjhS, partial [Escherichia coli TW09098]
Length = 403
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|416318681|ref|ZP_11661325.1| hypothetical protein ECoD_01536 [Escherichia coli O157:H7 str.
EC1212]
gi|420287213|ref|ZP_14789406.1| hypothetical protein ECTW10246_3078 [Escherichia coli TW10246]
gi|420298742|ref|ZP_14800793.1| hypothetical protein ECTW09109_3205 [Escherichia coli TW09109]
gi|320191860|gb|EFW66508.1| hypothetical protein ECoD_01536 [Escherichia coli O157:H7 str.
EC1212]
gi|390790603|gb|EIO58020.1| hypothetical protein ECTW10246_3078 [Escherichia coli TW10246]
gi|390807313|gb|EIO74201.1| hypothetical protein ECTW09109_3205 [Escherichia coli TW09109]
Length = 645
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD 164
++W LY+ +I R + AL+ + AV+W QGE D
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFD 251
>gi|195935767|ref|ZP_03081149.1| hypothetical protein EscherichcoliO157_04772 [Escherichia coli
O157:H7 str. EC4024]
Length = 645
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|417612583|ref|ZP_12263049.1| hypothetical protein ECSTECEH250_1639 [Escherichia coli STEC_EH250]
gi|345364163|gb|EGW96293.1| hypothetical protein ECSTECEH250_1639 [Escherichia coli STEC_EH250]
Length = 516
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 23/125 (18%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC G + N
Sbjct: 12 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGASAFTTGADGTYSESAGASEN 71
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGGGTIR--AVLWYQGESDT---VNLEDAKLYKERSDM 180
+W G LY+ ++ R + AL R AV+W QGE D + + L+ +
Sbjct: 72 SLRWGVGKPLYQDLVSRTKAALAKNPKNRLLAVVWMQGEGDAAVGTHAQHPGLFSAMVNQ 131
Query: 181 FFTDL 185
F T+L
Sbjct: 132 FRTEL 136
>gi|260867247|ref|YP_003233649.1| hypothetical protein ECO111_1150 [Escherichia coli O111:H- str.
11128]
gi|257763603|dbj|BAI35098.1| hypothetical protein ECO111_1150 [Escherichia coli O111:H- str.
11128]
Length = 645
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 151 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 210
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 211 STRWGVDKPLYKDLIGRTKAALKKNPKNVLLAVVWMQGEFDFGGTPVNHAAQFGALVDKF 270
Query: 182 FTDL 185
DL
Sbjct: 271 RADL 274
>gi|22001111|gb|AAM88314.1|AF479829_3 unknown [Escherichia coli]
Length = 410
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 22/124 (17%)
Query: 84 ADIDVNKTNGVGPGLPFANAVLTKVPNFGVIGLVPCAIGGT------------------N 125
AD+ + VG GL A +L +P I LVPC GG+ N
Sbjct: 154 ADLSKGQYGTVGQGLHIAKKLLPFIPANAGILLVPCCRGGSAFTTGADGTYSDASGASEN 213
Query: 126 ISQWRKGSSLYEQMIQRAQVALRGG--GTIRAVLWYQGESD--TVNLEDAKLYKERSDMF 181
++W LY+ +I R + AL+ + AV+W QGE D + A + D F
Sbjct: 214 SARWGVDKPLYKDLIGRTKAALKKNPKNVLFAVVWMQGEFDFGGTPVNHAAQFGALVDKF 273
Query: 182 FTDL 185
DL
Sbjct: 274 RADL 277
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,808,279,398
Number of Sequences: 23463169
Number of extensions: 207065478
Number of successful extensions: 482718
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 161
Number of HSP's successfully gapped in prelim test: 1189
Number of HSP's that attempted gapping in prelim test: 480776
Number of HSP's gapped (non-prelim): 1397
length of query: 290
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 149
effective length of database: 9,050,888,538
effective search space: 1348582392162
effective search space used: 1348582392162
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)
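
Note on the statistics above: the bit scores and Expect values in this report can be approximately reproduced from the footer parameters. The sketch below is illustrative only and is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda*S - ln K) / ln 2, E = search space * 2^-bits) and uses the second (gapped) Lambda/K pair and the reported effective search space. The function names are hypothetical. Small differences from the printed Expect values are expected, since BLAST applies compositional adjustment and other corrections not modeled here.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_GAPPED = 0.267                    # second "Lambda K H" row (gapped)
K_GAPPED = 0.0410
EFFECTIVE_QUERY_LENGTH = 149             # 290 - effective HSP length 141
EFFECTIVE_DB_LENGTH = 9050888538         # as reported in the footer
EFFECTIVE_SEARCH_SPACE = 1348582392162   # as reported in the footer


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate the Expect value for a given bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw score 94 is the value shown in parentheses for the hits above.
    bits = bit_score(94)
    print(f"raw 94 -> {bits:.1f} bits, E ~ {expect_value(bits):.2f}")
    # The reported search space is the product of the two effective lengths.
    print(EFFECTIVE_QUERY_LENGTH * EFFECTIVE_DB_LENGTH == EFFECTIVE_SEARCH_SPACE)
```

For the hits listed above (raw score 94), this gives a bit score close to the reported 40.8 and an Expect value near the reported 0.70 to 0.73 range. The same conversion applied to the S2 cutoff (raw 76) gives roughly the 33.9 bits shown in the footer.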