BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020796
(321 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255581890|ref|XP_002531744.1| conserved hypothetical protein [Ricinus communis]
gi|223528614|gb|EEF30633.1| conserved hypothetical protein [Ricinus communis]
Length = 321
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 267/321 (83%), Positives = 291/321 (90%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M+KDC I+G + EEFTFP SAPF +CHASTIVEVDK HFLVAYFGG+ EGAPDVKIWLQ
Sbjct: 1 MRKDCYIRGPLKEEFTFPEKSAPFDNCHASTIVEVDKDHFLVAYFGGTLEGAPDVKIWLQ 60
Query: 61 TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
T+KDG WQSP+I DE+P VPMWNPVLFKL S LLLFYKIGQEVQKWSGCMKRSY+KGVT
Sbjct: 61 TYKDGFWQSPVIVDEQPEVPMWNPVLFKLSSEELLLFYKIGQEVQKWSGCMKRSYDKGVT 120
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
W+ REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG+WMEVT DAGRSW+KYGPIY
Sbjct: 121 WTAREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGAWMEVTADAGRSWKKYGPIY 180
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
I N SLSVIQPV F TA TLRVL+RSF+GIG++CMSES DGGL W YAK T+L NPNSG
Sbjct: 181 IVNTSLSVIQPVLFQTAKGTLRVLLRSFDGIGKICMSESLDGGLNWGYAKLTELPNPNSG 240
Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
IDGVKLKDGRLLLAYNT+SRGVLKVA+S+DDGDSW+D +TLEENL MEFSYPAVIQASDG
Sbjct: 241 IDGVKLKDGRLLLAYNTISRGVLKVAISEDDGDSWYDVITLEENLEMEFSYPAVIQASDG 300
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
SVHI+YTYNRTQIK+VVL PN
Sbjct: 301 SVHISYTYNRTQIKYVVLQPN 321
>gi|224142297|ref|XP_002324495.1| predicted protein [Populus trichocarpa]
gi|222865929|gb|EEF03060.1| predicted protein [Populus trichocarpa]
Length = 321
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 268/321 (83%), Positives = 291/321 (90%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M KDC LV EEFTFPANS+PF SCHASTIVEVDK HFLVAYFGG+ EGAPDVKIW+Q
Sbjct: 1 MSKDCHNWQLVKEEFTFPANSSPFNSCHASTIVEVDKNHFLVAYFGGTKEGAPDVKIWIQ 60
Query: 61 TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
T+KDG WQSPII D++ +VPMWNPVLFKL S LLLFYK+GQEVQKWSGCMKRSY+KGVT
Sbjct: 61 TYKDGYWQSPIIVDDQLDVPMWNPVLFKLASEELLLFYKVGQEVQKWSGCMKRSYDKGVT 120
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
WS+REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG+WMEVT DAGRSWRK+GPIY
Sbjct: 121 WSKREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGAWMEVTADAGRSWRKHGPIY 180
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ N SLSVIQPVPF TA+ T RVL+RSF+ IGRVCMSES DGG+TW YAKPT+L NPNSG
Sbjct: 181 VENTSLSVIQPVPFQTADGTFRVLLRSFDDIGRVCMSESYDGGVTWGYAKPTELPNPNSG 240
Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
IDGVKLKDG +LLAYNT+SRGVLKVA+SKDDGDSWHD +TLEE L MEFSYPAVIQASDG
Sbjct: 241 IDGVKLKDGPVLLAYNTISRGVLKVAVSKDDGDSWHDVVTLEEKLDMEFSYPAVIQASDG 300
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
SVHITYTYNRTQIKHVVL P+
Sbjct: 301 SVHITYTYNRTQIKHVVLQPS 321
>gi|363808210|ref|NP_001242743.1| uncharacterized protein LOC100809430 [Glycine max]
gi|255636459|gb|ACU18568.1| unknown [Glycine max]
Length = 354
Score = 536 bits (1382), Expect = e-150, Method: Compositional matrix adjust.
Identities = 257/321 (80%), Positives = 282/321 (87%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M +C I+G + EEFT+ A SAPFKSCHASTIVEVDK HFLVAYFGG+ EGAPDVKIW+Q
Sbjct: 34 MASNCSIEGPLVEEFTYQAGSAPFKSCHASTIVEVDKDHFLVAYFGGTFEGAPDVKIWVQ 93
Query: 61 TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
T+K+GRWQ+P+IADEEPNVPMWNP LFKLPS+ LLLFYKIGQEVQKWSG MKRSY+KG T
Sbjct: 94 TYKNGRWQAPVIADEEPNVPMWNPALFKLPSDELLLFYKIGQEVQKWSGFMKRSYDKGKT 153
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
W+ REQLPPGILGP KNKPILLENG LLCGSSVESWNSWG+W+EVT D GRSW KYGPIY
Sbjct: 154 WTGREQLPPGILGPIKNKPILLENGNLLCGSSVESWNSWGAWVEVTTDFGRSWSKYGPIY 213
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
I N+ LSVIQPVP+ TA+ LRVL+RSF+GIGRVCMSES DGG TW YAKPTQL NPNSG
Sbjct: 214 IENKPLSVIQPVPYQTADGKLRVLLRSFDGIGRVCMSESSDGGKTWGYAKPTQLPNPNSG 273
Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
IDGVKL+DGRLLLAYNTVSR V KVALSKDDGDSW + LTLE+ MEFSYPAVIQ SDG
Sbjct: 274 IDGVKLRDGRLLLAYNTVSRSVPKVALSKDDGDSWCEVLTLEDTSGMEFSYPAVIQDSDG 333
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
+HITYTYNRTQIK VVL P+
Sbjct: 334 RIHITYTYNRTQIKRVVLRPS 354
>gi|356547785|ref|XP_003542289.1| PREDICTED: uncharacterized protein LOC100779946 [Glycine max]
Length = 366
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 255/319 (79%), Positives = 278/319 (87%)
Query: 2 KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
K +KG V EEFTFP S PF SCHASTIVEV KGHFLVAYFGGS EGAPDVKIWLQT
Sbjct: 40 KNSSYVKGPVVEEFTFPERSTPFNSCHASTIVEVGKGHFLVAYFGGSSEGAPDVKIWLQT 99
Query: 62 FKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
+K GRW+SPIIADEEPNVPMWNPVLFKLPSN LLLFYKIG +VQKWSG MKRSY+KG+TW
Sbjct: 100 YKSGRWKSPIIADEEPNVPMWNPVLFKLPSNELLLFYKIGLDVQKWSGFMKRSYDKGITW 159
Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
+EREQLP GILGP KNKP+LLENG LLCGSSVESWNSWG+W EVT D G +WRKYGPIYI
Sbjct: 160 TEREQLPSGILGPIKNKPLLLENGDLLCGSSVESWNSWGAWAEVTTDFGGAWRKYGPIYI 219
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
N+ VIQPVP+ TA TLRVL+RSF G+GR+ MSES DGG +W YAKPTQL NPNSGI
Sbjct: 220 ENKPHGVIQPVPYQTAKGTLRVLLRSFTGLGRIYMSESFDGGKSWGYAKPTQLPNPNSGI 279
Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
DGVKLKDGRLLLAYNT+SRGVLK+ALS+DDGDSWH+ALTLE+ L MEFSYPAVIQASDG
Sbjct: 280 DGVKLKDGRLLLAYNTISRGVLKLALSEDDGDSWHEALTLEDTLGMEFSYPAVIQASDGR 339
Query: 302 VHITYTYNRTQIKHVVLLP 320
VH+TYTY RTQIKHVV+ P
Sbjct: 340 VHVTYTYKRTQIKHVVVRP 358
>gi|357479849|ref|XP_003610210.1| hypothetical protein MTR_4g129200 [Medicago truncatula]
gi|355511265|gb|AES92407.1| hypothetical protein MTR_4g129200 [Medicago truncatula]
Length = 467
Score = 532 bits (1370), Expect = e-149, Method: Compositional matrix adjust.
Identities = 253/316 (80%), Positives = 279/316 (88%), Gaps = 1/316 (0%)
Query: 7 IKGLVAEEFTFPANSA-PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
I+GLV EEFTFPA S+ PF SCHASTIVEV KGHFLVAYFGG+ EGAPDVKIWLQT+K+G
Sbjct: 38 IQGLVVEEFTFPAKSSVPFNSCHASTIVEVGKGHFLVAYFGGTSEGAPDVKIWLQTYKNG 97
Query: 66 RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
WQ+P+I DEEPNVPMWNPVLFKLPSN LLLFYKIGQ+ QKWSG +KRSY+KG+TW+ERE
Sbjct: 98 IWQAPVIGDEEPNVPMWNPVLFKLPSNVLLLFYKIGQDFQKWSGFIKRSYDKGITWTERE 157
Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
QLPPGILGP KNKPILLENG LLCGSSVESWNSWGSW EVT D G++WRKYGPIYI N
Sbjct: 158 QLPPGILGPIKNKPILLENGDLLCGSSVESWNSWGSWAEVTTDFGKTWRKYGPIYIKNNP 217
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
L VIQPVP+ TA TLRVL+RSF GIGRV MSES DGG TW YA+PTQL NPNSGIDG+K
Sbjct: 218 LGVIQPVPYQTAKGTLRVLLRSFTGIGRVYMSESFDGGKTWGYAEPTQLHNPNSGIDGIK 277
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
LKDGRLL+AYNT+SRGVLK+ALS+DDGDSWH ALTLE+ + MEFSYPAVIQASDG VH+T
Sbjct: 278 LKDGRLLIAYNTISRGVLKLALSEDDGDSWHQALTLEDTVGMEFSYPAVIQASDGRVHVT 337
Query: 306 YTYNRTQIKHVVLLPN 321
YTYNRTQIK V+ N
Sbjct: 338 YTYNRTQIKAKVMQTN 353
>gi|357479845|ref|XP_003610208.1| hypothetical protein MTR_4g129180 [Medicago truncatula]
gi|355511263|gb|AES92405.1| hypothetical protein MTR_4g129180 [Medicago truncatula]
Length = 354
Score = 523 bits (1347), Expect = e-146, Method: Compositional matrix adjust.
Identities = 250/321 (77%), Positives = 280/321 (87%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M+ C I G + EE TF SAPF +CHASTIVE DK HFLVAYFGG+ EGAPDVKIWLQ
Sbjct: 34 MENRCPIYGPLTEELTFLEGSAPFNNCHASTIVEFDKDHFLVAYFGGTSEGAPDVKIWLQ 93
Query: 61 TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
T K+GRW+SP+IADE+PNVPMWNPVLFKLPS+ LLLFYKIGQEVQKWSG +KRSY+KG T
Sbjct: 94 TCKNGRWESPVIADEQPNVPMWNPVLFKLPSDELLLFYKIGQEVQKWSGFVKRSYDKGNT 153
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
W+EREQLPPGILGP KNKPILLENG L+CGSSVESWNSWG+W+EVT D GRSW K+GPIY
Sbjct: 154 WTEREQLPPGILGPIKNKPILLENGHLICGSSVESWNSWGAWVEVTSDFGRSWSKHGPIY 213
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
I NE LSVIQPVP+ TA+ LRVL+RSF+GIGRVC+SES DGG TW +AKPTQL NPNSG
Sbjct: 214 IENEPLSVIQPVPYQTADGKLRVLLRSFDGIGRVCISESSDGGQTWGHAKPTQLPNPNSG 273
Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
IDGVKL+DGR+LL YNT SRG+LKVALS DDGDSW++ALTLE++ MEFSYPAVIQASDG
Sbjct: 274 IDGVKLRDGRILLVYNTTSRGILKVALSNDDGDSWYEALTLEDSSGMEFSYPAVIQASDG 333
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
+HITYTY RTQIKHV L PN
Sbjct: 334 LIHITYTYKRTQIKHVTLRPN 354
>gi|449464334|ref|XP_004149884.1| PREDICTED: uncharacterized protein LOC101219119 [Cucumis sativus]
gi|449507997|ref|XP_004163188.1| PREDICTED: uncharacterized protein LOC101230872 [Cucumis sativus]
Length = 398
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 245/318 (77%), Positives = 273/318 (85%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M K + G + EEFTFPANS PF +CHASTIVEVDK H+LVAYFGG+ EGAPDVKIWLQ
Sbjct: 73 MGKGSYVDGPLQEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTLEGAPDVKIWLQ 132
Query: 61 TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
FK+G W SPI+ADEEP++PMWNPVLFKLPS+ LLLFYK+GQEVQKWSGCMKRSY+KG+T
Sbjct: 133 AFKNGSWHSPIVADEEPDIPMWNPVLFKLPSDELLLFYKVGQEVQKWSGCMKRSYDKGIT 192
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
W+ REQLPPGILGP KNKPILLENG+LLCGSSVESWNSWG+WMEVT D+GRSWRK+GPIY
Sbjct: 193 WTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEVTSDSGRSWRKFGPIY 252
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ N SLSVIQPVP+ TAN LRVL+RSF GI +CMSES DGG WS+A PT L NPNSG
Sbjct: 253 MKNRSLSVIQPVPYQTANGNLRVLLRSFTGIDSICMSESNDGGHNWSFAIPTNLPNPNSG 312
Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
IDGVKL+DGRLLL YNTVSRGVLKV LS DDGDSW D +TLE+ EFSYPAVI+ASD
Sbjct: 313 IDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDGDSWQDVMTLEDEPENEFSYPAVIEASDD 372
Query: 301 SVHITYTYNRTQIKHVVL 318
S+HITYTY R QIKHVV
Sbjct: 373 SIHITYTYRRKQIKHVVF 390
>gi|297793281|ref|XP_002864525.1| hypothetical protein ARALYDRAFT_495867 [Arabidopsis lyrata subsp.
lyrata]
gi|297310360|gb|EFH40784.1| hypothetical protein ARALYDRAFT_495867 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 515 bits (1327), Expect = e-144, Method: Compositional matrix adjust.
Identities = 247/309 (79%), Positives = 270/309 (87%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV + HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8 VLLETFTFPADSAPFKSCHASTIVEVVEDHFLTAYFGGTTEGAPDVKIWLQHFKDGQWDS 67
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPMWNPVLFKLPS LLLFYKIGQEVQKWSGCMKRSY+KG TW+EREQLPP
Sbjct: 68 PVIVDEEPGVPMWNPVLFKLPSQELLLFYKIGQEVQKWSGCMKRSYDKGRTWTEREQLPP 127
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGRSWRK GPIYI +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRSWRKQGPIYIQGKSLSVI 187
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LRVL+RSF GI ++C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGKLRVLLRSFTGIDKICISESSDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLKV +S DDGDSW D LTLEE+ MEFSYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKVGVSLDDGDSWTDVLTLEESPGMEFSYPAVIQAGDGNVHVTYTYN 307
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316
>gi|30696915|ref|NP_851204.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
gi|119360001|gb|ABL66729.1| At5g57700 [Arabidopsis thaliana]
gi|332009554|gb|AED96937.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
Length = 352
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/309 (78%), Positives = 272/309 (88%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 13 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 72
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 73 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 132
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 133 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 192
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 193 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 252
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 253 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 312
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 313 RTQIKHVVL 321
>gi|145334835|ref|NP_001078763.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
gi|332009556|gb|AED96939.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
Length = 358
Score = 513 bits (1320), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/309 (78%), Positives = 272/309 (88%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 19 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 78
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 79 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 138
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 139 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 198
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 199 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 258
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 259 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 318
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 319 RTQIKHVVL 327
>gi|238481582|ref|NP_001154784.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
gi|332009557|gb|AED96940.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
Length = 348
Score = 512 bits (1319), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/309 (78%), Positives = 272/309 (88%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 9 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 68
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 69 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 128
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 129 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 188
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 189 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 248
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 249 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 308
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 309 RTQIKHVVL 317
>gi|30696919|ref|NP_568864.2| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
gi|124301162|gb|ABN04833.1| At5g57700 [Arabidopsis thaliana]
gi|332009555|gb|AED96938.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
Length = 347
Score = 512 bits (1319), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/309 (78%), Positives = 272/309 (88%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316
>gi|21536870|gb|AAM61202.1| unknown [Arabidopsis thaliana]
Length = 347
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 243/309 (78%), Positives = 272/309 (88%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI ++C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDKICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307
Query: 310 RTQIKHVVL 318
RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316
>gi|9759267|dbj|BAB09588.1| unnamed protein product [Arabidopsis thaliana]
Length = 371
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 244/328 (74%), Positives = 272/328 (82%), Gaps = 19/328 (5%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 13 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 72
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 73 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 132
Query: 130 GILGPSKNK-------------------PILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
GILGP KNK PILLE+G LLCGSSVESWNSWG+WMEVT DAG
Sbjct: 133 GILGPIKNKVLVALRRLDYSIKTKPFVLPILLEDGTLLCGSSVESWNSWGAWMEVTSDAG 192
Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
R+WRK GPIYI +SLSVIQPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A
Sbjct: 193 RTWRKKGPIYIQGKSLSVIQPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAV 252
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
PT L NPNSGIDGVKLKDGRL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+S
Sbjct: 253 PTVLPNPNSGIDGVKLKDGRLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYS 312
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YPAVIQA DG+VH+TYTYNRTQIKHVVL
Sbjct: 313 YPAVIQAGDGNVHVTYTYNRTQIKHVVL 340
>gi|326520790|dbj|BAJ92758.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 418
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 230/312 (73%), Positives = 268/312 (85%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EE TF A SAPF SCHASTIVE+ K +FLVAYFGGS EGAPDV+IWLQ + DG W S
Sbjct: 102 IVKEELTFSAGSAPFSSCHASTIVEIGKDNFLVAYFGGSKEGAPDVRIWLQRYSDGHWHS 161
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G++WSEREQLPP
Sbjct: 162 PEVADEQDGVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGISWSEREQLPP 221
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI E+L VI
Sbjct: 222 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 281
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+R+L+RSF IGRVCM++S DGG+TWSYA T+L NPNSGIDGVK+KDG
Sbjct: 282 QPVPYQTANGTIRMLLRSFETIGRVCMADSADGGVTWSYAHKTELPNPNSGIDGVKMKDG 341
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+L+AYNT SRG+LK+A+S+DDGDSW++ +TLE+ MEFSYPAVI+ D VHITYTYN
Sbjct: 342 RILVAYNTFSRGILKIAVSRDDGDSWNEVMTLEDTKGMEFSYPAVIKTMDELVHITYTYN 401
Query: 310 RTQIKHVVLLPN 321
RTQIKHVVL P+
Sbjct: 402 RTQIKHVVLRPS 413
>gi|326514456|dbj|BAJ96215.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 391
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 230/312 (73%), Positives = 268/312 (85%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EE TF A SAPF SCHASTIVE+ K +FLVAYFGGS EGAPDV+IWLQ + DG W S
Sbjct: 75 IVKEELTFSAGSAPFSSCHASTIVEIGKDNFLVAYFGGSKEGAPDVRIWLQRYSDGHWHS 134
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G++WSEREQLPP
Sbjct: 135 PEVADEQDGVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGISWSEREQLPP 194
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI E+L VI
Sbjct: 195 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 254
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+R+L+RSF IGRVCM++S DGG+TWSYA T+L NPNSGIDGVK+KDG
Sbjct: 255 QPVPYQTANGTIRMLLRSFETIGRVCMADSADGGVTWSYAHKTELPNPNSGIDGVKMKDG 314
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+L+AYNT SRG+LK+A+S+DDGDSW++ +TLE+ MEFSYPAVI+ D VHITYTYN
Sbjct: 315 RILVAYNTFSRGILKIAVSRDDGDSWNEVMTLEDTKGMEFSYPAVIKTMDELVHITYTYN 374
Query: 310 RTQIKHVVLLPN 321
RTQIKHVVL P+
Sbjct: 375 RTQIKHVVLRPS 386
>gi|26452755|dbj|BAC43458.1| unknown protein [Arabidopsis thaliana]
Length = 310
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 237/302 (78%), Positives = 264/302 (87%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8 VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68 PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLE G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI +SLSVI
Sbjct: 128 GILGPIKNKPILLEGGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TA LR+L+RSF GI R+C+SES DGG WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+ ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307
Query: 310 RT 311
RT
Sbjct: 308 RT 309
>gi|115472361|ref|NP_001059779.1| Os07g0516000 [Oryza sativa Japonica Group]
gi|34393670|dbj|BAC83370.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
gi|50508733|dbj|BAD31309.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
gi|113611315|dbj|BAF21693.1| Os07g0516000 [Oryza sativa Japonica Group]
Length = 424
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 234/312 (75%), Positives = 263/312 (84%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EEFTFPA SAPF SCHASTIVE +K FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 108 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 167
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 168 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 227
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI E+L VI
Sbjct: 228 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 287
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+RVL+RSF IGRVCM++S DGG+TWSY T L NPNSGIDGVK+KDG
Sbjct: 288 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 347
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+LLAYNT SRG LKVA+S DDGDSW+D +TLE+ MEFSYPAVIQ D +HITYTYN
Sbjct: 348 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMTLEDTEGMEFSYPAVIQTMDDLIHITYTYN 407
Query: 310 RTQIKHVVLLPN 321
RTQIKHVVL P+
Sbjct: 408 RTQIKHVVLQPS 419
>gi|357122645|ref|XP_003563025.1| PREDICTED: uncharacterized protein LOC100823135 [Brachypodium
distachyon]
Length = 404
Score = 493 bits (1268), Expect = e-137, Method: Compositional matrix adjust.
Identities = 230/312 (73%), Positives = 263/312 (84%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EE TFPA +APF SCHASTIVEV+K +FLVAYFGGS EGAPDVKIWLQ + DG W
Sbjct: 88 ILKEELTFPAGTAPFNSCHASTIVEVEKDNFLVAYFGGSKEGAPDVKIWLQRYGDGYWHP 147
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLFKLPS LLLFYKIGQEVQKWSG MKRS + G++WS+REQLP
Sbjct: 148 PKVADEQDEVPMWNPVLFKLPSRELLLFYKIGQEVQKWSGAMKRSLDGGISWSQREQLPA 207
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIY+ E+L VI
Sbjct: 208 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYVEGETLGVI 267
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+RVL+RSF IGRVCM++S DGG+TWSY T+L NPNSGIDGVK+KDG
Sbjct: 268 QPVPYQTANGTIRVLLRSFETIGRVCMADSIDGGVTWSYVHKTELPNPNSGIDGVKMKDG 327
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+LLAYNT SRG LK+A+S DDGDSW + +TLE+ MEFSYPAVIQ D VHITYTYN
Sbjct: 328 RVLLAYNTFSRGTLKIAISTDDGDSWKEVMTLEDTKGMEFSYPAVIQTMDELVHITYTYN 387
Query: 310 RTQIKHVVLLPN 321
RTQIKHVVL P+
Sbjct: 388 RTQIKHVVLQPS 399
>gi|222637136|gb|EEE67268.1| hypothetical protein OsJ_24447 [Oryza sativa Japonica Group]
Length = 1184
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/305 (75%), Positives = 257/305 (84%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EEFTFPA SAPF SCHASTIVE +K FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 608 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 667
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 668 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 727
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI E+L VI
Sbjct: 728 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 787
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+RVL+RSF IGRVCM++S DGG+TWSY T L NPNSGIDGVK+KDG
Sbjct: 788 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 847
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+LLAYNT SRG LKVA+S DDGDSW+D +TLE+ MEFSYPAVIQ D +HITYTYN
Sbjct: 848 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMTLEDTEGMEFSYPAVIQTMDDLIHITYTYN 907
Query: 310 RTQIK 314
RTQIK
Sbjct: 908 RTQIK 912
Score = 360 bits (924), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 26/308 (8%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
T+ N K + +V K FLVAYFGG+ EGA DVKIWLQ + +G W SP +
Sbjct: 901 HITYTYNRTQIKFDLPTLSSQVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHSPEVV 960
Query: 74 DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG 133
DE PNVP+WNPVLF+LPS +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPPGILG
Sbjct: 961 DEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPPGILG 1020
Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
P KNK VT D G++W+KYGPIY+ +++ VIQPV
Sbjct: 1021 PIKNK--------------------------VTKDHGKTWKKYGPIYVRGKTMGVIQPVL 1054
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ T++ T+R+L+R + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDGR++L
Sbjct: 1055 YQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDGRVVL 1114
Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
YN+ SRGVLKVA+S+DDGD W D LTLEE +EFSYPAVIQ SDG VH+TYTY RTQI
Sbjct: 1115 VYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYKRTQI 1174
Query: 314 KHVVLLPN 321
KHVVL P+
Sbjct: 1175 KHVVLQPS 1182
>gi|218199713|gb|EEC82140.1| hypothetical protein OsI_26191 [Oryza sativa Indica Group]
Length = 684
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 228/305 (74%), Positives = 256/305 (83%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EEFTFPA SAPF SCHASTIVE +K FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 108 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 167
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P +ADE+ VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 168 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 227
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI E+L VI
Sbjct: 228 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 287
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ TAN T+RVL+RSF IGRVCM++S DGG+TWSY T L NPNSGIDGVK+KDG
Sbjct: 288 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 347
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R+LLAYNT SRG LKVA+S DDGDSW+D + LE+ MEFSYPAVIQ D +HITYTYN
Sbjct: 348 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMALEDTEGMEFSYPAVIQTMDDLIHITYTYN 407
Query: 310 RTQIK 314
RTQIK
Sbjct: 408 RTQIK 412
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 26/308 (8%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
T+ N K + +V K FLVAYFGG+ EGA DVKIWLQ + +G W SP +
Sbjct: 401 HITYTYNRTQIKFDLPTLSSQVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHSPEVV 460
Query: 74 DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG 133
DE PNVP+WNPVLF+LPS +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPPGILG
Sbjct: 461 DEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPPGILG 520
Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
P KNK VT D G++W+KYGPIY+ +++ VIQPV
Sbjct: 521 PIKNK--------------------------VTKDHGKTWKKYGPIYVRGKTMGVIQPVL 554
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ T++ T+R+L+R + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDGR++L
Sbjct: 555 YQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDGRVVL 614
Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
YN+ SRGVLKVA+S+DDGD W D LTLEE +EFSYPAVIQ SDG VH+TYTY RTQI
Sbjct: 615 VYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYKRTQI 674
Query: 314 KHVVLLPN 321
KHVVL P+
Sbjct: 675 KHVVLQPS 682
>gi|242045796|ref|XP_002460769.1| hypothetical protein SORBIDRAFT_02g034675 [Sorghum bicolor]
gi|241924146|gb|EER97290.1| hypothetical protein SORBIDRAFT_02g034675 [Sorghum bicolor]
Length = 411
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 221/316 (69%), Positives = 262/316 (82%), Gaps = 5/316 (1%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V +EFTFPA APF SCHASTIV++D+G+FLVAYFGGS EGAPDVKIWLQ + G W
Sbjct: 91 IVKKEFTFPARRAPFNSCHASTIVQIDEGNFLVAYFGGSTEGAPDVKIWLQRYSGGHWHP 150
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P++ADE+ VPMWNPVLF+LPS LLLFYKIGQEVQKWSG MKRS + G+TW REQLPP
Sbjct: 151 PVVADEQLGVPMWNPVLFQLPSRELLLFYKIGQEVQKWSGAMKRSLDGGLTWLPREQLPP 210
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT +AGR+WRKYGPIY+ + L VI
Sbjct: 211 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTQNAGRTWRKYGPIYVEGQPLGVI 270
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS-----GIDGV 244
QPVP+ TAN +RVL+RSF IGRVCM++S DGG+TWS+ + T+L NPNS GIDGV
Sbjct: 271 QPVPYRTANGIIRVLLRSFQTIGRVCMADSFDGGVTWSFVRGTELPNPNSDLKAEGIDGV 330
Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
K+KDGR++LAYNT SRG LK+A+S +DGDSW++ LTLE+ MEFSYPAVIQ D +H+
Sbjct: 331 KMKDGRVVLAYNTFSRGTLKLAVSLNDGDSWNEVLTLEDTSGMEFSYPAVIQTMDQLIHV 390
Query: 305 TYTYNRTQIKHVVLLP 320
TYTYNRTQIKHVV+ P
Sbjct: 391 TYTYNRTQIKHVVIKP 406
>gi|23617011|dbj|BAC20707.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
gi|50508734|dbj|BAD31310.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
Length = 371
Score = 442 bits (1137), Expect = e-122, Method: Compositional matrix adjust.
Identities = 205/311 (65%), Positives = 255/311 (81%), Gaps = 2/311 (0%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEFTF + PFK+CHASTIVEV K FLVAYFGG+ EGA DVKIWLQ + +G W S
Sbjct: 7 VLREEFTFSEGNVPFKTCHASTIVEVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHS 66
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P + DE PNVP+WNPVLF+LPS +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPP
Sbjct: 67 PEVVDEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPP 126
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT D G++W+KYGPIY+ +++ VI
Sbjct: 127 GILGPIKNKPFLLKDGHLLCGTSVESWNSWGAWLEVTKDHGKTWKKYGPIYVRGKTMGVI 186
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPV + T++ T+R+L+R + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDG
Sbjct: 187 QPVLYQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDG 246
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
R++L YN+ SRGVLKVA+S+DDGD W D LTLEE +EFSYPAVIQ SDG VH+TYTY
Sbjct: 247 RVVLVYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYK 306
Query: 310 RTQIKHVVLLP 320
RTQIK V+LP
Sbjct: 307 RTQIK--VVLP 315
>gi|302805374|ref|XP_002984438.1| hypothetical protein SELMODRAFT_268887 [Selaginella moellendorffii]
gi|300147826|gb|EFJ14488.1| hypothetical protein SELMODRAFT_268887 [Selaginella moellendorffii]
Length = 329
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/313 (64%), Positives = 242/313 (77%)
Query: 6 GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
G G+V EEF F APF SCHASTIVE++ HF+VAYFGG+ EG DV IW Q FK G
Sbjct: 7 GRIGIVKEEFLFEKGLAPFNSCHASTIVELEPEHFMVAYFGGTYEGDSDVVIWTQRFKHG 66
Query: 66 RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
W P +AD E VPMWNPVLFK+P +LLFYKIG EVQKWSG MKRS++ GVTWS R+
Sbjct: 67 VWSDPQVADSELGVPMWNPVLFKMPKGEVLLFYKIGPEVQKWSGFMKRSFDNGVTWSARQ 126
Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
LPPGILGP KNKP+LL +G LLCGSSVESW++WG+WMEVT D+G +WRK+GPIY+ +
Sbjct: 127 ALPPGILGPIKNKPLLLRDGRLLCGSSVESWDAWGAWMEVTEDSGYTWRKHGPIYVQHTP 186
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ VIQPVP+ T N T+RVL+R+ IG++CM+ S DGG +W+YA PTQL NPN G DGVK
Sbjct: 187 MGVIQPVPYLTDNGTIRVLLRATQEIGKICMASSSDGGRSWTYATPTQLPNPNCGFDGVK 246
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
L+DG LLL YNT SRG+LKV +S DDGD+W + +TLEE EFSY AVIQ+ G VH+T
Sbjct: 247 LRDGTLLLVYNTDSRGILKVGVSGDDGDTWTEEITLEEQEGGEFSYAAVIQSFTGLVHVT 306
Query: 306 YTYNRTQIKHVVL 318
YTYNR QIKHVVL
Sbjct: 307 YTYNRVQIKHVVL 319
>gi|168030356|ref|XP_001767689.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681009|gb|EDQ67440.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 358
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 199/313 (63%), Positives = 242/313 (77%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
++ EEF F APF SCHAS+IVE+ G FLVAYFGG+ EG DV IW F+ G W+
Sbjct: 37 AIIKEEFLFEDGKAPFPSCHASSIVELGSGTFLVAYFGGTHEGFSDVAIWTSRFEVGLWK 96
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
P + D EP+VP WNPVLFK+P LL+FYKIG+EVQKWSG +KRS + GVTWS+REQLP
Sbjct: 97 PPALVDREPDVPAWNPVLFKMPDGELLIFYKIGEEVQKWSGFIKRSSDNGVTWSDREQLP 156
Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
PGILGP+KNKP+L+++G LLCGSS ESWN+WG+WMEVT DAGR+W K+GPI++ L +
Sbjct: 157 PGILGPTKNKPLLIKDGRLLCGSSTESWNAWGAWMEVTADAGRTWAKHGPIHLSGTPLGI 216
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQPVPF TA+ T+RVL+R + +GR+CM+ S D GLTWS+A PT+L+N N G DGVKL D
Sbjct: 217 IQPVPFVTAHDTVRVLLRPSSMVGRICMATSQDAGLTWSFATPTELINCNCGFDGVKLHD 276
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
GRLL+ YNT SRG+LKVA+S DDG SW D LTLE+ EFSY AVI ASD +H TYTY
Sbjct: 277 GRLLILYNTTSRGILKVAISADDGLSWRDYLTLEDTEGCEFSYAAVILASDKHIHATYTY 336
Query: 309 NRTQIKHVVLLPN 321
R QIKHVVL P+
Sbjct: 337 KRRQIKHVVLQPS 349
>gi|302764718|ref|XP_002965780.1| hypothetical protein SELMODRAFT_84263 [Selaginella moellendorffii]
gi|300166594|gb|EFJ33200.1| hypothetical protein SELMODRAFT_84263 [Selaginella moellendorffii]
Length = 387
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 199/309 (64%), Positives = 238/309 (77%)
Query: 6 GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
G G+V EEF F APF SCHASTIVE++ HF+VAYFGG+ EG DV IW Q FK G
Sbjct: 7 GRIGIVKEEFLFEKGLAPFNSCHASTIVELEPEHFMVAYFGGTYEGDSDVVIWTQRFKHG 66
Query: 66 RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
W P +AD E VPMWNPVLFK+P +LLFYKIG EVQKWSG MKRS++ GVTWS R+
Sbjct: 67 VWSDPQVADSELGVPMWNPVLFKMPKGEVLLFYKIGPEVQKWSGFMKRSFDNGVTWSARQ 126
Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
LPPGILGP KNKP+LL +G LLCGSSVESW++WG+WMEVT D+G +WRK+GPIY+ +
Sbjct: 127 ALPPGILGPIKNKPLLLRDGRLLCGSSVESWDAWGAWMEVTEDSGYTWRKHGPIYVQHTP 186
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ VIQPVP+ T N T+RVL+R+ IG++CM+ S DGG +W+YA PTQL NPN G DGVK
Sbjct: 187 MGVIQPVPYLTDNGTVRVLLRATQEIGKICMASSSDGGRSWTYATPTQLPNPNCGFDGVK 246
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
L+DG LLL YNT SRG+LKV +S DDGD+W + +TLEE EFSY AVIQ+ G VH+T
Sbjct: 247 LRDGTLLLVYNTDSRGILKVGVSGDDGDTWTEEITLEEQEGGEFSYAAVIQSFTGLVHVT 306
Query: 306 YTYNRTQIK 314
YTYNR QIK
Sbjct: 307 YTYNRVQIK 315
>gi|259489888|ref|NP_001159249.1| uncharacterized protein LOC100304339 [Zea mays]
gi|223942995|gb|ACN25581.1| unknown [Zea mays]
gi|414590413|tpg|DAA40984.1| TPA: hypothetical protein ZEAMMB73_438581 [Zea mays]
Length = 360
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/274 (70%), Positives = 226/274 (82%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EE PA APF SCHASTIV++++ +FLVAYFGGS EGAPDVKIWLQ + DG W
Sbjct: 81 IVKEELILPARGAPFNSCHASTIVQIEEDNFLVAYFGGSREGAPDVKIWLQRYSDGHWHP 140
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P++ADE+ VPMWNPVLF+LPS LLLFYKIGQEVQKWSG MKRS + GVTWS REQLPP
Sbjct: 141 PVVADEQDGVPMWNPVLFQLPSRELLLFYKIGQEVQKWSGAMKRSLDGGVTWSPREQLPP 200
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAG +WRKYGPI I + L VI
Sbjct: 201 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGWTWRKYGPICIKGQPLGVI 260
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ T + T+RVL+RSF IG VCM++S D G+TWSY + T+L NPNSGIDGVK+KDG
Sbjct: 261 QPVPYRTDDGTIRVLLRSFETIGHVCMADSIDEGVTWSYVRETELPNPNSGIDGVKMKDG 320
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEE 283
R+LLAYNT SRG LK+A+S +DGDSW + +TLE+
Sbjct: 321 RVLLAYNTFSRGTLKLAVSLNDGDSWDEVMTLED 354
>gi|195643224|gb|ACG41080.1| hypothetical protein [Zea mays]
Length = 305
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 192/288 (66%), Positives = 229/288 (79%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEFTF A+ PF +CHASTIVEV+K FLV+YFGGS EGAPDVKIW Q + DG W
Sbjct: 14 IMKEEFTFQAHDRPFNNCHASTIVEVEKEIFLVSYFGGSIEGAPDVKIWTQRYSDGYWHP 73
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P++ADEE MWNPVLF+LPS+ LLLFYKIG+ Q WSG MKRS N G++W EREQLPP
Sbjct: 74 PVVADEENATAMWNPVLFQLPSHELLLFYKIGEHPQNWSGAMKRSLNGGMSWLEREQLPP 133
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT DAGR+W KYGPI+I E L VI
Sbjct: 134 GILGPIKNKPFLLDDGRLLCGTSVESWNSWGAWLEVTEDAGRTWSKYGPIFIQGEKLGVI 193
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ T N ++R+L+RS IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDG
Sbjct: 194 QPVPYQTTNGSIRMLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDG 253
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
R+ LAYNTVSRG LKVA+S DDG SW + +TLE EFSYPAVIQ
Sbjct: 254 RVALAYNTVSRGTLKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQT 301
>gi|212722798|ref|NP_001131252.1| hypothetical protein [Zea mays]
gi|194690998|gb|ACF79583.1| unknown [Zea mays]
gi|414590410|tpg|DAA40981.1| TPA: hypothetical protein ZEAMMB73_041549 [Zea mays]
Length = 308
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 192/288 (66%), Positives = 229/288 (79%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEFTF A+ PF +CHASTIVEV+K FLV+YFGGS EGAPDVKIW Q + DG W
Sbjct: 17 IMKEEFTFQAHDRPFNNCHASTIVEVEKEIFLVSYFGGSIEGAPDVKIWTQRYSDGYWHP 76
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P++ADEE MWNPVLF+LPS+ LLLFYKIG+ Q WSG MKRS N G++W EREQLPP
Sbjct: 77 PVVADEENATAMWNPVLFQLPSHELLLFYKIGEHPQNWSGAMKRSLNGGMSWLEREQLPP 136
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT DAGR+W KYGPI+I E L VI
Sbjct: 137 GILGPIKNKPFLLDDGRLLCGTSVESWNSWGAWLEVTEDAGRTWSKYGPIFIQGEKLGVI 196
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QPVP+ T N ++R+L+RS IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDG
Sbjct: 197 QPVPYQTTNGSIRMLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDG 256
Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
R+ LAYNTVSRG LKVA+S DDG SW + +TLE EFSYPAVIQ
Sbjct: 257 RVALAYNTVSRGTLKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQT 304
>gi|242045798|ref|XP_002460770.1| hypothetical protein SORBIDRAFT_02g034680 [Sorghum bicolor]
gi|241924147|gb|EER97291.1| hypothetical protein SORBIDRAFT_02g034680 [Sorghum bicolor]
Length = 216
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 151/211 (71%), Positives = 177/211 (83%)
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
MKRS + GVTWS+REQLPPGILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAG
Sbjct: 1 MKRSLDGGVTWSQREQLPPGILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAG 60
Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
R+WRKYGPI I + L VIQPVP+ T N T+RVL+RSF IGRVCM++S D G+TWSY
Sbjct: 61 RTWRKYGPICIEGQPLGVIQPVPYRTDNGTIRVLLRSFETIGRVCMADSVDEGVTWSYVH 120
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
T+L NPNSGIDGVK+KDGR+LLAYNT SRG LK+A+S +DGDSW + +TLE+ MEFS
Sbjct: 121 ETELPNPNSGIDGVKMKDGRVLLAYNTFSRGTLKIAVSSNDGDSWDEVMTLEDTKGMEFS 180
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
YPAVIQ+ D +HITYTYNRTQ+KHVVL P+
Sbjct: 181 YPAVIQSLDELIHITYTYNRTQVKHVVLQPS 211
>gi|297736658|emb|CBI25675.3| unnamed protein product [Vitis vinifera]
Length = 951
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 159/231 (68%), Positives = 183/231 (79%), Gaps = 18/231 (7%)
Query: 4 DCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK 63
D I+G V EEFTFP+NSAPF SCHASTIVEV K HFLVAYFGG+ EGAPDVKIWLQT++
Sbjct: 2 DHWIEGPVLEEFTFPSNSAPFNSCHASTIVEVGKLHFLVAYFGGTAEGAPDVKIWLQTYR 61
Query: 64 DGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+ N+ + ++F G+L K+ + Q WSGCMKRS++ GVTW+E
Sbjct: 62 E-------------NISLEKCIMF-----GILAHNKVSFKCQGWSGCMKRSFDGGVTWTE 103
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
REQLPPGILGP KNKPILLENGLLLCGSSVESWNSWG+WMEVT D+GRSWRKYGPI+I N
Sbjct: 104 REQLPPGILGPIKNKPILLENGLLLCGSSVESWNSWGAWMEVTEDSGRSWRKYGPIFIKN 163
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQL 234
E+LSVIQPVP+ TAN TLRVL+RSF+GI RVCMSES DGG +W YAKPT L
Sbjct: 164 ETLSVIQPVPYQTANGTLRVLLRSFDGIDRVCMSESHDGGQSWDYAKPTAL 214
>gi|406959033|gb|EKD86495.1| hypothetical protein ACD_37C00268G0002 [uncultured bacterium]
Length = 336
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 205/313 (65%), Gaps = 6/313 (1%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEF F PF SCHAST+ + G L A+F GS EGA DV IW T ++ +W
Sbjct: 25 IIKEEFIFI--DPPFASCHASTLTQTHSGTILCAWFAGSEEGANDVAIWFSTCEESKWDL 82
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P E VP WNP LF +PS +LLFYK G+ Q+WSG +KRS+++GV WSE E LP
Sbjct: 83 PKKIAETEGVPCWNPALFTMPSKEILLFYKTGRNPQQWSGLLKRSWDEGVHWSEEECLPA 142
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GI+GP+KNKP+LL NG LLCGSS+ESW WG W+++T D G +W K PI + + +I
Sbjct: 143 GIIGPAKNKPLLLPNGTLLCGSSIESWRRWGCWIDITADVGHTWHKSSPINVDAQLFGII 202
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP F N +L++L RS + IG +C +ES D G TWS A+P L NPNS ID V + DG
Sbjct: 203 QPTLFFGKNDSLKLLARS-HQIGSICTAESYDQGETWSKAQPISLPNPNSAIDAVNIMDG 261
Query: 250 RLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
R++L YN R L +A+SKD G +W+ + LEE E+SYP+VIQ+ D +HITYT
Sbjct: 262 RIVLVYNHSKEERYPLNIAVSKDGGGTWNTEVVLEEEPG-EYSYPSVIQSLDKEIHITYT 320
Query: 308 YNRTQIKHVVLLP 320
+NR IKHV+L P
Sbjct: 321 WNRKYIKHVILDP 333
>gi|359495950|ref|XP_002263578.2| PREDICTED: uncharacterized protein LOC100263718, partial [Vitis
vinifera]
Length = 216
Score = 303 bits (776), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 153/223 (68%), Positives = 173/223 (77%), Gaps = 26/223 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEFTFP+NSAPF SCHASTIVEV K HFLVAYFGG+ EGAPDVKIWLQT+
Sbjct: 19 IMLEEFTFPSNSAPFNSCHASTIVEVGKLHFLVAYFGGTAEGAPDVKIWLQTY------- 71
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
NP++ L + +K G WSGCMKRS++ GVTW+EREQLPP
Sbjct: 72 -------------NPLIVHLDRR-QIKGWKTG-----WSGCMKRSFDGGVTWTEREQLPP 112
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKPILLENGLLLCGSSVESWNSWG+WMEVT D+GRSWRKYGPI+I NE+LSVI
Sbjct: 113 GILGPIKNKPILLENGLLLCGSSVESWNSWGAWMEVTEDSGRSWRKYGPIFIKNETLSVI 172
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
QPVP+ TAN TLRVL+RSF+GI RVCMSES DGG +W YAKPT
Sbjct: 173 QPVPYQTANGTLRVLLRSFDGIDRVCMSESHDGGQSWDYAKPT 215
>gi|116623694|ref|YP_825850.1| hypothetical protein Acid_4606 [Candidatus Solibacter usitatus
Ellin6076]
gi|116226856|gb|ABJ85565.1| hypothetical protein Acid_4606 [Candidatus Solibacter usitatus
Ellin6076]
Length = 321
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 195/308 (63%), Gaps = 5/308 (1%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
L A+ F A+ P SCHASTIVE+ G + A+FGG+ EG PDV IW G+WQ
Sbjct: 13 LAAQTSEFVADPMPTPSCHASTIVELASGDLMTAWFGGAGEGRPDVAIWGARRHAGKWQP 72
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+ EPN+ WNPVLF L L+YK G Q+W+ S + GVTWS+ E LP
Sbjct: 73 PVELAREPNIATWNPVLFHSGDGLLWLYYKFGPSPQQWTAARMWSRDDGVTWSKAEHLPA 132
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
G+ G + KP++L GL++ GSSVES+NSW +W+E + D G++W ++GPI +P E +I
Sbjct: 133 GLYGAIRAKPLVLPKGLIVSGSSVESYNSWAAWVERSTDFGKTWTRHGPI-VPPEG-GII 190
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP LR RS + R+C+S+S D G+TW+ A+ T+L NPNSGID V+LKDG
Sbjct: 191 QPSVVQVEGDHLRFYARS-DKAARICVSDSYDAGITWTPARVTELPNPNSGIDVVRLKDG 249
Query: 250 RLLLAYNTVSRGVLKVALS-KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
R L YN + RG + L+ +DGD W TLE + E+SYPA+IQASDG +HITYT+
Sbjct: 250 RYLAVYNHIPRGRTPLNLAISNDGDHWTPLKTLESDPG-EYSYPAMIQASDGLIHITYTW 308
Query: 309 NRTQIKHV 316
R +IKHV
Sbjct: 309 RRQKIKHV 316
>gi|338734050|ref|YP_004672523.1| alpha-rhamnosidase-like protein [Simkania negevensis Z]
gi|336483433|emb|CCB90032.1| alpha-rhamnosidase-like protein [Simkania negevensis Z]
Length = 348
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 204/332 (61%), Gaps = 23/332 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+ L+ +EF F ++APF+SCHAST+ E ++G +VAYF GS EG DV I+L D +W
Sbjct: 20 QTLLVDEFLF--SNAPFESCHASTLTETEEG-LIVAYFAGSKEGNSDVSIYLSRQCDNKW 76
Query: 68 QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
Q+P+ E+ P WNPVLF +PS +LLFYK G + +WSG + S + G TWS L
Sbjct: 77 QAPVKVIEDWGAPTWNPVLFTMPSGKILLFYKAGYDPTRWSGFLTSSIDAGQTWSRPFLL 136
Query: 128 PPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI-------- 179
P GILGP KNKP+LL++G LLCGSS++S+ +W E T D G +W + PI
Sbjct: 137 PGGILGPVKNKPLLLQDGRLLCGSSIQSYLNWACSFEWTRDEGLTWERSNPIPYFEERRA 196
Query: 180 -YIPNES-------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
+ P++ + VIQP + + + +L RS IG +C + S DGG TW+ A P
Sbjct: 197 PFFPDKKSASKDRPIGVIQPTFWTEDGQHITMLCRS-RRIGWICKATSSDGGRTWTRAYP 255
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
T+L NP+SG D V++ DGR+ L YN R L +ALS D G++W D L LE+ F
Sbjct: 256 TELPNPDSGFDAVRMFDGRIALVYNHSKTKRTPLNLALSIDGGETWKDVLVLEDGPG-SF 314
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+YPA+IQ DG +HITYT+NR IKH+ L P
Sbjct: 315 AYPAIIQTQDGLLHITYTWNRKHIKHIALDPT 346
>gi|390956309|ref|YP_006420066.1| putative neuraminidase (sialidase) [Terriglobus roseus DSM 18391]
gi|390411227|gb|AFL86731.1| putative neuraminidase (sialidase) [Terriglobus roseus DSM 18391]
Length = 361
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 194/319 (60%), Gaps = 20/319 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK-DGRWQSPIIADE 75
F +APF S HASTIVEV KG FL A+FGG+ EGAPDV IW+ DG W +PI
Sbjct: 35 FIYQNAPFPSAHASTIVEVGKGEFLSAWFGGTKEGAPDVAIWMSRRSVDGAWSAPIEMAR 94
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
EP P WNPVLF + L L++K G +W+ + S + G TWS+ E LP G+ GP
Sbjct: 95 EPGTPTWNPVLFHAKTGRLWLYFKYGPHPAEWTAARRYSDDDGKTWSQVEHLPAGVYGPI 154
Query: 136 KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI------------PN 183
+ KP + +G ++ GSSVES++SW W+E + D G +W + GPI + P+
Sbjct: 155 RAKPYVAPDGTIVSGSSVESYHSWSVWVERSTDNGMTWTRTGPITLPELLHAPASALDPD 214
Query: 184 ESLSVIQP--VPFHTAN--RTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+++ +IQP +P + LR+ MRS I ++C+++S DGG TWS A+ + NPNS
Sbjct: 215 KAIGLIQPTVIPMNDGKPGMHLRLYMRSSIQINKICVADSTDGGKTWSDARKLNVPNPNS 274
Query: 240 GIDGVKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GID V+LKD R +L YN R L +A+SK DG+ + + +E EFSYPA+IQ
Sbjct: 275 GIDIVRLKDRRFVLLYNDTPTGRTPLNLAVSK-DGEHFTNFAIVENENPGEFSYPALIQG 333
Query: 298 SDGSVHITYTYNRTQIKHV 316
+DG++H+TYT+ R I++
Sbjct: 334 TDGNLHMTYTWQRKTIRYA 352
>gi|326799722|ref|YP_004317541.1| exported exo-alpha-sialidase [Sphingobacterium sp. 21]
gi|326550486|gb|ADZ78871.1| exported exo-alpha-sialidase [Sphingobacterium sp. 21]
Length = 377
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/327 (44%), Positives = 196/327 (59%), Gaps = 22/327 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+G++ EEF + A F SCH++TI E G + A+FGG+ E PDV+I++ F++G W
Sbjct: 27 QGILKEEFLY--EKAAFPSCHSATIAETPTG-LVAAFFGGTKERHPDVEIYVCRFENGTW 83
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P+ + E +P WNPVL+++P LLLFYKIG + W G +KRS + G TWS
Sbjct: 84 TAPVSAADGVQAEGKRLPTWNPVLYQIPGGDLLLFYKIGPKPSAWWGMLKRSKDGGKTWS 143
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
E+LP G +GP KNKP+LL+NG L C SS E N W EVT D ++WRK GP+
Sbjct: 144 TAEKLPNGHIGPVKNKPVLLDNGQLFCPSSTEG-NGWKIHFEVTPDFAKTWRKIGPLAA- 201
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+S+ IQP NRTL++L RS N + S S D G TWS + T L N NSG D
Sbjct: 202 KDSIDAIQPSILIHGNRTLQLLARSRNRA--LVQSWSTDNGETWSPLEKTSLPNNNSGTD 259
Query: 243 GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
V + DGR +L YN V R L V++SK DG SW AL LE++ ++SYPA
Sbjct: 260 AVTMADGRHVLVYNHVLPPGNLAKGPRTPLNVSVSK-DGKSWSAALILEDSPISQYSYPA 318
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
VIQ DG +H YT+ R +IKHVV+ P
Sbjct: 319 VIQTKDGLLHFIYTWRRQKIKHVVVDP 345
>gi|325103077|ref|YP_004272731.1| exported exo-alpha-sialidase [Pedobacter saltans DSM 12145]
gi|324971925|gb|ADY50909.1| exported exo-alpha-sialidase [Pedobacter saltans DSM 12145]
Length = 383
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/327 (42%), Positives = 203/327 (62%), Gaps = 21/327 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
KG+V EF + + APF SCH++TI E G + A+FGG+ E PDV+I++ ++G+W
Sbjct: 32 KGIVKNEFIY--DKAPFPSCHSATIAETPTG-LVYAFFGGTHERHPDVEIYVSRQENGKW 88
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P+ I ++ +P WNPVL+++P+ L+LFYK+G + +W G +KRS + G TWS
Sbjct: 89 LTPVSAANGIQEDGKRLPTWNPVLYQIPNGDLMLFYKVGPKPSEWWGMLKRSKDGGKTWS 148
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
E+LP G LGP KNKP+LL NG L+C +S E N W E+T D G++WRK GP+
Sbjct: 149 AAEKLPEGYLGPVKNKPVLLSNGNLICPTSTEG-NGWNIHFEITSDFGKTWRKVGPVGRG 207
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++++ IQP N L++L R+ N + S S D G TW+ K T L N NSG D
Sbjct: 208 EDNINAIQPSVLDHGNGKLQILARTRNWA--IAESWSYDNGETWTPLKKTTLPNNNSGTD 265
Query: 243 GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
V +K+G+ +L YN V +R L VA+SK DG +W+ L LE++ ++SYPA
Sbjct: 266 AVTMKNGKHVLVYNHVLPPKDNPKGARTPLNVAVSK-DGVNWNAVLILEDSPISQYSYPA 324
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
VIQ SDG +H YT+ R ++KHVV+ P
Sbjct: 325 VIQTSDGLLHFAYTWRREKMKHVVVDP 351
>gi|409100320|ref|ZP_11220344.1| exported exo-alpha-sialidase [Pedobacter agri PB92]
Length = 368
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 23/327 (7%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G+V +EF + + APF SCH++TI E G + ++FGG+ E PDV+I++ F DG+W
Sbjct: 30 GVVKQEFLY--DKAPFPSCHSATIAETPTG-LVASFFGGTKERNPDVEIYISRFIDGKWL 86
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P+ + + +P WNPVL+++P LLLFYKIG + +W G MK S + G+TWSE
Sbjct: 87 APVSVANGVMPDGKRLPTWNPVLYQVPGGDLLLFYKIGPKPSEWWGLMKTSKDGGITWSE 146
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
+LP G +GP KNKP+LL NG L SS E N W EVT D G++WR GP +P
Sbjct: 147 ATKLPDGYIGPVKNKPVLLSNGNLFAPSSKEG-NGWKIHFEVTKDNGKTWRTVGP--LPE 203
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+ IQP N L++L R+ N + S S D G TWS T L N NSG D
Sbjct: 204 NGIKAIQPSILQHGNGKLQILARTANRA--IVESWSEDNGETWSPLTKTSLPNNNSGTDA 261
Query: 244 VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
V +KDGR +L YN V R L V++S+ DG +W AL LE++ ++SYPAV
Sbjct: 262 VTMKDGRHVLVYNHVLPPGDLAKGPRTPLNVSVSR-DGKTWSAALILEDSPISQYSYPAV 320
Query: 295 IQASDGSVHITYTYNRTQIKHVVLLPN 321
IQ SDG +H YT+ R +IKHVV+ P+
Sbjct: 321 IQTSDGMLHFIYTWRREKIKHVVVDPS 347
>gi|431795896|ref|YP_007222800.1| neuraminidase [Echinicola vietnamensis DSM 17526]
gi|430786661|gb|AGA76790.1| putative neuraminidase (sialidase) [Echinicola vietnamensis DSM
17526]
Length = 366
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 22/323 (6%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EEF F ++APF+ CHAST+V + G + A+FGG E P+V IW+ W
Sbjct: 46 VITEEFVF--DNAPFRECHASTLVGLKDGSIMAAWFGGEYERHPNVGIWIARKTGTGWSQ 103
Query: 70 PI-IADEEPN----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P +AD + N P WNPVLF + + L+LFYK G Q+W G K S ++G TWS
Sbjct: 104 PQKVADGQVNDTLSYPTWNPVLFNMTPDSLVLFYKEGPSPQEWWGMYKVSVDEGKTWSSA 163
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
++LP GILGP KNKPI L NG +L SSVE + W + +E+T D G++W+K + I +E
Sbjct: 164 QKLPEGILGPIKNKPITLRNGAVLAPSSVEDEDGWRAHVEITEDHGKTWKK---VAIDHE 220
Query: 185 -SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
VIQP A+ L+VL RS R+ + S D G +W T++LNPNSG D
Sbjct: 221 GEYDVIQPSILQHADGRLQVLCRSQQ--NRIITAWSEDQGRSWGQLSATEVLNPNSGTDA 278
Query: 244 VKLKDGRLLLAYN--------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
V L DGR LL YN + R L++A+S DG SW+D +TLE + EFSYPA+I
Sbjct: 279 VTLADGRFLLVYNPEVSGKDWSDGRNELRLAVST-DGVSWNDVMTLEHEPSGEFSYPAII 337
Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
Q +DG VHI+YT+ R +IK+VVL
Sbjct: 338 QTTDGKVHISYTWKRKKIKYVVL 360
>gi|149176178|ref|ZP_01854794.1| hypothetical protein PM8797T_29628 [Planctomyces maris DSM 8797]
gi|148845045|gb|EDL59392.1| hypothetical protein PM8797T_29628 [Planctomyces maris DSM 8797]
Length = 350
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 190/319 (59%), Gaps = 17/319 (5%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
GLV +EF + + APF SCHASTI E +G + A+FGG+ E PDV+IWL W
Sbjct: 33 GLVMQEFIY--DQAPFPSCHASTIAETPEG-LVCAFFGGTAEKNPDVEIWLSRNTGDGWS 89
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P+ + D P WNPVLF+ LLLFYK+G +W G +K S++ G TW +
Sbjct: 90 APVSVADGVKDATKRWPCWNPVLFQTKPGTLLLFYKVGPNPSEWWGMLKISHDNGKTWGK 149
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
+LP G +GP KNKP LL +G LLC +S E +SW ME T D G++W + GP+
Sbjct: 150 ARRLPDGFVGPVKNKPFLLSDGTLLCPASTEH-DSWQLQMEWTPDLGKTWHRTGPLNDGR 208
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
E + IQP ++ L++L RS G++ + S D G +WS PT L NPNSG D
Sbjct: 209 E-IGAIQPSVLQYGDK-LQILCRSRQ--GKIVEAWSEDNGRSWSKVTPTSLPNPNSGTDA 264
Query: 244 VKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
V LKDGR LL YN +G L VA+S +DG W L LE+ E+SYPAVIQ +DG
Sbjct: 265 VTLKDGRALLVYNPTKKGRSPLHVAIS-EDGKHWKTGLVLEDKKG-EYSYPAVIQTADGK 322
Query: 302 VHITYTYNRTQIKHVVLLP 320
VHITYT+ R +KHVV+ P
Sbjct: 323 VHITYTWRRELVKHVVIDP 341
>gi|332666300|ref|YP_004449088.1| BNR/Asp-box repeat domain-containing protein [Haliscomenobacter
hydrossis DSM 1100]
gi|332335114|gb|AEE52215.1| BNR/Asp-box repeat domain protein [Haliscomenobacter hydrossis DSM
1100]
Length = 338
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 188/328 (57%), Gaps = 24/328 (7%)
Query: 9 GLVAEEFTFPAN----SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
GL A+ F + APF SCHAST+ E +G + A+FGG+ E DV IW +
Sbjct: 13 GLTAQSINFTSEFIYEQAPFPSCHASTLAETPQG-LIAAWFGGTHERHKDVGIWSSRLVN 71
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W SP+ + + P WNPVLF++P LLLFYK+G W G + RS + G+
Sbjct: 72 GKWTSPVELANGVQNASLRHPCWNPVLFQMPGAELLLFYKVGPSPADWWGMLMRSTDGGL 131
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
+WS E+LP GILGP KNKP+LL++G LLC SS E + W E T DAGR+W+K I
Sbjct: 132 SWSTPEKLPEGILGPIKNKPVLLKDGTLLCPSSSEH-DGWRVHFESTKDAGRTWQKTAAI 190
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
E S IQP A+ L++L RS NG V + S D G TWS K T + NPNS
Sbjct: 191 NDGKE-FSAIQPSVLFHADGRLQILCRSKNGF--VLEAFSKDNGQTWSPLKKTSIPNPNS 247
Query: 240 GIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
G D V L+DGR LL YN V+ R L V+LS DG W LE+ EFS
Sbjct: 248 GTDAVTLRDGRQLLVYNHVTKRSRQWGGDRSPLNVSLST-DGKHWKSMTVLEQEPKQEFS 306
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YPAVIQ DG VHITYT+ R +IK+V +
Sbjct: 307 YPAVIQTQDGRVHITYTWKRQRIKYVSM 334
>gi|255532668|ref|YP_003093040.1| exported exo-alpha-sialidase [Pedobacter heparinus DSM 2366]
gi|255345652|gb|ACU04978.1| exported exo-alpha-sialidase [Pedobacter heparinus DSM 2366]
Length = 378
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 23/327 (7%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G++ +EF + A F SCH++TI E G + AYFGG+ E PDV+I++ +G W
Sbjct: 31 GIITDEFLY--EKAAFPSCHSATIAETPTG-LVAAYFGGTHERHPDVEIYVSRQVNGTWL 87
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P+ I +++ +P WNPVL+++P LLLFYKIG + +W G M+ S + G+TWSE
Sbjct: 88 APVSVANGIQNDKVRLPTWNPVLYQVPGGELLLFYKIGPKPSEWWGMMRSSKDGGITWSE 147
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
++LP G +GP KNKP+LL NG L C SS E W EVT D G++WR GP +
Sbjct: 148 AQKLPEGQIGPVKNKPVLLSNGNLFCPSSTEG-KGWKVHFEVTKDNGKTWRLIGP--LEG 204
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
++ IQP N L++L RS N + S S D G TWS T L N NSG D
Sbjct: 205 GEINAIQPSILDHGNGKLQILARSRNRA--IVESWSQDNGETWSALAKTSLPNNNSGTDA 262
Query: 244 VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
V +KDGR +L YN V +R L VA+SK DG +W AL LE++ ++SYPAV
Sbjct: 263 VTMKDGRHVLVYNHVLPPGDLAKGARTPLNVAISK-DGKNWSAALILEDSPTSQYSYPAV 321
Query: 295 IQASDGSVHITYTYNRTQIKHVVLLPN 321
IQ SDG +H YT+ R +IKHVV+ P+
Sbjct: 322 IQTSDGLLHFIYTWRREKIKHVVVDPS 348
>gi|338209632|ref|YP_004653679.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336303445|gb|AEI46547.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 806
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 186/324 (57%), Gaps = 23/324 (7%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV EEF F ++ P SCHAS+IV++ H + +FGG+ EG PDV IWL ++ G+W
Sbjct: 27 LVKEEFIF--DTPPVPSCHASSIVQLGDQHLMATWFGGTAEGKPDVTIWLAEYRKGKWGQ 84
Query: 70 PI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P I D P WNPVLFK L+L+YK+G ++W G ++ S + G WS
Sbjct: 85 PKQMADGIMDAGKRYPCWNPVLFKTREGKLMLYYKVGPNPREWWGMVRYSTDNGKNWSAP 144
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIYIP 182
E+LP GI+GP KNKPI L NG +L +S ES N W +E T G++W+K I I
Sbjct: 145 EKLPEGIMGPIKNKPIQLPNGDILHPTSTESVTGNIWHVHLEKTDKNGKNWQK---ISID 201
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
N IQP + +++L RS + V + S DGG TWS L NPNSG D
Sbjct: 202 NGDFGAIQPSILTYPDGKMQMLCRSRQNV--VVQTWSSDGGKTWSPVSALSLPNPNSGTD 259
Query: 243 GVKLKDGRLLLAYNTVSRGV--------LKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
V LK+G +L YN + RG L+VA SK DG +W D TLE+ EFSYPAV
Sbjct: 260 AVTLKNGTQILIYNPLIRGSDWAKGRNKLRVAASK-DGTNWQDIYTLEDEGKGEFSYPAV 318
Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
IQ +DG VHITYT+ R +KHVVL
Sbjct: 319 IQTTDGLVHITYTHERKNVKHVVL 342
>gi|86142035|ref|ZP_01060559.1| putative neuraminidase [Leeuwenhoekiella blandensis MED217]
gi|85831598|gb|EAQ50054.1| putative neuraminidase [Leeuwenhoekiella blandensis MED217]
Length = 351
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 185/326 (56%), Gaps = 25/326 (7%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V E F + APF CHAST+VE G + A+FGG E P+V I+ D W +
Sbjct: 31 IVQEGFIY--EEAPFPQCHASTLVEAQDGSIIAAWFGGEYERHPEVSIYQSKLTDTVWST 88
Query: 70 PI-IAD-----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
P IAD + + P WNPVLFK N L+L+YK G W G +K S + G +WS
Sbjct: 89 PQKIADGKVENDTLSYPTWNPVLFKNAENTLMLYYKEGPSPSSWWGKLKTSEDNGASWSA 148
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYI 181
EQLP GILGP KNKPI L NG ++ SSVES + W S +E++ D G +W + + I
Sbjct: 149 SEQLPDGILGPIKNKPIQLANGSIVSPSSVESEDGAVWKSHIELSSDNGYTWER---VAI 205
Query: 182 PN-ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
P+ +++ VIQP N L+ L+RS + SES D G TWS A +LNPNSG
Sbjct: 206 PSVDTVKVIQPTLIQLQNGNLKALLRSDQNY--ILESESSDQGKTWSEATKGTMLNPNSG 263
Query: 241 IDGVKLKDGRLLLAYNTVSRGV--------LKVALSKDDGDSWHDALTLEENLAMEFSYP 292
ID V LK G LL YN G L +A S DG W D L LE EFSYP
Sbjct: 264 IDAVTLKSGNFLLVYNPTEAGADWSDGRNKLNLAYST-DGSHWEDVLQLENEAEGEFSYP 322
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
A+IQ S+G VHITYT+NR++IK++ L
Sbjct: 323 AIIQDSEGLVHITYTHNRSKIKYLQL 348
>gi|326801668|ref|YP_004319487.1| neuraminidase [Sphingobacterium sp. 21]
gi|326552432|gb|ADZ80817.1| putative neuraminidase [Sphingobacterium sp. 21]
Length = 343
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 188/323 (58%), Gaps = 21/323 (6%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ +E F + PFK CHAST++EV G L A FGGS EG DV IWL + +
Sbjct: 27 IIKQELVF--TNPPFKECHASTLLEVKPGILLAACFGGSQEGKEDVSIWLCKINSEKIST 84
Query: 70 P-IIADEEPN----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
IIAD N P WNPVLFK + LFYK+G + W G +K S + G+ WS
Sbjct: 85 AQIIADGVINDTLRYPTWNPVLFKSQAGKTFLFYKVGPNPRAWWGMVKTSTDDGLNWSSP 144
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
++L GILGP KNKPI L NG +L SSVE + N W + +E++ D G SW+ Y PI +
Sbjct: 145 QKLTDGILGPIKNKPIQLANGYILSPSSVEVTENKWRAHIELSKDDGESWQ-YIPIDTAS 203
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
E +VIQP N+ L+VL RS G + S S DGGL W T LLNPNSG D
Sbjct: 204 E-YNVIQPSILTYKNQRLQVLCRSKE--GSIIQSWSEDGGLHWGSLNKTALLNPNSGTDA 260
Query: 244 VKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
V L+DGR L+ YN +RG L++A S DG W+D LE+ E+SYPA+I
Sbjct: 261 VTLRDGRQLIVYNPDVPGKEWFNNRGKLRIACSM-DGQKWNDVAVLEDGTNEEYSYPAII 319
Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
Q SDG +H+ YTY+R IKHVVL
Sbjct: 320 QTSDGLIHVLYTYDRKNIKHVVL 342
>gi|283778151|ref|YP_003368906.1| hypothetical protein Psta_0356 [Pirellula staleyi DSM 6068]
gi|283436604|gb|ADB15046.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
Length = 357
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 193/321 (60%), Gaps = 18/321 (5%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G++ EF N P CHA+TI E G + A+FGG+ EG DV IW+ ++G+WQ
Sbjct: 40 GVLVSEFI--DNPTPTPQCHATTIAETPAG-LVAAWFGGTREGNKDVVIWVARHREGKWQ 96
Query: 69 SPI------IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
P+ ++D E + P WNPVLF++P LLLFYK+G W G + S + G TW
Sbjct: 97 PPVEVASGKVSDTERH-PTWNPVLFQMPDGPLLLFYKVGPSPSTWWGMLTSSSDNGDTWE 155
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
+LP I GP KNKPILL+N LLC SS E N W ME+T DAG++W + +
Sbjct: 156 VPTKLPADIAGPIKNKPILLDNKTLLCPSSTED-NGWRVHMEMTTDAGKTWTRTEAL-CD 213
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+S+ IQP ++ L++L R+ G++ S S DGG TWS + L+NPNSGID
Sbjct: 214 GKSVQAIQPTVLRHGDK-LQMLCRT-RLPGKIVESWSEDGGKTWSKLESIALVNPNSGID 271
Query: 243 GVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
G+ LK G LL YN ++R L +ALS DG +W+D L LE + E+SYPA+IQ SDG
Sbjct: 272 GLTLKSGEHLLVYNHTLIARSPLNLALSS-DGKTWNDILDLETSKG-EYSYPAIIQTSDG 329
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
VHITYT+ R + KHVV+ P+
Sbjct: 330 LVHITYTWKRLRAKHVVIDPS 350
>gi|373958303|ref|ZP_09618263.1| exported exo-alpha-sialidase [Mucilaginibacter paludis DSM 18603]
gi|373894903|gb|EHQ30800.1| exported exo-alpha-sialidase [Mucilaginibacter paludis DSM 18603]
Length = 411
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 192/329 (58%), Gaps = 23/329 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR 66
KG++ +EF + APF CHA+TI E D+G + A+FGG+ E PDV IW+ + K+G+
Sbjct: 61 KGILTDEFIY--EKAPFPECHAATIAETDEG-LVAAWFGGTKERNPDVCIWISRQKKNGK 117
Query: 67 WQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
W +P I + WNPVL+++P L+LFYKIG + W G K S + G TW
Sbjct: 118 WSTPENVANGIQPDTTRYACWNPVLYQVPHGQLILFYKIGPKPATWKGFYKTSDDGGKTW 177
Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
SE++ LP ILGP KNKP+LL NG LL SS E W E++ D G++W+ GP+
Sbjct: 178 SEQQALPNSILGPIKNKPVLLSNGQLLSPSSTEE-KGWDVHFEMSPDNGKTWKLIGPV-A 235
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
E+ IQP N +L+ L RS + + + S D GLTWS T L N NSG
Sbjct: 236 DGETQGGIQPSILFHKNGSLQALCRSKDRA--IVETWSTDNGLTWSPLAKTSLPNNNSGT 293
Query: 242 DGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V LKDGR LL YN V +R L VALSK DG W +L LE++ ++SYP
Sbjct: 294 DAVTLKDGRQLLVYNHVLPPGKLAKGARTPLNVALSK-DGKKWFASLILEDSPISQYSYP 352
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLPN 321
AVIQ SDG VH YT+ R +IKHVV+ P+
Sbjct: 353 AVIQTSDGLVHFVYTWRRQRIKHVVVDPS 381
>gi|255035915|ref|YP_003086536.1| BNR repeat-containing glycosyl hydrolase [Dyadobacter fermentans
DSM 18053]
gi|254948671|gb|ACT93371.1| BNR repeat-containing glycosyl hydrolase [Dyadobacter fermentans
DSM 18053]
Length = 361
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 140/328 (42%), Positives = 191/328 (58%), Gaps = 21/328 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+G++ +E F +APF HASTI E G + A+FGG+ E PDV IW+ ++ +W
Sbjct: 27 EGILVDENIF--ETAPFPESHASTIAETPAG-LVAAWFGGTKERNPDVGIWVSRLENDKW 83
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P+ + +E+ WNPVL+++P L+LFYK+G V W G + S + G TWS
Sbjct: 84 TAPVEVANGVVNEKLRYACWNPVLYQIPKGELMLFYKVGPNVAGWKGFLITSKDNGKTWS 143
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
+ L G LGP KNKP+LL NG L C SS E W EVT D G++WRK GP+
Sbjct: 144 KPTALQEGYLGPVKNKPVLLANGELWCPSSTEGEGGWRLHFEVTPDFGKTWRKVGPLN-D 202
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+++ IQP AN +++L RS + + S S DGG TWS P+ L N NSG D
Sbjct: 203 GKTIKAIQPSLLTYANGDMQILCRSQS--RSIVESWSKDGGKTWSEVGPSALPNNNSGTD 260
Query: 243 GVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
V LKDGR LL YN V R L VA+SK DG +W L LE++ ++SYP+
Sbjct: 261 AVTLKDGRQLLVYNHVKPPAGQPKGPRTPLNVAVSK-DGKNWEAVLILEDSPISQYSYPS 319
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLPN 321
VIQ++DG VHI YT+ R +IKHV + P+
Sbjct: 320 VIQSTDGMVHIVYTWRRQRIKHVKIDPS 347
>gi|171913078|ref|ZP_02928548.1| hypothetical protein VspiD_17900 [Verrucomicrobium spinosum DSM
4136]
Length = 1006
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 181/329 (55%), Gaps = 17/329 (5%)
Query: 2 KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
K GL + EF + + APF SCHASTI + G L ++FGG+ EG PDV IW+
Sbjct: 670 KTQAAAAGLTSSEFIY--DKAPFPSCHASTIAQAADGTLLTSWFGGTAEGKPDVGIWVSR 727
Query: 62 FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
G+W P+ + + P WNPVLF+ L LFYK+G Q W G ++ S +
Sbjct: 728 QVGGKWSEPVEVANGLQADGTRHPCWNPVLFQPKQGPLWLFYKVGPSPQTWWGMLRTSTD 787
Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW---NSWGSWMEVTVDAGRSW 173
G TW E +LP GILGP KNKP+ L NG +LC +S ES + W E T D G++W
Sbjct: 788 GGKTWGEAVRLPDGILGPIKNKPVQLANGDILCPTSTESPTKPSKWAVHFERTSDMGKTW 847
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
P E++ IQP L L RS G +V ES D G TW +
Sbjct: 848 TAT-PALHDGEAIRAIQPSVLLAGGDKLIALGRSAQG--KVFEIESNDLGKTWGTIRLGS 904
Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSY 291
L NPNSG D V LKDGR LL YN S+G L +A+SKD G +W AL LE E+SY
Sbjct: 905 LPNPNSGTDAVTLKDGRHLLIYNHTSKGRSPLNLAISKD-GVTWQAALVLESEKG-EYSY 962
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLP 320
PA+IQ DG VH+TYT+ R ++KH V+ P
Sbjct: 963 PAIIQTPDGKVHVTYTWKREKVKHAVIDP 991
>gi|149280002|ref|ZP_01886127.1| hypothetical protein PBAL39_23968 [Pedobacter sp. BAL39]
gi|149229199|gb|EDM34593.1| hypothetical protein PBAL39_23968 [Pedobacter sp. BAL39]
Length = 333
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 185/324 (57%), Gaps = 25/324 (7%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
V EE F APF+ CHASTI E+ + A+FGG EG+ DV IW K+G+W P
Sbjct: 15 VTEEMIF--QDAPFQQCHASTIAEISGARIMAAWFGGKHEGSNDVCIWSSVLKNGKWSEP 72
Query: 71 II-----ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
+DEE P+WNPVLF S L L+YK+G+ ++W G S + G TW+ +
Sbjct: 73 RKLADGKSDEETAYPLWNPVLFMNKSGKLFLYYKMGRSPREWWGMEMSSLDGGKTWTAPK 132
Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE- 184
+LP GILGP KNKP+ L G++L SS E+ W + +E + D G +W + IP +
Sbjct: 133 RLPDGILGPIKNKPVELVGGIILSPSSTETTKRWKAHIERSTDGGETWTR-----IPVDE 187
Query: 185 --SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+ VIQP + +++L RS +G+ V S S D GL+WS T LLNPNSG D
Sbjct: 188 KGAYDVIQPSVMVYPDGKIQLLCRSKHGV--VMESWSSDQGLSWSPLVKTSLLNPNSGTD 245
Query: 243 GVKLKDGRLLLAYNTV------SRGVLKVALSKD-DGDSWHDALTLEENLAM-EFSYPAV 294
GV L G+ L YN G K+AL++ DG W D L LE+ EFSYPA+
Sbjct: 246 GVSLASGKQLFVYNPQIPGNDWFNGRYKLALTESTDGKDWLDQLMLEDGKETDEFSYPAI 305
Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
IQ+ DG +HITYTYNR IK+VVL
Sbjct: 306 IQSQDGKIHITYTYNRKNIKYVVL 329
>gi|325110761|ref|YP_004271829.1| FG-GAP repeat-containing protein [Planctomyces brasiliensis DSM
5305]
gi|324971029|gb|ADY61807.1| FG-GAP repeat protein [Planctomyces brasiliensis DSM 5305]
Length = 1006
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 27/337 (8%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
K+ GLV+E F + +APFK CHASTI E +G + ++FGG+ EG DV IW F
Sbjct: 659 KEPRFPGLVSETFIY--ENAPFKECHASTICETTRG-LVASWFGGTKEGNKDVGIWTSYF 715
Query: 63 KDGRW-QSPIIAD----EEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYN 116
RW ++ + AD E P WNPVLF+ P + LLF+K+G ++W G M SY+
Sbjct: 716 DGQRWSRTSLAADGVQHEGLRYPCWNPVLFQPPGDAPTLLFFKVGPNPREWWGEMMVSYD 775
Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVT--VDA--GR 171
+G T+ +R +LP GI GP + KPILL++G LLCGSS E ++ W E VD
Sbjct: 776 RGRTFRDRVRLPEGIDGPVRCKPILLDDGSTLLCGSSTE-YDGWRVHFESVQLVDGIPNG 834
Query: 172 SWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
+W++ GPI E + IQP A+ L+VL R+ + + S S DGG TWS K
Sbjct: 835 TWKRVGPINDGKE-FNAIQPTFLQHADGRLQVLCRTKEAV--ITSSHSTDGGETWSTMKA 891
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEE 283
L NPNSGI+ V LKDGR L+ YN + RG+L +A+S DDG++W +E
Sbjct: 892 IDLPNPNSGIEAVTLKDGRHLMIYNHLGSGASGWGRRGLLNLAIS-DDGENWRKVAVVER 950
Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
EFSYPA+IQ DG VHITYT+ R ++KH+V+ P
Sbjct: 951 EQRGEFSYPAIIQTEDGKVHITYTWKRQRVKHLVVDP 987
>gi|256420851|ref|YP_003121504.1| alpha-L-rhamnosidase [Chitinophaga pinensis DSM 2588]
gi|256035759|gb|ACU59303.1| alpha-L-rhamnosidase [Chitinophaga pinensis DSM 2588]
Length = 1251
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/327 (43%), Positives = 185/327 (56%), Gaps = 22/327 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
KG+V +E+ F APF HA+TI E G + A+FGG+ E PDV IW+ + W
Sbjct: 919 KGIVEDEYIF--MDAPFPESHAATIAETSDG-LVAAWFGGTKERNPDVGIWVSRKEGNTW 975
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P+ I + V WNPVL+++P L L+YK G +V W G M+ S + G TWS
Sbjct: 976 TAPVEVANGILSDTLRVACWNPVLYQVPGGELHLYYKTGTKVATWVGWMRTSNDGGRTWS 1035
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
+ LP G LGP KNKP+LL+NG LLC SS E + W E T D G++W GPI
Sbjct: 1036 AAKALPEGFLGPVKNKPVLLDNGELLCPSSTEG-SGWKVHFECTPDNGKTWTMRGPIN-D 1093
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ + IQP L++L RS G V S S D G TWS T+L N NSG D
Sbjct: 1094 GKTFNTIQPSILTYGKGKLQILCRSKE--GSVVQSWSTDNGRTWSPMSATELPNNNSGTD 1151
Query: 243 GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
V LKDGR L+ YN V +R L VA+S +DG W AL LE++ ++SYP+
Sbjct: 1152 AVTLKDGRQLIVYNHVKTPKGKSKGARTPLNVAIS-EDGIHWSAALVLEDSPVSQYSYPS 1210
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
VIQ +DG VHI YT+ R +IKHV + P
Sbjct: 1211 VIQTADGYVHIVYTWRRQRIKHVKIDP 1237
>gi|431797036|ref|YP_007223940.1| hypothetical protein Echvi_1665 [Echinicola vietnamensis DSM 17526]
gi|430787801|gb|AGA77930.1| hypothetical protein Echvi_1665 [Echinicola vietnamensis DSM 17526]
Length = 379
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 192/334 (57%), Gaps = 21/334 (6%)
Query: 2 KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
+K+ G+V +EF + AP+ SCHA+TI E +G + A+FGG+ E PDV IW+
Sbjct: 25 QKETWRDGIVVDEFIY--EEAPYPSCHAATIAETPEG-LVAAWFGGTHERNPDVGIWVSH 81
Query: 62 FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
KDG+W + + ++ P WNPVL+++P L L+YK+G +W G + S +
Sbjct: 82 RKDGKWTESVEVANGVINDTLRYPTWNPVLYQVPDGDLQLYYKVGPHPSQWWGMLITSAD 141
Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
G TWS + LP +GP KNKP+LL+NG L+ SS E N W EVT D G +W
Sbjct: 142 GGHTWSAPKALPDSAIGPVKNKPVLLDNGALIAPSSTEG-NGWNIHFEVTPDFGETWENV 200
Query: 177 GPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
GPI + ++ IQP N L++L R+ N + S S D G TWS + + L N
Sbjct: 201 GPIARGEDDINAIQPSVLDHGNGKLQILARTRNRA--IGTSWSEDYGKTWSPMEKSNLPN 258
Query: 237 PNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAM 287
NSG D V L+DGR LL YN V R L V+LS DDG W+ +L LE++
Sbjct: 259 NNSGTDAVTLQDGRHLLVYNHVLPPGEKAKGPRTPLNVSLS-DDGIHWNASLILEDSEIS 317
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
++SYP+VIQ++DG VHI YT+ R +IK+V + P+
Sbjct: 318 QYSYPSVIQSADGMVHIVYTWRRERIKYVKIDPS 351
>gi|375146864|ref|YP_005009305.1| alpha-L-rhamnosidase [Niastella koreensis GR20-10]
gi|361060910|gb|AEV99901.1| alpha-L-rhamnosidase [Niastella koreensis GR20-10]
Length = 1278
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 188/326 (57%), Gaps = 22/326 (6%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G+V +EF F S F HA+TI E KG + A+FGG+ E PDV IW+ + +W
Sbjct: 939 GIVTDEFIFDRTS--FPESHAATIAETPKG-LVTAWFGGTKERNPDVCIWVSRQVNNKWT 995
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
P+ I ++ WNPVL++ P+ LLLFYK+G V W G + RS + GV+W +
Sbjct: 996 EPVNVANGIINDSLRYACWNPVLYQAPNGELLLFYKVGPNVAGWKGWLIRSKDSGVSWGK 1055
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
E LP G LGP KNKP+LL NG L+C SS E + W EVT D G++WR GPI P+
Sbjct: 1056 PEALPEGFLGPIKNKPVLLSNGDLICPSSREG-SGWTVLFEVTSDFGKTWRMVGPIN-PD 1113
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+ ++ IQP + +++L RS V S S D G TW+ + L N NSG D
Sbjct: 1114 KKINAIQPSILTYKDGKMQILCRS--KTSSVVESWSTDNGKTWTPLAESTLPNNNSGTDA 1171
Query: 244 VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
V LKDGR L+ YN V +R L VA+S DG W AL LE++ ++SYP+V
Sbjct: 1172 VTLKDGRQLIVYNHVATPKGAPKGARTPLNVAVS-SDGKLWQAALVLEDSPVSQYSYPSV 1230
Query: 295 IQASDGSVHITYTYNRTQIKHVVLLP 320
IQ++DG VHI YT+ R ++K+V + P
Sbjct: 1231 IQSADGMVHIVYTWRRQRVKYVKVDP 1256
>gi|430746327|ref|YP_007205456.1| neuraminidase [Singulisphaera acidiphila DSM 18658]
gi|430018047|gb|AGA29761.1| putative neuraminidase (sialidase) [Singulisphaera acidiphila DSM
18658]
Length = 358
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 176/318 (55%), Gaps = 15/318 (4%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G V E + SAPF CHASTI E G + ++FGG+ EG DV IW+ + W
Sbjct: 38 GFVKSELIY--ESAPFPKCHASTIAEA-GGRLVASWFGGTDEGNSDVGIWVASHDGKAWT 94
Query: 69 SPIIA----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
+P+ D + P WNPVLF+ LLLFYK+G W G + S + G TWS
Sbjct: 95 TPVEVANGIDGDKRYPCWNPVLFQPKKGPLLLFYKVGPTPANWWGMVTTSDDAGKTWSTP 154
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+LP LGP KNKPI L G LLC SS E + W E T D G W GPI
Sbjct: 155 RRLPDPFLGPIKNKPIQLPEGDLLCPSSSED-HGWRVHFERTADLGYHWELIGPIN-DGV 212
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+ IQP + L+ L R+ +V S S D G TWS T L NPNSGIDGV
Sbjct: 213 TFGAIQPTLLTHPDGRLQALSRTKQ--AKVVESWSSDRGKTWSELAATTLPNPNSGIDGV 270
Query: 245 KLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L DGR LL YN RG L +A+S +DG +W AL LE+ EFSYPAVIQ +DG V
Sbjct: 271 TLADGRQLLVYNHTRRGRSPLNIAVS-EDGHAWRSALILEDQPG-EFSYPAVIQGADGLV 328
Query: 303 HITYTYNRTQIKHVVLLP 320
H+TYT+NRT+I+HVV+ P
Sbjct: 329 HVTYTWNRTRIRHVVVDP 346
>gi|333377826|ref|ZP_08469559.1| hypothetical protein HMPREF9456_01154 [Dysgonomonas mossii DSM
22836]
gi|332883846|gb|EGK04126.1| hypothetical protein HMPREF9456_01154 [Dysgonomonas mossii DSM
22836]
Length = 365
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 190/329 (57%), Gaps = 25/329 (7%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+G++ ++F + +SA F S HA+TI E G + A+FGG EG P+V I+ W
Sbjct: 35 EGILVDQFIY--DSAEFPSVHAATIAETPSG-LITAFFGGKYEGHPEVNIYTSRHTATGW 91
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
P+ I + +NPVLF+ P LLLFYKIG+ VQ W+G + RSYN G TW+
Sbjct: 92 TPPVKVADGIVSDTLRKACYNPVLFQYPDGELLLFYKIGKNVQDWTGYLIRSYNDGQTWT 151
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
+ E LP G LGP KNKPIL+ N L+C SS E+ + W ME T D G++WRK P I
Sbjct: 152 KPEALPEGFLGPIKNKPILIGNK-LICASSTEN-DGWQVHMEFTEDRGKTWRKTSP--IN 207
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ ++IQP N L+++ RS N + + S D G TWS L N NSG D
Sbjct: 208 SKPWNIIQPSILTLNNNVLQIVCRSQN--EHIISAFSKDEGETWSDPIALYLPNNNSGTD 265
Query: 243 GVKLKDGRLLLAYNTV----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
V LKDGR L+ YN V +R L VALS DDG +W +L LE++ E+SYP
Sbjct: 266 AVTLKDGRQLMVYNHVAASESAYKDKARTPLNVALS-DDGITWKASLILEDSPIKEYSYP 324
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+VIQ DG +HI YT+ R +IK+V + P+
Sbjct: 325 SVIQGHDGMIHIVYTWRREKIKYVKVDPS 353
>gi|168701252|ref|ZP_02733529.1| hypothetical protein GobsU_17136 [Gemmata obscuriglobus UQM 2246]
Length = 284
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 121/283 (42%), Positives = 170/283 (60%), Gaps = 5/283 (1%)
Query: 41 LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKI 100
+ +FGG EGA DV+IW TF +W P + EP P WNPVLFK L L+YK
Sbjct: 1 MATWFGGKAEGAKDVEIWASTFDGKKWSEPKVFGTEPGQPCWNPVLFKSAKGTLFLWYKA 60
Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG 160
G + W+G ++ S + G TW++ E +P +GP + KPI L NG +L G+S ES+ +W
Sbjct: 61 GPKPDNWTGYVRTSADNGKTWTKPEMMPSTFMGPVRAKPIQLANGTILAGTSWESYRNWV 120
Query: 161 SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
+++ + D G++W++ P +P E + IQP F + + VLMRS N + VC SES
Sbjct: 121 PFVDRSTDEGKTWKRSNPFPVP-EKFNQIQPALFEAKDGKIVVLMRSRNPL-TVCRSESK 178
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDA 278
DGG T+S A+ T L NP+SGID V+ K+G + L YN S R + +A S DDG +W
Sbjct: 179 DGGETFSPAEETSLANPSSGIDCVRTKEGDVFLIYNPTSVLRTPISLARSTDDGKTWKKV 238
Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+I++S G++ ITYT+ RT IKH + P
Sbjct: 239 ADLETEPG-EFSYPAIIESSAGTLEITYTWKRTHIKHQSVDPK 280
>gi|284036722|ref|YP_003386652.1| BNR/Asp-box repeat domain-containing protein [Spirosoma linguale
DSM 74]
gi|283816015|gb|ADB37853.1| BNR/Asp-box repeat domain protein [Spirosoma linguale DSM 74]
Length = 348
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 192/330 (58%), Gaps = 28/330 (8%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
++ EF + APF CHASTI E +G + A+FGG+ E PDV IW+ + W
Sbjct: 27 AVLKSEFIY--EQAPFPECHASTIAETPQG-LMTAWFGGTRERHPDVGIWVSSATKSGWT 83
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P+ + + +P WNPVLF++P LLLFYK+G W G +KRS + G +WS
Sbjct: 84 TPVEVATGVQADGKRLPCWNPVLFQVPKGELLLFYKVGPSPSTWWGMLKRSTDGGKSWSA 143
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
E+LP GI GP KNKP+LL +G+LLC SS E N W E T D G++W++ I N
Sbjct: 144 AERLPDGIAGPIKNKPVLLPSGVLLCPSSSEDHN-WRVHFEQTSDWGKTWQRTEAI---N 199
Query: 184 ESLS--VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ ++ IQP + FH + L+ L RS G + + S DGG +WS + T L NPNSG
Sbjct: 200 DGVNDGAIQPSILFHPGGQ-LQALCRS-QRTGFIAETWSTDGGKSWSPLQKTTLPNPNSG 257
Query: 241 IDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
D V L DGR +L YN VS R L+VA+S DG +W L + EN E+SY
Sbjct: 258 TDAVTLADGRQVLVYNPVSPTPGKSGGPRTPLEVAIS-SDGKTWK-TLAVLENTPGEYSY 315
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
PAVIQ +DG + ITYT+ R +I+HVV+ P
Sbjct: 316 PAVIQTADGLLQITYTWKRQRIRHVVVDPK 345
>gi|408675471|ref|YP_006875219.1| neuraminidase [Emticicia oligotrophica DSM 17448]
gi|387857095|gb|AFK05192.1| neuraminidase [Emticicia oligotrophica DSM 17448]
Length = 341
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 179/309 (57%), Gaps = 21/309 (6%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR-WQSPI-IAD---EEPNV 79
KSCHASTIVE+ KG + A+F G E P V IW+ T + G+ W I IAD E
Sbjct: 36 KSCHASTIVELSKGKLMAAWFAGEHENHPQVCIWVATQERGKTWSKAIKIADGIYEGKQY 95
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
WNPVLFK S L L+YK+G ++W G +K S N+G TWS +LP ILGP +NK
Sbjct: 96 ACWNPVLFKNSSGKLFLYYKVGINPREWWGMVKTSDNEGKTWSSPSRLPNDILGPIRNKS 155
Query: 140 ILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTA 197
I L+NG +L SS ES W S ME++ G++W+K I I VIQP +
Sbjct: 156 IQLKNGSVLHPSSTESIRGDIWHSHMEISDANGKNWKK---IDIDCGEYGVIQPTILNHK 212
Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
+ +++LMRS ++ S S D G TWS P L NPNSGID V LK+G LL YN
Sbjct: 213 DGKIQILMRSRQ--NKIIQSWSEDNGQTWSKLSPINLPNPNSGIDAVTLKNGSFLLVYNP 270
Query: 258 V--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
+ R VLKVA S DG++W D TLE EFSYPA+IQ DG VHITYTY+
Sbjct: 271 LPSGKEWSNGRNVLKVARSI-DGNNWEDIYTLENQEKGEFSYPAIIQTGDGLVHITYTYD 329
Query: 310 RTQIKHVVL 318
R IK VVL
Sbjct: 330 RKLIKTVVL 338
>gi|196232131|ref|ZP_03130986.1| Glycosyl hydrolase family 32 domain protein [Chthoniobacter flavus
Ellin428]
gi|196223853|gb|EDY18368.1| Glycosyl hydrolase family 32 domain protein [Chthoniobacter flavus
Ellin428]
Length = 373
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 142/325 (43%), Positives = 181/325 (55%), Gaps = 22/325 (6%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G V EF F SAPF SCHASTI + G + A+FGG+ EGAPDV IWL G+W
Sbjct: 29 GYVHSEFIF--ESAPFPSCHASTIAQTASG-LVSAWFGGTREGAPDVGIWLSRHIGGQWS 85
Query: 69 SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P+ + + +NPVLF+ LLLFYK G WSG +K S + G WS
Sbjct: 86 APVEVANGLRSDGRRFACYNPVLFQPAKGPLLLFYKAGGNPNGWSGFLKTSLDDGADWSA 145
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSW---MEVTVDAGRSWRKYGPIY 180
+ LP G +GP KNKP+ L NG+LLCG+S+E+ W +E T D G++W K
Sbjct: 146 AKPLPDGFVGPVKNKPVQLANGMLLCGASIETPEKPSRWRVQIERTPDLGKTWEK---TE 202
Query: 181 IPNESL--SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
N+ L S IQP L + R+ + + S D G TW+ T L NPN
Sbjct: 203 FLNDGLEISAIQPSILFLGGDKLLAIGRTRQ--AHLFRTTSADDGQTWNELSLTDLPNPN 260
Query: 239 SGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
SG D V L+D R +L YN G L VA+S+ DG +W AL L EN E+SYPAVIQ
Sbjct: 261 SGTDAVTLRDHRHVLVYNHTDHGRSPLNVAVSR-DGRTWQAALVL-ENEPGEYSYPAVIQ 318
Query: 297 ASDGSVHITYTYNRTQIKHVVLLPN 321
+DG VHITYT+ R +IKHVV+ P
Sbjct: 319 TADGLVHITYTWKRHRIKHVVVDPT 343
>gi|311748214|ref|ZP_07721999.1| putative neuraminidase [Algoriphagus sp. PR1]
gi|126576704|gb|EAZ80952.1| putative neuraminidase [Algoriphagus sp. PR1]
Length = 336
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 185/315 (58%), Gaps = 14/315 (4%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EF + +APF SCHAST+VE G + A+FGG+ E PDV I+ DG W +
Sbjct: 27 ILKSEFIY--ENAPFPSCHASTLVETPDG-IMAAWFGGTYERHPDVSIYTALLSDGSWST 83
Query: 70 P-IIADEEPNV----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P ++AD N P WNPVL++ P+ L+LFYK G ++W G K S + G TWS+
Sbjct: 84 PKMVADGVENKDFRNPTWNPVLYRNPNGQLVLFYKEGPNPREWWGLYKTSDDGGKTWSKA 143
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
Q+PPG+LGP KNK + L +G LL SS E+ W +E T + W+K I I N
Sbjct: 144 IQIPPGMLGPVKNKSVTLADGTLLHPSSFETNGVWSMHVETTTSDIQDWKK---IAIDNG 200
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+ IQP N L++L R+ + + + S DGG TWS T L++ NSGID V
Sbjct: 201 AFHAIQPTVLTYPNGKLQMLARTQEHV--IGTTWSTDGGKTWSPVSSTGLVHNNSGIDAV 258
Query: 245 KLKDGRLLLAYNTVSRGVLKVAL-SKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
LK+G LL N + G K++L +DG +W + +E+ EFSYPA+IQA DG+VH
Sbjct: 259 TLKNGVQLLLCNPIKEGRNKLSLMMSEDGVNWEEIHVMEDQPEGEFSYPAIIQAEDGTVH 318
Query: 304 ITYTYNRTQIKHVVL 318
+TYTYNR +IK+V L
Sbjct: 319 MTYTYNREKIKYVSL 333
>gi|374376460|ref|ZP_09634118.1| BNR repeat-containing glycosyl hydrolase [Niabella soli DSM 19437]
gi|373233300|gb|EHP53095.1| BNR repeat-containing glycosyl hydrolase [Niabella soli DSM 19437]
Length = 374
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 188/330 (56%), Gaps = 25/330 (7%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+G++ +EF + ++A F HASTI E G + A+FGG+ EG DV IW + +W
Sbjct: 29 QGILVDEFIY--DTASFPQAHASTIAETPDG-LIAAWFGGTKEGNKDVCIWTSRLVNNKW 85
Query: 68 QSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P ++AD + +NPVL+ P+ LLLFYKIG V W+G MKRS + G TWS
Sbjct: 86 TAPQLVADGILNDSTRYACYNPVLYYAPTGELLLFYKIGPNVAGWTGWMKRSKDNGKTWS 145
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
RE LP G LGP KNKP L+ NG+L+C SS E W E T D G +W K PI
Sbjct: 146 PREALPEGYLGPIKNKPELI-NGVLVCPSSTEV-GGWKVHFEYTKDWGHTWTKSAPIN-D 202
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ S IQP N L+VL R+ N + + S S D G TWS L N NSG D
Sbjct: 203 GKTFSAIQPSILKYKNGALQVLGRTRNTV--IYESWSKDHGKTWSPMTALDLPNNNSGTD 260
Query: 243 GVKLKDGRLLLAYNTV-----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
V LKDGR LL YN V R L VA+SK DG +W+ AL LE++ ++SY
Sbjct: 261 AVTLKDGRQLLVYNHVLPDSTWVKGKGPRTPLNVAVSK-DGKNWYAALVLEDSPISQYSY 319
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
P+VIQ DG VHI YT+ R +IK+V + P+
Sbjct: 320 PSVIQGKDGMVHIVYTWRRQRIKYVKIDPS 349
>gi|393721557|ref|ZP_10341484.1| hypothetical protein SechA1_17504 [Sphingomonas echinoides ATCC
14820]
Length = 365
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 181/331 (54%), Gaps = 26/331 (7%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ EF PA P+ HASTIVE+ G A+FGG+ E PDV IW W
Sbjct: 36 ILLSEFIDPA--PPYPQAHASTIVELPDGTLAAAWFGGTGESRPDVTIWFARRGAHGWDR 93
Query: 70 PI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P+ +AD + P WNPVLF+ P L LFYK+G + W G +K S + G +WS
Sbjct: 94 PVAVADGQSADRTRYPTWNPVLFQPPGAPLTLFYKVGPNPRGWWGMVKISSDGGRSWSAA 153
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVE----SWNSWGSWMEVTVDAGRSWRKYGPIY 180
++LP G+LGP KNKP++L +G L SS E N W ME + D GRSW PI
Sbjct: 154 QRLPEGVLGPIKNKPVVLADGSWLSPSSREEGTPEHNRWCIRMERSTDQGRSWTVEPPIA 213
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
P + IQP H A+ L V+ R+ G + MS S D G +WS L NPN+G
Sbjct: 214 SP-MGIEAIQPSVLHFADGRLAVVARTRQGA--LAMSWSRDTGKSWSPLAAIDLPNPNAG 270
Query: 241 IDGVKLKDGRLLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEEN-LAMEF 289
D V L DGR L+ YN + R L + LS+D GD W + LTLE+ L +
Sbjct: 271 TDAVTLADGRQLIVYNHSAHWPDHPGEGPRWPLNIGLSRDGGD-WRNVLTLEDKPLPDGY 329
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
+YPAVIQ DG VHI+YTYNRT I+HVV+ P
Sbjct: 330 AYPAVIQTRDGLVHISYTYNRTHIRHVVVDP 360
>gi|406831122|ref|ZP_11090716.1| BNR/Asp-box repeat domain-containing protein [Schlesneria
paludicola DSM 18645]
Length = 355
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 183/320 (57%), Gaps = 20/320 (6%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
L+ EF + +APF SCHASTI E +G + A+FGG+ E PDV IWL G W S
Sbjct: 32 LLKREFVY--ETAPFPSCHASTIAETKEG-LVAAWFGGTKESDPDVGIWLSRHVAGAWTS 88
Query: 70 PI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P+ + P WNPVLF+ + L+LFYK+G KW G + S + G +WSE
Sbjct: 89 PVEVANGVQSPTKREPCWNPVLFQPQTGPLMLFYKVGPNPDKWWGMLMVSDDSGKSWSEP 148
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+LP GI+GP KNKP+ L NG +L SS E + W E + D G++W + N+
Sbjct: 149 RRLPDGIVGPIKNKPVQLANGDILSPSSSE-YAKWRVHFERSADLGKTWTATAAV---ND 204
Query: 185 SLSV--IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+++ IQP + L+ + R+ G ++ S + G TWS K L NPNSG D
Sbjct: 205 GIAIAAIQPSILTYRDGHLQAIGRTRQG--QIFDLTSSNQGATWSEMKLISLPNPNSGTD 262
Query: 243 GVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
V L DGR LL YN +SRG L VA+S DDG +W AL LE EFSYPAVIQ +DG
Sbjct: 263 AVTLADGRQLLIYNHISRGRSPLNVAVS-DDGYNWFAALVLESEPG-EFSYPAVIQTADG 320
Query: 301 SVHITYTYNRTQIKHVVLLP 320
VH YT+NR +I+HVV+ P
Sbjct: 321 LVHFAYTWNRKRIRHVVIDP 340
>gi|146301814|ref|YP_001196405.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium johnsoniae
UW101]
gi|146156232|gb|ABQ07086.1| hypothetical lipoprotein [Flavobacterium johnsoniae UW101]
Length = 381
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 24/338 (7%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
++K +G++ +EF + + AP+ SCHA TIVE G + ++FGG+ E PDV I++
Sbjct: 23 LEKKTWKEGILVDEFIY--DKAPYPSCHAVTIVEATNGDLVASWFGGTHERHPDVCIYVA 80
Query: 61 TFKDG--RWQSPI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
G +W + +AD E P +P WNPVL+++P L+LFYKIG + +W G ++
Sbjct: 81 IKPKGSDKWGEGVKVADGVMKEGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVIRT 140
Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E + W ME T D G++W
Sbjct: 141 SSDGGKTWSEAQKMPDGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-SCDGGLTWSYAKPT 232
+ + ++ IQP + +++ + R+ N R S S D G TWS +
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGSIQAIGRTRN---RAIFSTFSKDNGKTWSDVELI 256
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
L N NSG D V L+DGR LL YN V R L V++SK DG W+ AL LE+
Sbjct: 257 GLPNNNSGTDAVTLRDGRHLLVYNHVLPPGKEAKGPRTPLNVSVSK-DGIHWNAALVLED 315
Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+ ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSMIQSSDGMVHIVYTWRREKLKYVKVDPS 353
>gi|399031119|ref|ZP_10731258.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
gi|398070588|gb|EJL61880.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
Length = 381
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 195/338 (57%), Gaps = 24/338 (7%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
++K +G++ +EF + + AP+ SCHA TIVE G + A+FGG+ E PDV I++
Sbjct: 23 LEKKTWKEGILVDEFIY--DKAPYPSCHAVTIVEATNGDLVSAWFGGTAERNPDVCIYVS 80
Query: 61 TFKDG--RWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
G +W + I + P +P WNPVL+++P L+LFYKIG + +W G +
Sbjct: 81 VKPKGSDKWGEGVQVADGIMKDGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVTRT 140
Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E + W ME T D G++W
Sbjct: 141 SSDGGKTWSEAKKMPEGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC-DGGLTWSYAKPT 232
+ + ++ IQP + T++ + R+ N R S S D G TWS +
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGTIQAIGRTRN---RAIFSTSSKDNGKTWSNVELI 256
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
L N NSG D V LK+G+ LL YN V R L +++S DDG +W L LE+
Sbjct: 257 GLPNNNSGTDAVTLKNGKHLLVYNHVLPPGKEAKGPRTPLNLSIS-DDGINWKAVLILED 315
Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+ ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSIIQSSDGMVHIVYTWRREKLKYVKIDPS 353
>gi|395803588|ref|ZP_10482832.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
gi|395434142|gb|EJG00092.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
Length = 381
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 135/338 (39%), Positives = 198/338 (58%), Gaps = 24/338 (7%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
++K +G++ ++F + + AP+ SCHA TIVE G + ++FGG+ E PDV I++
Sbjct: 23 LEKKTWKEGILVDQFVY--DKAPYPSCHAITIVEATNGDLVASWFGGTHERNPDVCIYVA 80
Query: 61 TFKDGR--WQSPI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
G W I +AD + P +P WNPVL+++P L+LFYKIG + +W G ++
Sbjct: 81 IKPKGSDTWNEGIKVADGVMKDGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVIRT 140
Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E + W ME T D G++W
Sbjct: 141 SSDGGKTWSEAKKMPDGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-SCDGGLTWSYAKPT 232
+ + ++ IQP + +++ + R+ N R S S D G TWS +
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGSIQAIGRTRN---RAIFSTFSKDNGKTWSNVELI 256
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
L N NSG D V LKDGR LL YN V R L V++SK DG W+ AL LE+
Sbjct: 257 GLPNNNSGTDAVTLKDGRHLLVYNHVLPPGKEAKGPRTPLNVSVSK-DGIHWNAALVLED 315
Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+ ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSIIQSSDGMVHIVYTWRREKLKYVKVDPS 353
>gi|289664888|ref|ZP_06486469.1| BNR/Asp-box repeat domain protein [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 367
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 180/322 (55%), Gaps = 24/322 (7%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E GH L A+FGG+ EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHLLAAWFGGAHEGAADVGIWLSQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++LP
Sbjct: 100 ACAGIDGAAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRIVSTDDGAHWSPTQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ N++ +
Sbjct: 160 GILGPIKNKPVQLPNGRILSPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVID 215
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
IQP + L+ L RS ++ + S DGGL W + L NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSSDGGLHWQPMQLLDLQNPNSGTDAVMLR 273
Query: 248 DGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
DGR LL YN RG L VA+S +DG W LTLE + EFSYPAVIQ D
Sbjct: 274 DGRALLVYNPGIAGKDWWEGRGTLAVAVS-NDGTHWQRVLTLENSEKDEFSYPAVIQTRD 332
Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
G VHI+YT+ R IKHVVL P
Sbjct: 333 GLVHISYTWKRRHIKHVVLDPT 354
>gi|87307124|ref|ZP_01089270.1| hypothetical protein DSM3645_01685 [Blastopirellula marina DSM
3645]
gi|87290497|gb|EAQ82385.1| hypothetical protein DSM3645_01685 [Blastopirellula marina DSM
3645]
Length = 372
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 187/330 (56%), Gaps = 26/330 (7%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
G+V+ E+ + AP SCHASTI G F+ A+FGG+ EGA DV I+ DG+W
Sbjct: 36 GVVSVEYL--TDGAPHDSCHASTIAHTPAG-FVAAWFGGTREGAKDVGIYFNRQVDGKWI 92
Query: 69 SPI-IAD---------EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
+P+ +A+ +E P WNPVL + L+LFYK G W G + S + G
Sbjct: 93 TPVEVANGVQYVSPTGKEHRHPCWNPVLHQAQEGPLMLFYKCGPTPSTWWGMLMTSDDSG 152
Query: 119 VTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGP 178
VTWSE ++LP GI GP KNK I +G L+C SS E+ + W +E+T D GR+W + GP
Sbjct: 153 VTWSEPQRLPEGIDGPVKNKAI-EHDGRLVCPSSSEN-HGWRLHLEITSDQGRAWTRVGP 210
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + IQP N ++L R N G V + S D G+TW+ T L NPN
Sbjct: 211 LN-DGTKIGAIQPSVLVYPNGRWQILARDRNNNGNVWSTWSDDQGITWTPLVSTGLPNPN 269
Query: 239 SGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
SG D V L DGR LL YN R +L VA+S+ DG +W AL LE++ A EFS
Sbjct: 270 SGTDAVMLADGRALLVYNHTHRSGPFPSGRNMLNVAVSQ-DGRTWEAALVLEKS-AGEFS 327
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
YPAVIQ DG VHITYT+ R +I+HV + P
Sbjct: 328 YPAVIQTPDGLVHITYTWRRKKIRHVTVDP 357
>gi|189460070|ref|ZP_03008855.1| hypothetical protein BACCOP_00706 [Bacteroides coprocola DSM 17136]
gi|189433231|gb|EDV02216.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides coprocola
DSM 17136]
Length = 1267
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 190/358 (53%), Gaps = 54/358 (15%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGR 66
++ EF + APF CHASTIVE+D G + A+FGG+ E PDV IW+ ++
Sbjct: 911 AVIQNEFLY--EKAPFPECHASTIVELDNGDLVTAFFGGTKERNPDVCIWVCRKSHDSNT 968
Query: 67 WQSPIIADE------EPNVPM--------------WNPVLFKLPSNGLLLFYKIGQEVQK 106
W +PI A + +P+ + WNPVLF++P LLLF+KIG V
Sbjct: 969 WTAPIKAADGVFDLDDPDATIAGVTADIKDHRKACWNPVLFQVPGGDLLLFFKIGLNVPD 1028
Query: 107 WSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVT 166
W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E W ME +
Sbjct: 1029 WTGWLVRSKDGGKTWSKREPLPKGFLGPIKNKPEFI-NGRIICPSSTEGNAGWRIHMEYS 1087
Query: 167 VDAGRSWRKYGPIYIPNESLS----------------VIQPVPFHTANRTLRVLMRSFNG 210
D G++W+ GPI E LS VIQP + TL+V+ R+ N
Sbjct: 1088 DDMGKTWKTTGPIDAEQEFLSQHQGIGADDSLKRPIQVIQPSILKHKDGTLQVICRTRN- 1146
Query: 211 IGRVCMSESCDGGLTWSYAK-PTQLLNPNSGIDGVKLKDGRLLLAYNT---------VSR 260
+ S S D G TWS +L N NSG D V LKDGR +L YN R
Sbjct: 1147 -SHLATSWSKDNGTTWSKVTLIDELPNNNSGTDAVTLKDGRHVLVYNNSKPQLRAKKAVR 1205
Query: 261 GVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L +ALS +DG W LTLE++ E+SYPA+IQ DG +HIT+T+ R IK++ +
Sbjct: 1206 TPLNLALS-EDGIHWKPVLTLEDSPIREYSYPAIIQGKDGKLHITFTWRRELIKYMKI 1262
>gi|261406844|ref|YP_003243085.1| hypothetical protein GYMC10_3012 [Paenibacillus sp. Y412MC10]
gi|261283307|gb|ACX65278.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
Length = 327
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 191/326 (58%), Gaps = 26/326 (7%)
Query: 12 AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI 71
A+EF F + PF SCHAST+V + G L A+FGGS EGAPDV IW DG W +P+
Sbjct: 7 AKEFIFEEDR-PFASCHASTLVVLPDGDVLAAWFGGSREGAPDVAIWTARRTDGGWSAPV 65
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+E +P WNPVL+ P LLLFYK+G V +W + S + G +WSE ++L PG
Sbjct: 66 KVADEEGLPHWNPVLYLRPDGKLLLFYKVGPRVAEWHTRIIHSDDNGFSWSEPKELVPGD 125
Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
+ GP KNKPI+L NG LL +S+E +W ++++++ D G +W + + + + L
Sbjct: 126 IGGRGPVKNKPIMLRNGTLLAPNSLEP--AWDAFIDISSDHGDTWTQMAIVPLDHGKLKL 183
Query: 188 --VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+IQP + + + ++ +L RS G + S+S DGG TW A PT + N NSG D +
Sbjct: 184 KGIIQPTLWESEDGSVHLLARSTEGA--MYRSDSQDGGFTWCEAYPTDMPNNNSGFDLAR 241
Query: 246 LKDGRLLLAYNTV-----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
L DG L +AYN R L + LS+DDG +W + L L+ ++ ++SYPAV
Sbjct: 242 LSDGTLAMAYNPTVPREDDPKGKGPRTPLVLRLSRDDGATWGEELPLDSGIS-QYSYPAV 300
Query: 295 IQASDGSVHITYTYNRTQIK--HVVL 318
+ A +++I+YT+ R +I H+VL
Sbjct: 301 V-AHGNNIYISYTWRRERIAFYHIVL 325
>gi|325914142|ref|ZP_08176495.1| putative neuraminidase (sialidase) [Xanthomonas vesicatoria ATCC
35937]
gi|325539645|gb|EGD11288.1| putative neuraminidase (sialidase) [Xanthomonas vesicatoria ATCC
35937]
Length = 381
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ +P HAST++E G L A+FGG+ EGA DV IWL RW+ P IAD
Sbjct: 36 FIADPSPTPQAHASTLLETRNGRLLAAWFGGAHEGAADVGIWLTERDAQRWRPPRRIADG 95
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS +LP
Sbjct: 96 TRATTDGATVPAWNPVLFQSATGSVQLYYKLGPNPRQWWGLRITSADDGAHWSVPRRLPD 155
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L +G +L SS E W + +E + D G W++ P+ P+ + I
Sbjct: 156 GILGPIKNKPVQLPDGRILAPSSSED-RGWRAHLEWSDDDGEHWQRGMPLNDPS-VIGAI 213
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W + + NPNSG D V L+DG
Sbjct: 214 QPSLLLHGDGRLQALGRSQQ--NKLFSTFSYDGGLHWQPMQLIDVENPNSGTDAVMLRDG 271
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VALS +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 272 RALLVYNPAIAGKDWWDGRGTLAVALS-NDGVHWQRVLTLEDSADDEFSYPAVIQTRDGL 330
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 331 VHVSYTWKRRRIKHVVLDPT 350
>gi|188990640|ref|YP_001902650.1| exo-alpha-sialidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732400|emb|CAP50594.1| exported exo-alpha-sialidase [Xanthomonas campestris pv.
campestris]
Length = 355
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 181/322 (56%), Gaps = 26/322 (8%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG+ EGA DV IWL WQ+P IAD
Sbjct: 36 FIADQAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ +P WNPVLF+ + + L+YK+G + W G S + GV WS ++LP
Sbjct: 96 AHAGVDGAALPAWNPVLFQSATGPVQLYYKLGPNPRDWWGLRITSTDDGVHWSPPQRLPN 155
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ ++ I
Sbjct: 156 GILGPIKNKPVQLPNGRILAPSSSED-RGWRTHLEWSDDDGAHWQRGLPLN-DAATIGAI 213
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVKL 246
QP A+ L+ L RS ++ + S DGGL W +P QLL NPNSG D V L
Sbjct: 214 QPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAVML 268
Query: 247 KDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
DGR LL YN RG L VA+S DDG W LTLE++ EFSYPAVIQ
Sbjct: 269 HDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQTR 327
Query: 299 DGSVHITYTYNRTQIKHVVLLP 320
DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 DGLVHISYTWKRQRIKHVVIDP 349
>gi|390989306|ref|ZP_10259605.1| BNR/Asp-box repeat family protein [Xanthomonas axonopodis pv.
punicae str. LMG 859]
gi|372556064|emb|CCF66580.1| BNR/Asp-box repeat family protein [Xanthomonas axonopodis pv.
punicae str. LMG 859]
Length = 386
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 179/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP-IIAD- 74
F A+ AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHMLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRPIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W ALTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRALTLEDSAGDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|384428900|ref|YP_005638260.1| BNR-Asp-box repeat domain protein [Xanthomonas campestris pv.
raphani 756C]
gi|341938003|gb|AEL08142.1| BNR-Asp-box repeat domain protein [Xanthomonas campestris pv.
raphani 756C]
Length = 355
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 183/324 (56%), Gaps = 30/324 (9%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F AN AP HAST++E G L A+FGG+ EGA DV IWL WQ+P IAD
Sbjct: 36 FIANPAPTAQAHASTLLETRDGTLLAAWFGGTHEGAADVGIWLAQRGPQHWQTPRRIADG 95
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ +P WNPVLF+ + L+YK+G + W G S + G WS ++LP
Sbjct: 96 AHTGLDGAALPAWNPVLFQPRYGPVQLYYKLGPSPRDWWGLRITSTDDGAQWSPPQRLPN 155
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV- 188
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ N++ ++
Sbjct: 156 GILGPIKNKPLQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGLPL---NDAATIG 211
Query: 189 -IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGV 244
IQP A+ L+ L RS ++ + S DGGL W +P QLL NPNSG D V
Sbjct: 212 AIQPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAV 266
Query: 245 KLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
L+DGR LL YN RG L VA+S DDG W LTLE++ EFSYPAVIQ
Sbjct: 267 MLRDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQ 325
Query: 297 ASDGSVHITYTYNRTQIKHVVLLP 320
DG VHI+YT+ R +IKHVV+ P
Sbjct: 326 TRDGLVHISYTWKRQRIKHVVIDP 349
>gi|412988193|emb|CCO17529.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
Length = 436
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 184/325 (56%), Gaps = 26/325 (8%)
Query: 20 NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNV 79
+ P S HAS+I E+ G FLV++F G E PDV I+ +++G+W +P + V
Sbjct: 81 DHKPKPSHHASSICEISSGQFLVSWFAGQFESKPDVGIYTARYQNGKWSAPELVVHPKGV 140
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
P WNPVLFK + L LFYK G+ W G +S + G TWS +P GI+GP+KNKP
Sbjct: 141 PTWNPVLFKRSNGDLFLFYKHGRTPANWKGYYVKSRDGGNTWSSSVAMPNGIVGPAKNKP 200
Query: 140 ILLENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNE------SLSVIQ 190
+ L++G +L SS E + + G E+ V+A ++ Y + + VIQ
Sbjct: 201 LELDDGTILLPSSRE-FGARGKDWEIVVEAMTMEDEFKTYAKFAVSEKGGGASVQQRVIQ 259
Query: 191 PVPFHT------ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
PV + A +R++ RS + + +S G W+ A +Q+ PN+G D V
Sbjct: 260 PVFWKQSMLAGDAANGIRMMARSRSSYAVAAVGDST--GRIWTDAVFSQIPCPNTGFDVV 317
Query: 245 KLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLA--MEFSYPAVIQ 296
KL+DGRLL+ YN RGVL +A+S DDGD+W + LE++ +EFSYPA+IQ
Sbjct: 318 KLRDGRLLIVYNHSFKHAIAGRGVLALAVSHDDGDTWERVMRLEDSGGRLLEFSYPAIIQ 377
Query: 297 ASDGSVHITYTYNRTQIKHVVLLPN 321
ASDG VH+TYT+ R IKHV++ P+
Sbjct: 378 ASDGMVHVTYTWRRANIKHVIIDPS 402
>gi|188577373|ref|YP_001914302.1| BNR/Asp-box repeat domain-containing protein [Xanthomonas oryzae
pv. oryzae PXO99A]
gi|188521825|gb|ACD59770.1| BNR/Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 350
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 179/321 (55%), Gaps = 24/321 (7%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG+ EGA DV IWL +WQ+P +AD
Sbjct: 23 FIADPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRHLADG 82
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + L L+YK+G ++W G S + G WS ++LP
Sbjct: 83 AHAGVDGAAVPAWNPVLFQSATGPLQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 142
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ N++ +
Sbjct: 143 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVIG 198
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
IQP + L+ L RS ++ + S DGGL W + NPNSG D V L+
Sbjct: 199 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 256
Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
DGR LL YN RG L VA+S +DG W LTLE + EFSYPAVIQ D
Sbjct: 257 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 315
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G VHI+YT+ R +IKHVVL P
Sbjct: 316 GLVHISYTWKRQRIKHVVLDP 336
>gi|58581386|ref|YP_200402.1| hypothetical protein XOO1763 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84623320|ref|YP_450692.1| hypothetical protein XOO_1663 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|58425980|gb|AAW75017.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|84367260|dbj|BAE68418.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 367
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 179/322 (55%), Gaps = 24/322 (7%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG+ EGA DV IWL +WQ+P +AD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRHLADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + L L+YK+G ++W G S + G WS ++LP
Sbjct: 100 AHAGVDGAAVPAWNPVLFQSATGPLQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ N++ +
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVIG 215
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
IQP + L+ L RS ++ + S DGGL W + NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 273
Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
DGR LL YN RG L VA+S +DG W LTLE + EFSYPAVIQ D
Sbjct: 274 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 332
Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
G VHI+YT+ R +IKHVVL P
Sbjct: 333 GLVHISYTWKRQRIKHVVLDPT 354
>gi|399031783|ref|ZP_10731638.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
gi|398069838|gb|EJL61168.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
Length = 351
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 176/317 (55%), Gaps = 22/317 (6%)
Query: 16 TFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI---- 71
++ +N + HA+T+VE+ G + A+FGG EGA DV I+L T+K+ W SP
Sbjct: 35 SYISNEPITNNSHAATLVEIKPGQLMAAWFGGKHEGAKDVGIYLSTYKEKTWSSPQNIIK 94
Query: 72 -IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
+ + +P WNPVLFK S L LFYK+G+ ++W G M S + G TWS + LP G
Sbjct: 95 PLIKQGDTLPCWNPVLFKSKSKKLYLFYKVGKNPREWFGAMITSNDDGTTWSNPKYLPEG 154
Query: 131 ILGPSKNKPILLENGLLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIP-NESLS 187
ILGP KNKPI G +LCGSS ES N W S +E +A W K I I N++
Sbjct: 155 ILGPIKNKPIETTQGTILCGSSTESIENNQWRSHIETYTEATDKWTK---INIENNKNFD 211
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+IQP +++ +++L RS + ++ S S D G W++ ++N NSG+D + L
Sbjct: 212 IIQPTFLVHSDKIIQMLCRSKH--NKLITSWSKDNGQNWNHTDTLNVVNSNSGVDALTLS 269
Query: 248 DGRLLLAYNTVSRG--------VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
LL N + +G +L V SK DG W LE EFSYPA+IQASD
Sbjct: 270 KKSFLLVNNPLKQGKDWFNGRNILDVEYSK-DGLYWTKLFDLENQKEGEFSYPAIIQASD 328
Query: 300 GSVHITYTYNRTQIKHV 316
G +H+ YTYNR IKHV
Sbjct: 329 GKIHVLYTYNRKYIKHV 345
>gi|418519411|ref|ZP_13085463.1| hypothetical protein WS7_00040 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410704855|gb|EKQ63334.1| hypothetical protein WS7_00040 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 386
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|294665214|ref|ZP_06730512.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292605002|gb|EFF48355.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 386
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 179/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLARRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLRDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSARDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +I+HVVL P
Sbjct: 335 VHVSYTWKRRRIRHVVLDPT 354
>gi|78048775|ref|YP_364950.1| neuraminidase [Xanthomonas campestris pv. vesicatoria str. 85-10]
gi|78037205|emb|CAJ24950.1| putative neuraminidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 386
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADRAPTPQAHASTLLETRDGRLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++LP
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|346725888|ref|YP_004852557.1| neuraminidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650635|gb|AEO43259.1| neuraminidase (sialidase) [Xanthomonas axonopodis pv. citrumelo F1]
Length = 386
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGRLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++LP
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPTPRQWWGLRLLSTDDGAHWSPPQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|380512551|ref|ZP_09855958.1| sialidase/neuraminidase [Xanthomonas sacchari NCPPB 4393]
Length = 362
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 171/313 (54%), Gaps = 19/313 (6%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD-EEPN- 78
AP CHAST+VE G L A+FGG EGA DV IW+ WQ + +AD +P
Sbjct: 44 APTAQCHASTLVETRDG-LLAAWFGGRHEGADDVAIWVARRDAHGWQPAQRVADGAQPQG 102
Query: 79 --VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK 136
+P WNPVLF+ L LFYK+G + ++W G S + G WS E+LP GILGP K
Sbjct: 103 KPLPTWNPVLFQPAQGPLRLFYKVGPDPKRWWGMQTTSSDAGHHWSTPERLPDGILGPIK 162
Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
NKP+ L +G +L SS E W + ME + D G W + GP + IQP
Sbjct: 163 NKPVQLPSGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPAMNDPARIGAIQPSVLLH 220
Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN 256
+ L+ + RS V + S D G TW L NPNSG D V L DGR LL YN
Sbjct: 221 KDGRLQAVGRSQQ--NHVFSTWSRDNGRTWEPMTLLDLANPNSGTDAVVLADGRSLLVYN 278
Query: 257 TVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
RGVL VALS DDG W LTLE++ EFSYPAVIQ DG VHI+YT+
Sbjct: 279 PTEAGKDWWDGRGVLAVALS-DDGQHWTRVLTLEDSPKDEFSYPAVIQTRDGKVHISYTW 337
Query: 309 NRTQIKHVVLLPN 321
RT+IKHVVL P
Sbjct: 338 KRTRIKHVVLDPK 350
>gi|21243816|ref|NP_643398.1| hypothetical protein XAC3089 [Xanthomonas axonopodis pv. citri str.
306]
gi|21109411|gb|AAM37934.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 386
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMYLLEVENPNSGTDAVVLHDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|294627334|ref|ZP_06705920.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598416|gb|EFF42567.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 387
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 179/319 (56%), Gaps = 20/319 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLARRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLRDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSARDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLP 320
VH++YT+ R +I+HVVL P
Sbjct: 335 VHVSYTWKRRRIRHVVLDP 353
>gi|381169762|ref|ZP_09878925.1| BNR/Asp-box repeat family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689780|emb|CCG35412.1| BNR/Asp-box repeat family protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 386
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 177/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A AP HAST++E GH L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIAAPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++L
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|295135992|ref|YP_003586668.1| neuraminidase [Zunongwangia profunda SM-A87]
gi|294984007|gb|ADF54472.1| putative neuraminidase [Zunongwangia profunda SM-A87]
Length = 345
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 174/317 (54%), Gaps = 21/317 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI----- 71
F A F S HASTI E++ G + A+FGG+ E PDV I+ W SP
Sbjct: 31 FIYTKASFPSAHASTIEELENGDIIAAWFGGTHERHPDVSIYTSIKTVSGWSSPKKVATG 90
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ + P WNPVLFK +N L LFYKIG W G K S + G TWSE++ LP I
Sbjct: 91 YQSDTLSYPTWNPVLFKT-NNKLFLFYKIGPSPSTWWGAYKISTDHGKTWSEKKMLPEDI 149
Query: 132 LGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
LGP KNKPI L+NG ++ SS E + W + ME++ D G++WRK P+ N + VI
Sbjct: 150 LGPIKNKPIQLKNGRIVSPSSEELQDGEIWKAHMEISDDYGKTWRKV-PVN-ENSAYKVI 207
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP N L+ RS V S S D G TWS ++L NPNSGID V L +G
Sbjct: 208 QPSIVQLQNGNLKAFFRS--DQDAVLESTSIDNGETWSDFSKSELANPNSGIDAVSLING 265
Query: 250 RLLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
L+ YN ++ R L +A SK DG+ W D L LE E+SYPA+IQ +G
Sbjct: 266 GFLMVYNPMASGDNWWEGRSKLYLAYSK-DGEHWQDILKLENQEKGEYSYPAIIQTENGE 324
Query: 302 VHITYTYNRTQIKHVVL 318
++ITYTY+R +IK+ L
Sbjct: 325 IYITYTYDREKIKYFRL 341
>gi|325925747|ref|ZP_08187120.1| hypothetical protein XPE_1074 [Xanthomonas perforans 91-118]
gi|325543804|gb|EGD15214.1| hypothetical protein XPE_1074 [Xanthomonas perforans 91-118]
Length = 386
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG EGA DV IWL +WQ+P IAD
Sbjct: 40 FIADPAPTPQAHASTLLETRGGRLLAAWFGGEHEGAADVGIWLAQRGVQQWQTPRRIADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++LP
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPTPRQWWGLRLLSTDDGAHWSPPQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ +++ I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + L+ L RS ++ + S DGGL W + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275
Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
R LL YN RG L VA+S +DG W LTLE++ EFSYPAVIQ DG
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334
Query: 302 VHITYTYNRTQIKHVVLLPN 321
VH++YT+ R +IKHVVL P
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354
>gi|343083099|ref|YP_004772394.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351633|gb|AEL24163.1| hypothetical protein Cycma_0384 [Cyclobacterium marinum DSM 745]
Length = 382
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 188/343 (54%), Gaps = 28/343 (8%)
Query: 2 KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
K+ G LV E +P + P HAST+VE G + A+F G+ E PDV IW+
Sbjct: 43 KQKVGEGALVHAELIYPLDDRPTPQVHASTLVETPSG-IVAAFFAGTHEKNPDVGIWVSH 101
Query: 62 FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
DG+W P+ + +++ P WNPVLF+ L+LFYK+G + + W G + S +
Sbjct: 102 LGDGQWTRPVEVANGVQNKDLRYPTWNPVLFQPKEGPLMLFYKVGPDPRGWWGMLMTSED 161
Query: 117 KGVTWSEREQL--PPGI---LGPSKNKPILLENGLLLCGSSVESWNS----WGSWMEVTV 167
G TWSE +L P + LGP KNK + LE+G ++ +S+E + W E++
Sbjct: 162 DGKTWSEPRKLGEDPAVGPLLGPVKNKAVQLEDGTIIAPTSIERKHDDDVFWMVHFEMSK 221
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W GPI E IQP + +L++L R+ G+ + S S D G TWS
Sbjct: 222 DNGQTWEVVGPINDGVE-FDAIQPSILFHKDGSLQILCRTRQGV--ISESWSTDNGQTWS 278
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------RGVLKVALSKDDGDSWHDAL 279
K T L NPNSG D V L DGR LL YN + R +L +A+S DDG W +
Sbjct: 279 PMKATSLPNPNSGTDAVTLADGRQLLIYNHSTKEGEEPKGRNILNLAIS-DDGKDWKPFM 337
Query: 280 TLE-ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
TLE E +SYPA+IQ+ DG VH+TYTY+R +K+VV+ P+
Sbjct: 338 TLENEPNESGYSYPAIIQSKDGMVHMTYTYDRKTVKYVVVDPS 380
>gi|384420180|ref|YP_005629540.1| BNR-Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353463093|gb|AEQ97372.1| BNR-Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 367
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 178/322 (55%), Gaps = 24/322 (7%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F + AP HAST++E G L A+FGG+ EGA DV IWL +WQ+P +AD
Sbjct: 40 FITDPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRRLADG 99
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ VP WNPVLF+ + + L+YK+G ++W G S + G WS ++LP
Sbjct: 100 AHAGVDGAAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ N++ +
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RRWRAHLEWSDDDGAHWQRGMPL---NDASVIG 215
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
IQP + L+ L RS ++ + S DGGL W + NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 273
Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
DGR LL YN RG L VA+S +DG W LTLE + EFSYPAVIQ D
Sbjct: 274 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 332
Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
G VHI+YT+ R +IKHVVL P
Sbjct: 333 GLVHISYTWKRQRIKHVVLDPT 354
>gi|21232340|ref|NP_638257.1| hypothetical protein XCC2909 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66767527|ref|YP_242289.1| hypothetical protein XC_1200 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21114112|gb|AAM42181.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66572859|gb|AAY48269.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 355
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 139/322 (43%), Positives = 180/322 (55%), Gaps = 26/322 (8%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
F A+ AP HAST++E G L A+FGG+ EGA DV IWL WQ+P IAD
Sbjct: 36 FIADQAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ +P WNPVLF+ + + L+YK+G + W G S + G WS ++LP
Sbjct: 96 AHAGVDGAALPAWNPVLFQSATGPVQLYYKLGPNPRDWWGLRITSSDNGAHWSPPQRLPN 155
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L +G +L SS E W + +E + D G W++ P+ ++ I
Sbjct: 156 GILGPIKNKPVQLPSGRILAPSSGED-RGWRAHLEWSDDDGAHWQRGLPLN-DAATIGAI 213
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVKL 246
QP A+ L+ L RS ++ + S DGGL W +P QLL NPNSG D V L
Sbjct: 214 QPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAVML 268
Query: 247 KDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
DGR LL YN RG L VA+S DDG W LTLE++ EFSYPAVIQ
Sbjct: 269 HDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQTR 327
Query: 299 DGSVHITYTYNRTQIKHVVLLP 320
DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 DGLVHISYTWKRQRIKHVVIDP 349
>gi|372222469|ref|ZP_09500890.1| alpha-L-rhamnosidase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 406
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/328 (39%), Positives = 188/328 (57%), Gaps = 21/328 (6%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR 66
K + EEF F +P + CHAS+++E+D G L +FGGS EGAPDVKIWL + K G+
Sbjct: 48 KAIDLEEFVF--TQSPTRDCHASSLLELDNGDLLCTWFGGSKEGAPDVKIWLARKPKGGQ 105
Query: 67 WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ 126
WQ P+ + ++NPVL KL + ++Y + + G + S + G TWS ++
Sbjct: 106 WQPPVRIAGDDGQTVFNPVLVKLRGGDIQIYY-VSPGID--DGHVITSRDNGYTWSSPKK 162
Query: 127 LPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP-NES 185
LP G +GP NKP+ LE+G ++ G+S++ +E++ D +W K GPI P N
Sbjct: 163 LPEGFVGPIVNKPVYLEDGTIIAGASLQGNPGKRIHVEISKDDAETWFKVGPISDPVNTK 222
Query: 186 LSVIQPVPFHTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+IQP + + L++L RS + ++ S S DGG+TWS L N NS ID
Sbjct: 223 YQIIQPTILVHSQKRLQILARSNEKDENTKIAQSWSEDGGITWSLVTDITLPNNNSAIDA 282
Query: 244 VKLKDGRLLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEENLA-MEFSYP 292
V L DGR LL YN + RG+L VA+SK DG +W A LE +++SYP
Sbjct: 283 VTLDDGRHLLVYNHSTREDSIGGRKGRGILTVAVSK-DGINWDAAAVLEYRTGTVQYSYP 341
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLP 320
AVIQ +DG VH+TY+++R IKHVV+ P
Sbjct: 342 AVIQTNDGLVHVTYSWHRKLIKHVVIDP 369
>gi|325919351|ref|ZP_08181384.1| hypothetical protein XGA_0321 [Xanthomonas gardneri ATCC 19865]
gi|325550164|gb|EGD20985.1| hypothetical protein XGA_0321 [Xanthomonas gardneri ATCC 19865]
Length = 465
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 180/324 (55%), Gaps = 28/324 (8%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP------ 70
F A+ AP HAST++E G L A+FGG+ EGA DV IWL WQ+P
Sbjct: 36 FIADPAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95
Query: 71 -IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
+ +P WNPVLF+ + + L+YK+G + W G S + G WS ++LP
Sbjct: 96 AHAGADGAALPAWNPVLFQSTTGPVQLYYKLGPNPRDWWGLRITSTDDGAHWSPPQRLPD 155
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
GILGP KNKP+ L NG +L SS E W + +E + D G W++ P+ P+ + I
Sbjct: 156 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLNEPS-VIGAI 213
Query: 190 QP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVK 245
QP + HT R ++ L RS ++ + S DGGL W +P QLL NPNSG D V
Sbjct: 214 QPSLLLHTDGR-VQALGRSQQ--NKLFSTFSTDGGLHW---QPMQLLDVENPNSGTDAVM 267
Query: 246 LKDGRLLLAYNT--------VSRGVLKVALS-KDDGDSWHDALTLEENLAMEFSYPAVIQ 296
L+DGR LL YN RG L VA++ DDG W LTLE + EFSYPAVIQ
Sbjct: 268 LRDGRALLVYNPGIAGKDWWDGRGTLAVAVALSDDGVHWQRVLTLENSAKDEFSYPAVIQ 327
Query: 297 ASDGSVHITYTYNRTQIKHVVLLP 320
DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 TRDGLVHISYTWKRQRIKHVVIDP 351
>gi|300770074|ref|ZP_07079953.1| BNR repeat-containing glycosyl hydrolase [Sphingobacterium
spiritivorum ATCC 33861]
gi|300762550|gb|EFK59367.1| BNR repeat-containing glycosyl hydrolase [Sphingobacterium
spiritivorum ATCC 33861]
Length = 342
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 134/324 (41%), Positives = 176/324 (54%), Gaps = 22/324 (6%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V F F N F CHAST+ E G ++FGG+ EG DV IW F +W +
Sbjct: 24 VVKTGFLFEENRY-FAQCHASTLAETASGEIRASWFGGTHEGNKDVVIWSSAFDGQKWSA 82
Query: 70 PI-----IADEEPNVPMWNPVLFKLP-SNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
P+ I+ + WNPVLFK N L L+YK+G ++W G +K S + G TWS
Sbjct: 83 PVSVADGISHDTVRYACWNPVLFKAKEDNTLYLYYKVGPNPREWWGAVKTSADNGKTWSS 142
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
LP GILGP KNKP+ L NG++L SSVE + + W +E + D +SW Y PI
Sbjct: 143 ARLLPKGILGPVKNKPLELSNGIILSPSSVEVTEDRWIGNIERSDDQQKSWTSY-PI-DH 200
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
N ++IQP A+ L+VL RS G V S S D G WS T ++NPNS D
Sbjct: 201 NSPFNIIQPSILQHADGRLQVLCRSKEGF--VMTSWSSDEGKNWSALSKTNMINPNSATD 258
Query: 243 GVKLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
+++K+G L+ YN R L++A SK DG W D L LE+ EFSYP +
Sbjct: 259 AIRVKNG-FLIVYNPDIPGKDWWEGRARLRLAYSK-DGIIWKDILVLEDQDKGEFSYPTI 316
Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
IQ + VHITYTYNR IKH++L
Sbjct: 317 IQDTKDLVHITYTYNRKNIKHIIL 340
>gi|302539930|ref|ZP_07292272.1| putative cytoplasmic protein [Streptomyces hygroscopicus ATCC
53653]
gi|302457548|gb|EFL20641.1| putative cytoplasmic protein [Streptomyces himastatinicus ATCC
53653]
Length = 356
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 181/330 (54%), Gaps = 38/330 (11%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWN 83
F CHAST++ + G L A+F GS EGA DV IWL G W +P+ +EP +P WN
Sbjct: 15 FGHCHASTLLPLPDGDVLAAWFAGSREGADDVAIWLARRTGGGWDAPVKVADEPGLPHWN 74
Query: 84 PVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI-----LGPSKNK 138
P LF +LLFYK G + W + RS + G++WS +L PG GP KNK
Sbjct: 75 PALFATGHGEVLLFYKTGHRIPDWRTRVLRSRDGGLSWSAPAELVPGADGAGGRGPVKNK 134
Query: 139 PILLENGLLLCGSSVESWNS----WGSWMEVTVDAGRSWRKYGPIYIPNESL---SVIQP 191
P+ L +G L +SVE S W ++ + + D G SWR+ P+ + + VI+P
Sbjct: 135 PLRLADGGWLAPASVEEPQSAGGRWDAFTDRSDDGGASWRRSAPVPLDRPAFPGAGVIEP 194
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+ +A +R+L+RS GRVC S S DGG+ WS A+P + N NSG+D V+L DGR+
Sbjct: 195 ALWESAPGEVRMLLRS--SCGRVCRSASHDGGVIWSTARPLDVPNNNSGLDAVRLADGRV 252
Query: 252 LLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLA------------------ME 288
+LA+N V +R L +++S+DDG +W A+ LE+ A E
Sbjct: 253 VLAHNPVAAEWGARTPLVLSVSEDDGITWRRAVVLEDRPASGPGAIVPDETGVRTDGHSE 312
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
FSYPA++ +DG V +TYT+ R I V +
Sbjct: 313 FSYPAIVPWADG-VAVTYTWQRRGIAFVTV 341
>gi|372221459|ref|ZP_09499880.1| neuraminidase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 346
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 22/320 (6%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-- 71
EF F PFK CHA T+V ++ G + ++F G+ E DV I+ K+G W P
Sbjct: 30 EFVF--EKGPFKECHAPTLVVLEDGSIMASWFAGTHERHEDVAIYTSIKKEGSWSYPKLV 87
Query: 72 ---IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+A P WNPVLFK P+ L L+YK+G W G K S + G TWSE ++L
Sbjct: 88 ANGVAANGKKYPTWNPVLFKNPNGLLYLYYKVGPSPSTWWGVYKTSKDNGKTWSEAKKLS 147
Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESL 186
GILGP KNKP+LL+NG +L SSVE N W + ME ++D+G++W+K L
Sbjct: 148 DGILGPIKNKPLLLKNGNILSPSSVELENGEIWLAHMEHSIDSGKTWQKIKIPQKKKHKL 207
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
IQP ++++ L RS +V S S + G+TWS AK L NPNSGID + +
Sbjct: 208 --IQPTLLTLKDQSILALFRS--NQDKVVYSTSKNNGITWSKAKAMDLENPNSGIDALTM 263
Query: 247 KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
K+G LL YN R L + S+ DG W ++LE + E+SYPA++Q
Sbjct: 264 KNGNHLLVYNPALSGDQWWDGRSKLNLGYSQ-DGMQWKSIVSLENHQKGEYSYPAIVQDQ 322
Query: 299 DGSVHITYTYNRTQIKHVVL 318
DG +H+ YTY+R +IK+V L
Sbjct: 323 DGYIHLVYTYDRKKIKYVKL 342
>gi|440733196|ref|ZP_20912962.1| sialidase/neuraminidase [Xanthomonas translucens DAR61454]
gi|440363986|gb|ELQ01133.1| sialidase/neuraminidase [Xanthomonas translucens DAR61454]
Length = 364
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 172/317 (54%), Gaps = 19/317 (5%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
F AP CHAST++E G L A+FGG EGA DV IW+ WQ + +AD
Sbjct: 41 FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRSAQGWQPAQRVADG 99
Query: 75 ---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ ++P WNPVLF+ L LFYK+G + ++W G S + GV WS E+LP GI
Sbjct: 100 MQPQGASLPAWNPVLFQPAKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159
Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
LGP KNKP+ L +G +L SS E W + ME + D G W + GP + IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-TGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+ +++ + RS V + S D G +W +L NPNSG D L DGR
Sbjct: 218 SALVHTDGSVQAIGRSQQ--NHVFSAWSRDHGRSWEPMTLLELANPNSGTDAAVLADGRS 275
Query: 252 LLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
LL YN RG L VALS DG+ W LTLE++ EFSYPAVIQ DG VH
Sbjct: 276 LLVYNPTEAGKNWWDGRGTLAVALSA-DGNHWTRVLTLEDSAKDEFSYPAVIQTRDGLVH 334
Query: 304 ITYTYNRTQIKHVVLLP 320
I+YT+ R +IKHVVL P
Sbjct: 335 ISYTWKRIRIKHVVLDP 351
>gi|433679939|ref|ZP_20511604.1| hypothetical protein BN444_03961 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430814943|emb|CCP42233.1| hypothetical protein BN444_03961 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 364
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 173/317 (54%), Gaps = 19/317 (5%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
F AP CHAST++E G L A+FGG EGA DV IW+ WQ + +AD
Sbjct: 41 FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRSAQGWQPAQRVADG 99
Query: 75 EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
++P +P WNPVLF+ L LFYK+G + ++W G S + GV WS E+LP GI
Sbjct: 100 KQPQGAPLPAWNPVLFQPSKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159
Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
LGP KNKP+ L +G +L SS E W + ME + D G W + GP + IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-TGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+ +++ + RS V + S D G +W +L NPNSG D L DGR
Sbjct: 218 SALVHTDGSVQAIGRSQQ--NHVFSAWSRDHGRSWEPMTLLELANPNSGTDAAVLADGRS 275
Query: 252 LLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
LL YN RG L VALS DG+ W LTLE++ EFSYPAVIQ DG VH
Sbjct: 276 LLVYNPTEAGKNWWDGRGTLAVALSA-DGNHWTRVLTLEDSAKDEFSYPAVIQTRDGLVH 334
Query: 304 ITYTYNRTQIKHVVLLP 320
I+YT+ R +IKHVVL P
Sbjct: 335 ISYTWKRIRIKHVVLDP 351
>gi|393781506|ref|ZP_10369701.1| hypothetical protein HMPREF1071_00569 [Bacteroides salyersiae
CL02T12C01]
gi|392676569|gb|EIY70001.1| hypothetical protein HMPREF1071_00569 [Bacteroides salyersiae
CL02T12C01]
Length = 1267
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/359 (36%), Positives = 188/359 (52%), Gaps = 54/359 (15%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR- 66
K ++ ++F + A F CH +T+ E KG + A+FGG+ E PD IW+ G
Sbjct: 910 KAILKDQFLY--EQAAFPECHGATVAETPKGDLVAAFFGGTKERNPDCCIWVCRKVKGSD 967
Query: 67 -WQSPIIADEEPNVPM-----WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
W P+ + P WNPVLF+LP L+LFYK G +V W+G + RS N G T
Sbjct: 968 VWTPPVKVADGVITPTLREACWNPVLFQLPKGELMLFYKRGDKVANWTGWLIRSKNGGKT 1027
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI- 179
WS E+LP G LGP KNKP + NG ++C SS E N W E++ D G++WRK GP+
Sbjct: 1028 WSSPEKLPDGFLGPVKNKPEYI-NGRIICPSSTEGENGWRIHFEISDDRGKTWRKVGPLE 1086
Query: 180 ----------------------------YIPNESLS---VIQPVPFHTANRTLRVLMRSF 208
+ +++L IQP + L++L R+
Sbjct: 1087 ADLALLTQHQGGSNAGKAEEEDDPEGGEVVRDKNLKPIYAIQPSLLKHKDGRLQILCRTR 1146
Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------- 259
N G++ + S D G +W+ + T+++N NSG D V LKDGR LL YN
Sbjct: 1147 N--GKLAEAWSADNGDSWTKLRLTEVVNNNSGTDAVTLKDGRHLLVYNDFGALPGTRKGV 1204
Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
R L VA+S +DG +W A+ LE++ ++SYP+VIQ DG VHI YT+ R +IK+V L
Sbjct: 1205 RTPLSVAIS-EDGMTWDAAVVLEDSPISQYSYPSVIQGKDGKVHIVYTWRRQRIKYVEL 1262
>gi|432881173|ref|ZP_20097545.1| hypothetical protein A317_03829 [Escherichia coli KTE154]
gi|431412123|gb|ELG95208.1| hypothetical protein A317_03829 [Escherichia coli KTE154]
Length = 348
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 179/319 (56%), Gaps = 18/319 (5%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
++ +E FP K HASTI E+ G LV++FGG+ EG DVKIW + + W
Sbjct: 39 IIKQELIFPL--GLIKESHASTIAEMSNGTILVSWFGGTKEGDDDVKIWQSSMTESGWSK 96
Query: 70 PIIA-----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P + D +P +P WNPVLF + N LFYK+G+ W G S N G++WS++
Sbjct: 97 PRVVAYGSEDGKP-LPTWNPVLFNI-HNKTYLFYKVGKTPSTWEGKYITSTNNGLSWSKK 154
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ LP GI GP KNKPIL +L SS E+ W +E T + + + I +
Sbjct: 155 QALPNGIYGPIKNKPILSGKNVLFP-SSEENDKGWFVHIETTSNM----KNFNKIEVSKP 209
Query: 185 S-LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+ S IQP + +++ RS G+ + S+S D G +W+ + + + + NSG+DG
Sbjct: 210 TGTSAIQPTLLKFRDNEIKMYTRSDQGV--LMESQSSDSGHSWTTLRKSTIPSTNSGVDG 267
Query: 244 VKLKDGRLLLAYN-TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+ L+DG L+ +N R +L ++ S D+G +W + LTLE EFSYPA+IQ+S+G +
Sbjct: 268 IVLRDGTGLIVHNPGKDRSILSISRSNDNGKTWGNVLTLESQEGKEFSYPAIIQSSNGKI 327
Query: 303 HITYTYNRTQIKHVVLLPN 321
HITYTYNR IKH+ L N
Sbjct: 328 HITYTYNRESIKHITLEEN 346
>gi|285018990|ref|YP_003376701.1| sialidase/neuraminidase [Xanthomonas albilineans GPE PC73]
gi|283474208|emb|CBA16709.1| putative sialidase/neuraminidase protein [Xanthomonas albilineans
GPE PC73]
Length = 358
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 167/312 (53%), Gaps = 20/312 (6%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW-QSPIIADEE----- 76
P CHAST+VE D G L A+FGG EGA DV IW+ W + +AD
Sbjct: 39 PTAQCHASTLVETDDG-LLAAWFGGQHEGADDVGIWVARRDAHGWLPAQRVADGAQAHGA 97
Query: 77 PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK 136
P +P WNPVLF+ L LFYK+G + + W G S + G WS E+LP GILGP K
Sbjct: 98 PPLPAWNPVLFQAALGPLRLFYKVGPDPKGWWGMQITSSDGGRHWSAPERLPTGILGPIK 157
Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
NKP+ L +G +L SS E W + ME + D G W + GP + IQP
Sbjct: 158 NKPVQLASGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPAMNDPAKIGAIQPSVLVH 215
Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN 256
AN ++ + RS V + S D G TW L NPNSG D V L DGR LL YN
Sbjct: 216 ANGRVQAVGRSRQ--NHVFSTWSHDQGRTWEPMTLLNLANPNSGTDAVVLADGRSLLVYN 273
Query: 257 TVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
RG+L VALS +DG W LTLE + EFSYPAVIQ DG +HI+YT+
Sbjct: 274 PTEAGKEWWDGRGILAVALS-NDGLHWTRVLTLENSPKDEFSYPAVIQTRDGQIHISYTW 332
Query: 309 NRTQIKHVVLLP 320
RT IKHVVL P
Sbjct: 333 KRTHIKHVVLDP 344
>gi|424796165|ref|ZP_18221930.1| exported exo-alpha-sialidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795084|gb|EKU23838.1| exported exo-alpha-sialidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 364
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 172/316 (54%), Gaps = 21/316 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
F AP CHAST++E G L A+FGG EGA DV IW+ WQ + +AD
Sbjct: 41 FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRGAQGWQRAQRVADG 99
Query: 75 EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
++P +P WNPVLF+ L LFYK+G + ++W G S + GV WS E+LP GI
Sbjct: 100 KQPQGAPLPAWNPVLFQPAKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159
Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
LGP KNKP+ L +G +L SS E W + ME + D G W + GP + IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217
Query: 192 -VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
V HT R + N V + S D G +W+ L NPNSG D V L DGR
Sbjct: 218 SVLVHTDGRVQAIGRSQQN---HVFSTWSRDHGRSWAPMTLLDLANPNSGTDAVVLADGR 274
Query: 251 LLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
LL YN RG L VALS DG+ W LTLE++ EFSYPAVIQ DG V
Sbjct: 275 SLLVYNPTEAGKDWWDGRGTLAVALSG-DGNHWARVLTLEDSAKDEFSYPAVIQTRDGLV 333
Query: 303 HITYTYNRTQIKHVVL 318
HI+YT+ R +IKHVVL
Sbjct: 334 HISYTWKRIRIKHVVL 349
>gi|146299804|ref|YP_001194395.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium johnsoniae
UW101]
gi|146154222|gb|ABQ05076.1| hypothetical lipoprotein [Flavobacterium johnsoniae UW101]
Length = 349
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 171/311 (54%), Gaps = 26/311 (8%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ------SPIIADEEPNV 79
+ HAST+VE++ L A+FGG EGA DV I++ ++K+ +W P+I D + +
Sbjct: 44 ASHASTLVELENNTLLAAWFGGKYEGAKDVGIYISSYKEQKWSVPKELIKPLIKDGD-TL 102
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
P WNPVLFK S L LFYK+G+ ++W G M S ++G TWS + LP GILGP +NKP
Sbjct: 103 PCWNPVLFKSKSQNLYLFYKVGKNPREWFGAMIVSKDEGKTWSNSKYLPKGILGPIRNKP 162
Query: 140 ILLENGLLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE--SLSVIQPVPFH 195
I G++LCGSS ES N W ++E + SW I N+ + +IQP
Sbjct: 163 IETTPGVILCGSSTESVDDNKWRVFIETYTEGTDSW----TIADINDKKNFDIIQPTFLV 218
Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
+++ +++L RS + ++ S S D G TW ++N NSG+D V L D LL
Sbjct: 219 HSDKEIQILSRSRH--NKLISSWSEDNGKTWQKTDSINVVNSNSGVDAVTLSDKSFLLVN 276
Query: 256 NTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
N + R VL V SK DG +W LE EFSYPA+IQ SD VHI YT
Sbjct: 277 NPLKMGKDWFNGRNVLDVEYSK-DGVNWKKLFDLENQKEGEFSYPAIIQTSDKKVHILYT 335
Query: 308 YNRTQIKHVVL 318
YNR IKH
Sbjct: 336 YNRKFIKHTAF 346
>gi|256426128|ref|YP_003126781.1| neuraminidase [Chitinophaga pinensis DSM 2588]
gi|256041036|gb|ACU64580.1| putative neuraminidase [Chitinophaga pinensis DSM 2588]
Length = 370
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 176/324 (54%), Gaps = 23/324 (7%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV EEF P + HASTI + G L ++FGGS E APDV I+ F G+W +
Sbjct: 53 LVREEFIM--EKPPVPAAHASTITALPDGDLLASWFGGSRESAPDVCIYTARFHKGKWSA 110
Query: 70 PIIA-----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
P + ++ WNPVL + ++LFYK+G ++W G + S + G++WS
Sbjct: 111 PEVVASGVINDSTRYAAWNPVLITTKAGRVILFYKVGPNPREWWGMLISSTDNGLSWSAP 170
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWN--SWGSWMEVTVDAGRSWRKYGPIYIP 182
E+LP ILGP KNKP+ L +G +L SS ES + W +E++ G W++ + I
Sbjct: 171 ERLPGNILGPIKNKPVQLADGSILHPSSTESLDEKDWHIHVELSDSTGHQWKR---VPID 227
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ VIQP A + L++L RS + + S D G TWS +LNPNSGID
Sbjct: 228 CDTFGVIQPSALFHAGKRLQMLCRSRQNA--IVQTWSTDNGQTWSPLTKQSMLNPNSGID 285
Query: 243 GVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
V ++G+ +L YN R L+VA+S DG+ W D L + EFSYPA+
Sbjct: 286 AVTTRNGQQVLVYNPAVSGKDWWNGRNELRVAVSI-DGEHWKDVYVLVKEAKGEFSYPAI 344
Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
IQ DG +HITYTY+R I++V L
Sbjct: 345 IQTKDGILHITYTYDRKNIRYVAL 368
>gi|332188854|ref|ZP_08390562.1| BNR/Asp-box repeat family protein [Sphingomonas sp. S17]
gi|332011120|gb|EGI53217.1| BNR/Asp-box repeat family protein [Sphingomonas sp. S17]
Length = 358
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 173/322 (53%), Gaps = 26/322 (8%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-----IADE 75
+AP+ HASTI E+ G A+F GS E PDV+IW W +P+ I+ +
Sbjct: 39 NAPYPQAHASTIAELPDGTLAAAWFAGSGESRPDVRIWFARRGTKGWDAPVPVADGISAD 98
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
P WNPVLF+ P L L+YK+G ++W G + RS + G W+ E+LP G+LGP
Sbjct: 99 GRRYPTWNPVLFQPPGEPLHLYYKVGPNPRQWWGTVIRSSDGGRHWTRPERLPDGVLGPI 158
Query: 136 KNKPILLENGLLLCGSSVE----SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
KNKP++ +G L SS E N W +E + D G+SW GP + IQP
Sbjct: 159 KNKPVIAPDGAWLSPSSREEGTAEANIWSLRIERSTDKGKSW-TVGPRIASPMHIEAIQP 217
Query: 192 -VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
+ FH R L ++ R+ G + S S D G+TWS L NPN+G D V L DGR
Sbjct: 218 SILFHRDGR-LELVARTRQGA--LAQSWSRDKGVTWSPIAAIDLPNPNAGTDAVTLADGR 274
Query: 251 LLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEEN-LAMEFSYPAVIQASD 299
L+ YN + R L + LS DDG W +ALTLE + ++YPAV+Q D
Sbjct: 275 QLVVYNHSAHAPDTPGDGPRWPLNIGLS-DDGVRWRNALTLESKPMPDGYAYPAVVQTRD 333
Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
G VH+TYT+NR I+HVV+ P
Sbjct: 334 GLVHVTYTWNRQHIRHVVIDPR 355
>gi|325299331|ref|YP_004259248.1| alpha-L-rhamnosidase [Bacteroides salanitronis DSM 18170]
gi|324318884|gb|ADY36775.1| alpha-L-rhamnosidase [Bacteroides salanitronis DSM 18170]
Length = 1273
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 139/353 (39%), Positives = 185/353 (52%), Gaps = 52/353 (14%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPI 71
E F A F CHASTIVE++ G + A+FGG+ E PDV IW+ G W P+
Sbjct: 917 ENQFLYEKASFPECHASTIVELENGDLVAAFFGGTKERNPDVCIWVCRKPKGSDTWTEPV 976
Query: 72 -IAD-----EEPNVPM--------------WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM 111
+AD +P+ + WNPVLF++P LLLF+KIG V W+G +
Sbjct: 977 KVADGVFDLNDPDAKIAGITADIKDHRKACWNPVLFQVPGGDLLLFFKIGLNVPDWTGWL 1036
Query: 112 KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
RS + G TWS RE LP G LGP KNKP + NG +LC SS E W +E + D G+
Sbjct: 1037 VRSKDGGKTWSRREALPQGFLGPIKNKPEFI-NGRILCPSSTEGEQGWRIHIEYSDDMGK 1095
Query: 172 SWRKYGPI----YIPNE------------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
+W+ GPI P++ + VIQP + TL+V+ R+ N +
Sbjct: 1096 TWKTTGPIPAELEFPSQYRNMNADEKEKRPILVIQPSILKHKDGTLQVICRTRN--SHLA 1153
Query: 216 MSESCDGGLTWSYAKPTQ-LLNPNSGIDGVKLKDGRLLLAYNTVS---------RGVLKV 265
S S D G TWS + L N NSG D V L+DGR L YN S R + +
Sbjct: 1154 TSWSKDNGSTWSKVTLIEGLPNNNSGTDAVTLQDGRHALVYNNFSPLLGDKKGVRTPINL 1213
Query: 266 ALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
ALS DDG W LTLE++ E+SYPA+IQ DG +HIT+T+ R IK++ +
Sbjct: 1214 ALS-DDGIHWTPVLTLEDSPVREYSYPAIIQGKDGKLHITFTWRRELIKYMEI 1265
>gi|395803564|ref|ZP_10482808.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
gi|395434118|gb|EJG00068.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
Length = 350
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 124/318 (38%), Positives = 171/318 (53%), Gaps = 20/318 (6%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-- 71
E ++ A+ + HAST+VE L A+FGG EGA DV I++ +KD +W +P
Sbjct: 32 ESSYIADPPVTTNSHASTLVEYKPNEILAAWFGGKYEGAKDVGIYISAYKDKKWSTPKEL 91
Query: 72 ---IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+ +P WNPVLFK S L LFYKIG+ ++W G MK S + G TW E + LP
Sbjct: 92 IQPLIKNGDTLPCWNPVLFKSKSQNLYLFYKIGKNPREWFGAMKVSKDNGTTWGEPKYLP 151
Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESL 186
GILGP +NKPI G++LCGSS ES N+ W +E +A SW+K N++
Sbjct: 152 EGILGPIRNKPIETAPGVILCGSSTESVNTDQWRVHVEEYTEATDSWKKIA--VENNQNF 209
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
+IQP + L++L RS + +V S S D G +W ++N NSGID + +
Sbjct: 210 DIIQPTFLIHSATDLQMLSRSKH--NKVVSSWSGDNGKSWIRTNTINVINSNSGIDALTV 267
Query: 247 KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
LL N + R +L V SK DG +W LE+ EFSYPA+IQ +
Sbjct: 268 NKNLFLLVNNPLPMGKDWFNGRNILDVEYSK-DGLNWTKLFDLEKQEKGEFSYPAIIQTA 326
Query: 299 DGSVHITYTYNRTQIKHV 316
D +H+ YTY+R IKH
Sbjct: 327 DKRIHVLYTYDRKYIKHT 344
>gi|298480648|ref|ZP_06998844.1| alpha-rhamnosidase [Bacteroides sp. D22]
gi|298273082|gb|EFI14647.1| alpha-rhamnosidase [Bacteroides sp. D22]
Length = 1300
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 133/347 (38%), Positives = 182/347 (52%), Gaps = 32/347 (9%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
+ + G +G + +EF N PF HAS+I E +G + A+FGG+ E PD IW+
Sbjct: 946 LDAENGRQGFITDEFI--CNYPPFPESHASSIAETKEG-LVAAWFGGTKEKNPDCCIWVS 1002
Query: 61 TFKDGRWQSPI-IAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
W P +AD + WNPVL PS L L+YKIG V WSG + RS++
Sbjct: 1003 RHTPQGWSKPQKVADGILDGTKYACWNPVLTVTPSGELQLYYKIGINVSGWSGHVIRSHD 1062
Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
G +WS E LP G LGP KNKP+ + ++C SS E N W E++ D G +W+K
Sbjct: 1063 CGKSWSRPESLPEGFLGPIKNKPVWI-GKRMICPSSTEGENGWQVHFEISDDEGHTWKKT 1121
Query: 177 GPI----YIPNE----------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
PI I E ++ IQP + L+ L R+ N G + + S D
Sbjct: 1122 NPIDAQEIIETETIKRSNAQKSTIQAIQPTILVHRDGKLQALCRTQNN-GSIASTWSYDQ 1180
Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDS 274
G TW+ + T+L N NSGIDGV LKDGR LL YN R + VA+S +DG
Sbjct: 1181 GETWTPLEFTELPNNNSGIDGVTLKDGRFLLVYNHYRYIRGKKKKRTPINVAIS-NDGIH 1239
Query: 275 WHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
W + LE++ ++SYP+VIQ DG VHI YT+ R +IK+ L P+
Sbjct: 1240 WQAGVVLEDSPINQYSYPSVIQGKDGKVHIVYTWRRQRIKYACLDPD 1286
>gi|423221594|ref|ZP_17208064.1| hypothetical protein HMPREF1062_00250 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645921|gb|EIY39641.1| hypothetical protein HMPREF1062_00250 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1289
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 180/342 (52%), Gaps = 32/342 (9%)
Query: 6 GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
G KGL+ ++F F HA+TIVE +G + A+FGG+ E PD IW+
Sbjct: 940 GRKGLLEDDFI--CRKPSFPESHAATIVEGRRG-IVAAWFGGTKEKNPDCCIWVSRKTKT 996
Query: 66 RWQSP-IIAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
W P ++AD + WNPVL + P L L+YKIG V WSG + S + G TW
Sbjct: 997 GWTEPQMVADGVLDGTKYACWNPVLTETPKGELQLYYKIGVNVAGWSGHVVTSKDGGKTW 1056
Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
S+ LP G LGP KNKP+ + ++C SS E N W E++ D G++WRK GPI
Sbjct: 1057 SKSRALPEGFLGPIKNKPVWI-GKRMICPSSTEDENGWRIHFEISDDEGKTWRKIGPITA 1115
Query: 182 --------------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
++++ VIQP + L+ L R+ N G V + S D G +WS
Sbjct: 1116 SEIVETDQVKFSNQKHKTIQVIQPTILTHKDGKLQALCRTQNN-GSVATTWSEDNGESWS 1174
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
L N NSGIDG+ LKDGR LL YN R + VA+S DDG W A+
Sbjct: 1175 PVTFIDLPNNNSGIDGITLKDGRHLLVYNHYRYIKGKRKERTPINVAIS-DDGMHWKAAV 1233
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE++ ++SYP+VIQ+ DG VHI YT+ R +IK+ L P
Sbjct: 1234 VLEDSPINQYSYPSVIQSKDGKVHIVYTWRRQRIKYACLDPQ 1275
>gi|300726577|ref|ZP_07060019.1| putative alpha-rhamnosidase [Prevotella bryantii B14]
gi|299776145|gb|EFI72713.1| putative alpha-rhamnosidase [Prevotella bryantii B14]
Length = 1251
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 184/342 (53%), Gaps = 41/342 (11%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG-- 65
+ + ++F + S P HASTI E+ G L AYFGG+ E PDV IW+ G
Sbjct: 908 EAFMEDQFLYEYTSFP--QAHASTITELKNGDLLAAYFGGTKEKNPDVCIWVSRKPKGAD 965
Query: 66 RWQSPII----ADEEPNVPM----------WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM 111
+W++P++ D+ + P+ WNPVLF +P + F+KI +E+ W G +
Sbjct: 966 KWEAPMMIADGTDDYLHRPLNSTDKARKACWNPVLFTMPDGEIWCFFKIAEEISDWQGWV 1025
Query: 112 KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
+S + G TWS+RE LP G +GP KNKP+L+ + LLC SS E + W +E+ +
Sbjct: 1026 VKSKDGGKTWSKREMLPKGFIGPVKNKPVLIGD-RLLCPSSTEG-DWWKFHVEIYNIKTK 1083
Query: 172 SWRKYGPIYIPNESLS---------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
W+ GPI + L+ IQP + L VLMRS N ++ S S D
Sbjct: 1084 EWKYVGPIEAEDAVLTDDQKVHPIKCIQPSILQLKDGRLMVLMRSHN--AKLAKSYSSDN 1141
Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGD 273
G TWS + +++ N SG D V LKDGR +L YN R L +A+S +DG+
Sbjct: 1142 GETWSKVELSEVENNQSGTDAVTLKDGRHVLIYNNFETLMGTKKGPRTPLSIAVS-EDGE 1200
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
WH +LTLE++ ++SYPA+IQ DG++H YT+ R +I +
Sbjct: 1201 HWHHSLTLEDSPVSQYSYPAIIQGKDGNLHCVYTWRRERIAY 1242
>gi|224538250|ref|ZP_03678789.1| hypothetical protein BACCELL_03141, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224520136|gb|EEF89241.1| hypothetical protein BACCELL_03141 [Bacteroides cellulosilyticus DSM
14838]
Length = 1248
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 179/342 (52%), Gaps = 32/342 (9%)
Query: 6 GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
G KGL+ ++F F HA+TIVE +G + A+FGG+ E PD IW+
Sbjct: 899 GRKGLLEDDFI--CRKPSFPESHAATIVEGRRG-IVAAWFGGTKEKNPDCCIWVSRKTKT 955
Query: 66 RWQSP-IIAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
W P ++AD + WNPVL + P L L+YKIG V WSG + S + G TW
Sbjct: 956 GWTEPQMVADGVLDGTKYACWNPVLTETPKGELQLYYKIGVNVAGWSGHVVTSKDGGKTW 1015
Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
S+ LP G LGP KNKP+ + ++C SS E N W E++ D G++WRK GPI
Sbjct: 1016 SKSRALPEGFLGPIKNKPVWI-GKRMICPSSTEDENGWRIHFEISDDEGKTWRKIGPITA 1074
Query: 182 --------------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
++++ VIQP + L+ L R+ N G V + S D G +WS
Sbjct: 1075 SEIVETDQVKFSNQKHKTIQVIQPTILTHKDGKLQALCRTQNN-GSVATTWSEDNGESWS 1133
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
L N NSGIDG+ LKDGR LL YN R + VA+S DDG W A+
Sbjct: 1134 PVTFIDLPNNNSGIDGITLKDGRHLLVYNHYRYIKGKRKERTPINVAIS-DDGMHWKAAV 1192
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE++ ++SYP+VIQ DG VHI YT+ R +IK+ L P
Sbjct: 1193 VLEDSPINQYSYPSVIQGKDGKVHIVYTWRRQRIKYACLDPQ 1234
>gi|32471321|ref|NP_864314.1| hypothetical protein RB1257 [Rhodopirellula baltica SH 1]
gi|32443162|emb|CAD71993.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 375
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/342 (37%), Positives = 181/342 (52%), Gaps = 31/342 (9%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
+D G++ ++F + + APF SCHASTI + G A+FGG EGA DV IW+
Sbjct: 40 QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 96
Query: 63 KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
W +P ++AD + P WNPVLF PS LF K+G + W G + S +
Sbjct: 97 DGNSWSAPRMVADGVQPDGSRHPCWNPVLFHAPSGATWLFIKVGPSPKDWWGEVLFSDDS 156
Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
G T+ +R +LP G+LGP + KP L+++G LLCGSS E + W E G++
Sbjct: 157 GKTFRDRTRLPEGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 215
Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
W + I+ E + IQP A+ LR L R+ + + ++S D G TW+
Sbjct: 216 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 272
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
T + NPNSGID V KDGR L N + R L +++S DG ++ +
Sbjct: 273 KPIATDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 331
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+IQA DG +H+TYT+ R +IKHV+ PN
Sbjct: 332 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPN 373
>gi|255036622|ref|YP_003087243.1| hypothetical protein Dfer_2863 [Dyadobacter fermentans DSM 18053]
gi|254949378|gb|ACT94078.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 371
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 173/313 (55%), Gaps = 17/313 (5%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
+EF F + PF CHAST+V +D G FL+A+FGG+ E PDV IWL + G+W +P+
Sbjct: 45 KEFVF-GDQRPFPQCHASTLVRLDDGQFLIAWFGGTEEKNPDVGIWLSKGRPGQWSAPVE 103
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PP 129
+ WNPVL K ++L++K+G+E+ W +K S + G TWS+ +L
Sbjct: 104 VAKIREDAHWNPVLQKTSDGKVILYFKVGKEIAHWETWVKTSSDNGETWSDAYELVKGDK 163
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS-- 187
G GP K+K I L NG L G+S E N W +++ + D G++W I +
Sbjct: 164 GGRGPVKDKLIELSNGDWLAGASNEV-NRWEVFVDRSTDKGKTWTASPYFKIDTTEIKGK 222
Query: 188 -VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
IQP + + + +L+R+ G+ + S+S D G TWS T L NPNSGID KL
Sbjct: 223 GAIQPTLWESEPGQVHMLVRTTGGV--IGRSDSKDNGKTWSTITKTSLPNPNSGIDLAKL 280
Query: 247 KDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQASDG 300
DG L+LAYN SR L + LS DDG +W D + + E+SYPAVI D
Sbjct: 281 NDGTLVLAYNPDDHNWGSRSPLSLILSYDDGQNWTDKIDIATGKKEDEYSYPAVISWGD- 339
Query: 301 SVHITYTYNRTQI 313
SV +TYTYNR +I
Sbjct: 340 SVAVTYTYNRRKI 352
>gi|312130600|ref|YP_003997940.1| hypothetical protein Lbys_1885 [Leadbetterella byssophila DSM
17132]
gi|311907146|gb|ADQ17587.1| hypothetical protein Lbys_1885 [Leadbetterella byssophila DSM
17132]
Length = 349
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/338 (37%), Positives = 186/338 (55%), Gaps = 32/338 (9%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
+GL+ E +P + P CHAST++EV+ G L A+F G+ E DV I + FK+G+W
Sbjct: 18 RGLIHEGLVYPLDKKPTAQCHASTLLEVNDG-ILCAFFAGTHEKHADVGIRVARFKEGKW 76
Query: 68 QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
P+ ++ P WNPVLF+ + LFYK+G +V W G S ++G+TWS
Sbjct: 77 SWPVEVVNGYVNDTLRYPTWNPVLFRPKDGPIYLFYKVGPDVDHWWGAYVTSEDEGLTWS 136
Query: 123 E-----REQLPPGILGPSKNKPILLENGLLLCGSSVESWNS-----WGSWMEVTVDAGRS 172
+ + +LGP KNK I L +G ++ +S+E + W + EV+ D G++
Sbjct: 137 TPQVMGKHAIVGDLLGPVKNKAIQLPDGTIISPTSMERRGTENGRDWRIYFEVSKDNGKN 196
Query: 173 WRKYGPIYIPNESLS--VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
W+ PI N+ + IQP N L++L R+ + + S S D G TWS
Sbjct: 197 WQVIPPI---NDGVQYDAIQPSILIHKNGDLQILARTLQDV--LVTSWSKDKGKTWSPLT 251
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENL 285
P+ L NPNSG D + L+DGR +L YN +R VL VA+S +DG W +++ EN+
Sbjct: 252 PSGLPNPNSGTDALTLQDGRHVLVYNHSVRSGENRNVLNVAVS-EDGVDW-KMVSVLENV 309
Query: 286 AME--FSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
+ +SYPAVIQ DG VHI+YTY R IKH++ P+
Sbjct: 310 PIHSGYSYPAVIQTKDGKVHISYTYARQSIKHMIFDPS 347
>gi|449138024|ref|ZP_21773326.1| BNR/Asp-box repeat domain protein [Rhodopirellula europaea 6C]
gi|448883351|gb|EMB13882.1| BNR/Asp-box repeat domain protein [Rhodopirellula europaea 6C]
Length = 362
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 128/342 (37%), Positives = 183/342 (53%), Gaps = 31/342 (9%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
+D G++ ++F + + APF SCHASTI + G F+ A+FGG EGA DV IW+ +
Sbjct: 26 QDDAPAGVLQQQFIY--DEAPFPSCHASTICQTPNG-FVAAWFGGEREGAKDVSIWVSDY 82
Query: 63 KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
W +P +AD + P WNPVLF+ PS LF K+G + W G + S +
Sbjct: 83 DGKAWSAPRKVADGVQADRSRHPCWNPVLFQAPSGVTWLFIKVGPSPRDWWGEVLFSDDG 142
Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
G T+ +R +LP G+LGP + KP L+++G LLCGSS E + W E + G +
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDDGKTLLCGSSTEH-DGWRVHFERLMLTGEANAEA 201
Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
W + I+ E + IQP A+ LR L R+ + + ++S D G TW+
Sbjct: 202 VSETHWDRGEAIHDGKE-FAAIQPAILRLADGRLRALCRTMQSV--IVQTDSDDNGRTWT 258
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
T + NPNSGID V KDGR L N + R L +++S DG+++ +
Sbjct: 259 KPISTDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGETYREIA 317
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+IQA DG H+TYT+ R +IKHV+ P
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKFHMTYTWKRKKIKHVIFDPT 359
>gi|417303952|ref|ZP_12090991.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica WH47]
gi|327539742|gb|EGF26347.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica WH47]
Length = 361
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 184/342 (53%), Gaps = 31/342 (9%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
+D G++ ++F + + APF SCHASTI + G A+FGG EGA DV IW+
Sbjct: 26 QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82
Query: 63 KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
W +P ++AD ++ P WNPVLF PS LF K+G + W G + S +
Sbjct: 83 DGNSWSAPRMVADGVQPDDSRHPCWNPVLFHAPSGVTWLFIKVGPSPKDWWGEVLFSDDG 142
Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
G T+ +R +LP G+LGP + KP L+++G LLCGSS E + W E G++
Sbjct: 143 GKTFRDRTRLPKGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201
Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
W + I+ NE + IQP A+ LR L R+ + + ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGNE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
T + NPNSG+D V KDGR L N + R L +++S DG ++ +
Sbjct: 259 KPIATDMPNPNSGLDIVTAKDGRQWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 317
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+IQA DG +H+TYT+ R +IKHV++ P+
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIVDPS 359
>gi|440717920|ref|ZP_20898393.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
gi|436436929|gb|ELP30617.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
Length = 361
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 128/342 (37%), Positives = 180/342 (52%), Gaps = 31/342 (9%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
+D G++ ++F + + APF SCHASTI + G A+FGG EGA DV IW+
Sbjct: 26 QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82
Query: 63 KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
W +P +AD + P WNPVLF PS LF K+G + W G + S +
Sbjct: 83 DGNSWSAPRKVADGVQSDGSRHPCWNPVLFHAPSGATWLFIKVGPSPKDWWGEVLFSDDS 142
Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
G T+ +R +LP G+LGP + KP L+++G LLCGSS E + W E G++
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201
Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
W + I+ E + IQP A+ LR L R+ + + ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
T + NPNSGID V KDGR L N + R L +++S DG ++ +
Sbjct: 259 KPIATDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 317
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+IQA DG +H+TYT+ R +IKHV+ PN
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPN 359
>gi|440716314|ref|ZP_20896825.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
gi|436438660|gb|ELP32185.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
Length = 381
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 134/357 (37%), Positives = 177/357 (49%), Gaps = 45/357 (12%)
Query: 5 CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
G LV+ + F P K HASTIVE G + A+F G+ E PDV I + ++
Sbjct: 28 AGEGALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W + + P WNPVLF+ L+LFYK+G ++W G + S + G
Sbjct: 87 GQWTDSVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146
Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
TWS +L +LGP KNKPI L +G +LC SS E + W EVT D
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPIELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206
Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
G++W GPI E+ IQP + L+VL RS R+ S S DGG TWS
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDNRLQVLCRSKEK--RIVESWSKDGGKTWSTL 263
Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
T L NPNSG D V L+DGR +L YN R +L +A+S DG +W
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRREDAPDLKPRRILNLAISS-DGKTWK 322
Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LTLE E+SYPA+IQ SDG +++ YTYNR +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDLDRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379
>gi|423213426|ref|ZP_17199955.1| hypothetical protein HMPREF1074_01487 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693886|gb|EIY87116.1| hypothetical protein HMPREF1074_01487 [Bacteroides xylanisolvens
CL03T12C04]
Length = 1286
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 129/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+SKD G +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVSKD-GINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|32475400|ref|NP_868394.1| hypothetical protein RB8501 [Rhodopirellula baltica SH 1]
gi|32445941|emb|CAD78672.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 381
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/357 (36%), Positives = 177/357 (49%), Gaps = 45/357 (12%)
Query: 5 CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
G LV+ + F P K HASTIVE G + A+F G+ E PDV I + ++
Sbjct: 28 AGEDALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W + + P WNPVLF+ L+LFYK+G ++W G + S + G
Sbjct: 87 GQWTESVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146
Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
TWS +L +LGP KNKP+ L +G +LC SS E + W EVT D
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPVELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206
Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
G++W GPI E+ IQP + L++L RS R+ S S DGG TWS
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDDRLQILCRSKEK--RIVESWSKDGGKTWSTL 263
Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
T L NPNSG D V L+DGR +L YN R +L +A+S DG +W
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRRKDAPDLKPRRILNLAISS-DGKTWK 322
Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LTLE E+SYPA+IQ SDG +++ YTYNR +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDSDRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379
>gi|421609016|ref|ZP_16050221.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
gi|408500212|gb|EKK04666.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
Length = 381
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 133/357 (37%), Positives = 177/357 (49%), Gaps = 45/357 (12%)
Query: 5 CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
G LV+ + F P K HASTIVE G + A+F G+ E PDV I + ++
Sbjct: 28 AGEGALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W + + P WNPVLF+ L+LFYK+G ++W G + S + G
Sbjct: 87 GQWTESVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146
Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
TWS +L +LGP KNKPI L +G +LC SS E + W EVT D
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPIELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206
Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
G++W GPI E+ IQP + L++L RS R+ S S DGG TWS
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDDRLQILCRSKEK--RIVESWSKDGGKTWSTL 263
Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
T L NPNSG D V L+DGR +L YN R +L +A+S DG +W
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRREDAPDLKPRRILNLAISS-DGKTWK 322
Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LTLE E+SYPA+IQ SDG +++ YTYNR +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDADRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379
>gi|298481333|ref|ZP_06999526.1| alpha-rhamnosidase [Bacteroides sp. D22]
gi|298272537|gb|EFI14105.1| alpha-rhamnosidase [Bacteroides sp. D22]
Length = 1286
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTAPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|237720806|ref|ZP_04551287.1| glycoside hydrolase family 78 protein [Bacteroides sp. 2_2_4]
gi|229449641|gb|EEO55432.1| glycoside hydrolase family 78 protein [Bacteroides sp. 2_2_4]
Length = 1286
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--GNASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGTKPIYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|383113010|ref|ZP_09933790.1| hypothetical protein BSGG_0147 [Bacteroides sp. D2]
gi|382948894|gb|EFS29447.2| hypothetical protein BSGG_0147 [Bacteroides sp. D2]
Length = 1290
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 130/381 (34%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 913 AILSDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 970
Query: 67 WQSPIIA-------------------------DEEPNV-----PMWNPVLFKLPSNGLLL 96
W +P +A DE+ + WNPVLF++P L+L
Sbjct: 971 WTAPKLAADGVFSIKDSQAVLAGIDSTCTPVKDEKGKLIARRKACWNPVLFQIPGGDLIL 1030
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1031 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1089
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1090 NGWRVHFEISDDKGKTWKMIGPLDAELSVPTQNRKKGGMNVDDQEGGEAIRGEGAKPVYA 1149
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L+VL R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1150 IQPSILKHKDGRLQVLCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1207
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1208 GRHILIYNNFSTLPGTPKGPRTPLCVAIS-EDGINWKPVLTLEDSPISQYSYPSIIQGKD 1266
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1267 GKLHAIYTWRRQRIKYAEIDP 1287
>gi|160886119|ref|ZP_02067122.1| hypothetical protein BACOVA_04126 [Bacteroides ovatus ATCC 8483]
gi|423286899|ref|ZP_17265750.1| hypothetical protein HMPREF1069_00793 [Bacteroides ovatus CL02T12C04]
gi|156108932|gb|EDO10677.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides ovatus
ATCC 8483]
gi|392674437|gb|EIY67885.1| hypothetical protein HMPREF1069_00793 [Bacteroides ovatus CL02T12C04]
Length = 1286
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|262408006|ref|ZP_06084554.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294808089|ref|ZP_06766862.1| BNR/Asp-box repeat protein [Bacteroides xylanisolvens SD CC 1b]
gi|345511520|ref|ZP_08791060.1| glycoside hydrolase family 78 [Bacteroides sp. D1]
gi|229444052|gb|EEO49843.1| glycoside hydrolase family 78 [Bacteroides sp. D1]
gi|262354814|gb|EEZ03906.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294444730|gb|EFG13424.1| BNR/Asp-box repeat protein [Bacteroides xylanisolvens SD CC 1b]
Length = 1286
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|293369107|ref|ZP_06615702.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CMC 3f]
gi|292635819|gb|EFF54316.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CMC 3f]
Length = 1286
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 967 WTAPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|294645187|ref|ZP_06722909.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CC 2a]
gi|292639474|gb|EFF57770.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CC 2a]
Length = 545
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 168 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 225
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 226 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 285
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 286 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 344
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 345 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 404
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 405 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 462
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 463 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 521
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 522 GKLHAIYTWRRQRIKYAEIDP 542
>gi|421611775|ref|ZP_16052906.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
gi|408497418|gb|EKK01946.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
Length = 361
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 180/342 (52%), Gaps = 31/342 (9%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
+D G++ ++F + + APF SCHASTI + G A+FGG EGA DV IW+
Sbjct: 26 QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82
Query: 63 KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
W +P ++AD + P WNPVLF PS LF K+G + W G + S +
Sbjct: 83 DGNSWSAPRMVADGVQPDGSRHPCWNPVLFHAPSGVTWLFIKVGPSPKDWWGEVLFSDDS 142
Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
G T+ +R +LP G+LGP + KP L++ G LLCGSS E + W E G++
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDQGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201
Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
W + I+ E + IQP A+ LR L R+ + + ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
T + NPNSGID V KDGR L N + R L +++S DG ++ +
Sbjct: 259 KPIATDMPNPNSGIDIVTAKDGRQWLISNPLPSKENGWGRRSRLTLSVST-DGKTYREVA 317
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
LE EFSYPA+IQA DG +H+TYT+ R +IKHV+ P+
Sbjct: 318 ALENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPS 359
>gi|423298606|ref|ZP_17276662.1| hypothetical protein HMPREF1070_05327 [Bacteroides ovatus CL03T12C18]
gi|392662349|gb|EIY55910.1| hypothetical protein HMPREF1070_05327 [Bacteroides ovatus CL03T12C18]
Length = 1286
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 129/381 (33%), Positives = 191/381 (50%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR- 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ + KD +
Sbjct: 909 AILSDEFLY--KKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKDSKE 966
Query: 67 WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
W +P +A D + + WNPVLF++P L+L
Sbjct: 967 WTAPQLAADGVFSLKDSQAALAGIDSTCTPVKNEKGKLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|336402501|ref|ZP_08583236.1| hypothetical protein HMPREF0127_00549 [Bacteroides sp. 1_1_30]
gi|335948350|gb|EGN10064.1| hypothetical protein HMPREF0127_00549 [Bacteroides sp. 1_1_30]
Length = 1286
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 127/381 (33%), Positives = 186/381 (48%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------------------------------EPNVPMWNPVLFKLPSNGLLL 96
W +P +A + WNPVLF++P L+L
Sbjct: 967 WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|295084330|emb|CBK65853.1| Alpha-L-rhamnosidase N-terminal domain./BNR/Asp-box repeat./Bacterial
alpha-L-rhamnosidase. [Bacteroides xylanisolvens XB1A]
Length = 1286
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 127/381 (33%), Positives = 186/381 (48%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++ +EF + +A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966
Query: 67 WQSPIIADE------------------------------EPNVPMWNPVLFKLPSNGLLL 96
W +P +A + WNPVLF++P L+L
Sbjct: 967 WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283
>gi|336416259|ref|ZP_08596595.1| hypothetical protein HMPREF1017_03703 [Bacteroides ovatus 3_8_47FAA]
gi|335938990|gb|EGN00869.1| hypothetical protein HMPREF1017_03703 [Bacteroides ovatus 3_8_47FAA]
Length = 1286
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 128/379 (33%), Positives = 188/379 (49%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + S P CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 909 AILSDEFLYKKTSFP--ECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKLKGSKE 966
Query: 67 WQSPIIA-------------------------DEEPNV-----PMWNPVLFKLPSNGLLL 96
W +P +A DE+ + WNPVLF++P L+L
Sbjct: 967 WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDEQGKLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVL 318
G +H YT+ R +IK+ +
Sbjct: 1263 GKLHAIYTWRRQRIKYAEI 1281
>gi|312129645|ref|YP_003996985.1| neuraminidase [Leadbetterella byssophila DSM 17132]
gi|311906191|gb|ADQ16632.1| putative neuraminidase [Leadbetterella byssophila DSM 17132]
Length = 336
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 121/325 (37%), Positives = 177/325 (54%), Gaps = 28/325 (8%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
V +E PA P S HAST+VE+ G+ L A+FGG EG P V I+L F W
Sbjct: 21 VVQEHQIPA---PTASVHASTLVELQNGNLLAAWFGGIEEGDPSVGIYLSRFNGKVWSE- 76
Query: 71 IIADE--------EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
A+E + P WNPVL + N L L+YK+G ++W +K S ++G +WS
Sbjct: 77 --AEEVVLPSQAGDTTYPCWNPVLVRNKENTLFLYYKVGPNPREWWAQVKTSNDEGKSWS 134
Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIY 180
E LP G +GP + KP++L NG L SS E+ + W + EV+ + G +W+ +
Sbjct: 135 EARDLPEGHMGPVRVKPVMLANGDFLYPSSTETPETDYWSAHFEVSDNEGNNWQM---VP 191
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ ++ +IQP + + +L RS + RV + S + G +W+ T L NPN+G
Sbjct: 192 MDCDTFQIIQPTILEFSEGKMLMLARSRH--NRVIAAYSHNYGNSWTKPFATNLPNPNAG 249
Query: 241 IDGVKLKDGRLLLAYNTVS------RGVLKVAL-SKDDGDSWHDALTLEENLAMEFSYPA 293
ID VK+ R L+ YN + G K+ L S DG W + LTLE++ EFSYPA
Sbjct: 250 IDAVKVGKNRFLMVYNPLLAGKNWWEGRSKLVLGSSHDGLDWKEILTLEDHEKGEFSYPA 309
Query: 294 VIQASDGSVHITYTYNRTQIKHVVL 318
+I+ DG +HI+YTY+R IKH++L
Sbjct: 310 IIRTKDGKIHISYTYDRKFIKHLIL 334
>gi|299145502|ref|ZP_07038570.1| putative alpha-rhamnosidase [Bacteroides sp. 3_1_23]
gi|298515993|gb|EFI39874.1| putative alpha-rhamnosidase [Bacteroides sp. 3_1_23]
Length = 1286
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 128/379 (33%), Positives = 190/379 (50%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR- 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ + KD +
Sbjct: 909 AILSDEFLY--KKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKDSKE 966
Query: 67 WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
W +P +A D + + WNPVLF++P L+L
Sbjct: 967 WTAPQLAADGVFSLKDSQAALAGIDSTCTPVKNEKGKLIARRKACWNPVLFQIPGGDLIL 1026
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085
Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
N W E++ D G++W+ GP+ +P ++ +
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N +V + S D G TWS + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262
Query: 300 GSVHITYTYNRTQIKHVVL 318
G +H YT+ R +IK+ +
Sbjct: 1263 GKLHAIYTWRRQRIKYAEI 1281
>gi|373853224|ref|ZP_09596023.1| BNR/Asp-box repeat domain protein [Opitutaceae bacterium TAV5]
gi|372472751|gb|EHP32762.1| BNR/Asp-box repeat domain protein [Opitutaceae bacterium TAV5]
Length = 337
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 131/330 (39%), Positives = 176/330 (53%), Gaps = 31/330 (9%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPI 71
EF F +AP SCHAST+ E G + A+FGG E PDV IWL G+W P+
Sbjct: 13 EFIF--ETAPTPSCHASTLAETPHG-LVAAWFGGKRECFPDVGIWLARLDAATGQWTEPV 69
Query: 72 -IAD------EEPNV---PMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
+A+ E+ V WNPVLF+ P L+LFYK+G+ Q W G M S + G
Sbjct: 70 EVANGIQHTTEDGQVLRYACWNPVLFQPRHTPGAPLVLFYKVGKTPQTWWGMMTTSADGG 129
Query: 119 VTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKY 176
TW+ +LP GI GP KNKP+ L +G LC +S E W E T DAGR+W +
Sbjct: 130 HTWASHRRLPEGICGPVKNKPVELPDGTWLCPASTEESVTTGWRVHFERTADAGRTWTRT 189
Query: 177 GPIYIPNESLS--VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
P+ N+ + IQP + H R L+ + R+ + R+ ++S D G TWS
Sbjct: 190 APV---NDGIDYGAIQPSILRHDGGRRLQAIGRT-QLVKRLFSTDSLDAGRTWSEIAFLD 245
Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
+ NP+SG D V L DGR L N R L +ALS DG W L LE E+SY
Sbjct: 246 VPNPDSGTDAVTLADGRHALICNPTESGRSPLSLALSS-DGLRWTRVLDLETGPG-EYSY 303
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
PA+IQ +DG +H+T+T+ R +IKH + P+
Sbjct: 304 PAIIQTADGHLHMTWTWKRERIKHAAIDPS 333
>gi|225165905|ref|ZP_03727673.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
gi|224799856|gb|EEG18317.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
Length = 381
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 172/323 (53%), Gaps = 34/323 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPI-IAD------EE 76
SCHASTIVE +G + A+FGG E PDV IWL GRW +P+ +A+ E+
Sbjct: 60 SCHASTIVETTQGELVAAWFGGKRECFPDVGIWLSRRDAITGRWSAPVEVANGIQHTTED 119
Query: 77 PNV---PMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
N WNPVLF+ P L+LFYK+G+ Q W G M S + G TW+ +LP G
Sbjct: 120 GNTLRYACWNPVLFQPRHTPGAPLVLFYKVGKTPQTWWGMMTTSTDGGRTWTAHRRLPEG 179
Query: 131 ILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I GP KNKPI L +G LC +S E W E T DAGR+W + P+ N+ +
Sbjct: 180 ICGPVKNKPIELPDGTWLCPASTEESVTTGWRVHFERTDDAGRTWTRTAPV---NDGIDY 236
Query: 188 -VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSE------SCDGGLTWSYAKPTQLLNPNS 239
IQP + H A + R IGR +++ S D G TWS + NP+S
Sbjct: 237 GAIQPSILMHPAPASTTPPAR-LQAIGRTQLAKRLFSTTSLDAGRTWSEITFLDVPNPDS 295
Query: 240 GIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
G D V L+D R L YN R L +ALS +DG W L LE E+SYPA+IQ
Sbjct: 296 GTDAVTLRDSRHALIYNPTESGRSPLSLALS-NDGLHWTRILDLETTPG-EYSYPAIIQT 353
Query: 298 SDGSVHITYTYNRTQIKHVVLLP 320
+DG +H+T+T+ R +IKH + P
Sbjct: 354 TDGLLHMTWTWRRERIKHASIDP 376
>gi|255691747|ref|ZP_05415422.1| putative alpha-rhamnosidase [Bacteroides finegoldii DSM 17565]
gi|260622633|gb|EEX45504.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides
finegoldii DSM 17565]
Length = 1289
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 125/381 (32%), Positives = 188/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 912 AILSDEFLY--EKAAFPECHGATIVEMKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 970 WSAPKLAADGVFLLKDPQAILAGIDSTCTPVKDAKGKLIARRKACWNPVLFQVPGGDLIL 1029
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP ++ G ++C SS E
Sbjct: 1030 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPKGFLGPIKNKPEYID-GRIICPSSTEGS 1088
Query: 157 NSWGSWMEVTVDAGRSWRKYGPIY----IPNES------------------------LSV 188
W E++ D G++W+ GP+ +P ++ +
Sbjct: 1089 KGWRVHFEISDDKGKTWKMVGPLEAELSVPTQNRKKGGINTDDQEGGEAIKGEGAKPIYA 1148
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N ++ + S D G TWS + N NSG D V +KD
Sbjct: 1149 IQPSILKHKDGKLQILCRTRNA--QIATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1206
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG SW LTLE++ ++SYP++I+ D
Sbjct: 1207 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGISWKPVLTLEDSPISQYSYPSIIEGKD 1265
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1266 GKLHAIYTWRRQRIKYAEIDP 1286
>gi|423299817|ref|ZP_17277842.1| hypothetical protein HMPREF1057_00983 [Bacteroides finegoldii
CL09T03C10]
gi|408473626|gb|EKJ92148.1| hypothetical protein HMPREF1057_00983 [Bacteroides finegoldii
CL09T03C10]
Length = 1289
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/381 (32%), Positives = 188/381 (49%), Gaps = 75/381 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 912 AILSDEFLY--EKAAFPECHGATIVEMKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +P + WNPVLF++P L+L
Sbjct: 970 WSAPKLAADGVFSLKDPQAILAGIDSTCTPVKDVKGKLIAHRKACWNPVLFQVPGGDLIL 1029
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG +V W+G + RS + G TWS+RE LP G LGP KNKP ++ G ++C SS E
Sbjct: 1030 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPKGFLGPIKNKPEYID-GRIICPSSTEGS 1088
Query: 157 NSWGSWMEVTVDAGRSWRKYGPIY----IPNES------------------------LSV 188
W E++ D G++W+ GP+ +P ++ +
Sbjct: 1089 KGWRVHFEISDDKGKTWKMVGPLEAELSVPTQNRKKGGINTDDQEGREAIKGEDAKPIYA 1148
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L++L R+ N ++ + S D G TWS + N NSG D V +KD
Sbjct: 1149 IQPSILKHKDGKLQILCRTRNA--QIATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1206
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG SW LTLE++ ++SYP++I+ D
Sbjct: 1207 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGISWKPVLTLEDSPISQYSYPSIIEGKD 1265
Query: 300 GSVHITYTYNRTQIKHVVLLP 320
G +H YT+ R +IK+ + P
Sbjct: 1266 GKLHAIYTWRRQRIKYAGIDP 1286
>gi|386724698|ref|YP_006191024.1| hypothetical protein B2K_21575 [Paenibacillus mucilaginosus K02]
gi|384091823|gb|AFH63259.1| hypothetical protein B2K_21575 [Paenibacillus mucilaginosus K02]
Length = 332
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 122/321 (38%), Positives = 176/321 (54%), Gaps = 25/321 (7%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPI 71
+E+ F + PF SCHASTI + G + A+FGGS EGA DV IW+ G W +P
Sbjct: 12 KEYIF-EDERPFASCHASTIELLPDGGAVAAWFGGSKEGAGDVAIWVAHRGAQGGWTAPR 70
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
A P P WNPVLF+ LLL+YK G ++ WS + S + G TWS +L PG
Sbjct: 71 QAAGYPATPHWNPVLFRSEDGTLLLYYKTGIRIEHWSTLVMESTDDGYTWSGPAELVPGD 130
Query: 132 ---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
GP KNKP+ L +G + +S+E +W S+++++ D GR+W + + +P + +
Sbjct: 131 RGGRGPVKNKPVRLADGTIAAPASLEP--AWDSFVDLSPDGGRTWNRSDTVPMPADRSAF 188
Query: 188 ----VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+IQP + +A + +L RS G + S+S DGG TW A T L N NSG+D
Sbjct: 189 LGKGIIQPSLWESAPGCVHMLTRSTAGA--IYRSDSADGGRTWCPAYATSLPNNNSGLDV 246
Query: 244 VKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
VKL DG L L N R L + LS+D+G SW+ ++E ++SYPA+
Sbjct: 247 VKLADGTLALVCNPTRPEPGKLKGPRSPLVLLLSQDNGVSWYGETVIDEG-EKQYSYPAI 305
Query: 295 IQASDGSVHITYTYNRTQIKH 315
+ A G VH+TYT+ R +I +
Sbjct: 306 V-AQGGRVHVTYTWKRERIAY 325
>gi|337748986|ref|YP_004643148.1| hypothetical protein KNP414_04748 [Paenibacillus mucilaginosus
KNP414]
gi|379721955|ref|YP_005314086.1| hypothetical protein PM3016_4159 [Paenibacillus mucilaginosus 3016]
gi|336300175|gb|AEI43278.1| hypothetical protein KNP414_04748 [Paenibacillus mucilaginosus
KNP414]
gi|378570627|gb|AFC30937.1| hypothetical protein PM3016_4159 [Paenibacillus mucilaginosus 3016]
Length = 332
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 121/321 (37%), Positives = 176/321 (54%), Gaps = 25/321 (7%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPI 71
+E+ F + PF SCHASTI + G + A+FGGS EGA DV IW+ G W +P
Sbjct: 12 KEYIF-EDERPFASCHASTIELLPDGGAVAAWFGGSKEGAGDVAIWVAHRGAQGGWTAPR 70
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+A P P WNPVLF+ LLL+YK G ++ WS + +S + G TWS +L PG
Sbjct: 71 LAAGYPGTPHWNPVLFRREDGTLLLYYKTGTRIEHWSTLVMKSTDDGHTWSGPAELVPGD 130
Query: 132 ---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
GP KNKP+ L +G + +S+E +W S+++++ D GR+W + G + +P + +
Sbjct: 131 RGGRGPVKNKPVRLADGTIAAPASLEP--AWDSFVDLSPDGGRTWTRSGDVPMPADRSAF 188
Query: 188 ----VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+IQP + + + +L RS G V S+S DGG TW A T L N NSG+D
Sbjct: 189 LGKGIIQPSLWESEPGCVHMLTRSTAGA--VYRSDSADGGRTWCPAYATSLPNNNSGLDV 246
Query: 244 VKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
KL DG L L N R L + LS+D+G SW+ ++E ++SYPA+
Sbjct: 247 AKLGDGTLALVCNPTRPEPGKLKGPRSPLVLLLSQDNGVSWYGETVIDEG-EKQYSYPAI 305
Query: 295 IQASDGSVHITYTYNRTQIKH 315
+ A VH+TYT+ R +I +
Sbjct: 306 V-ARGSRVHVTYTWKRERIAY 325
>gi|375254465|ref|YP_005013632.1| BNR/Asp-box repeat-containing protein [Tannerella forsythia ATCC
43037]
gi|363406763|gb|AEW20449.1| BNR/Asp-box repeat protein [Tannerella forsythia ATCC 43037]
Length = 351
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 174/316 (55%), Gaps = 20/316 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI----- 71
F + PF+SCH+ST+VE++ G+ L A+FGG EG+ DV IWL G W SP
Sbjct: 37 FVFENPPFQSCHSSTVVEMENGNVLCAWFGGKHEGSNDVSIWLSELSSGTWTSPTLVADG 96
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
I ++ WNPVL++ L LFYK+G W G K S + G TWS+ +LP GI
Sbjct: 97 IQNDTTRYACWNPVLYR-DKTKLFLFYKVGTSPSTWWGEYKVSTDDGKTWSKVAKLPDGI 155
Query: 132 LGPSKNKPILLENGLLLCGSSVESWNS-WGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
LGP KNK I + +L S+ + N+ W S++E++ W+ PI + S VIQ
Sbjct: 156 LGPIKNKAIRIGKTVLHPSSTETAGNTRWKSFIELSDRNLTQWKSV-PIDTAS-SYKVIQ 213
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
P ++ L+RS + + S S D G +W+ T +L+PNSGID V L +G
Sbjct: 214 PTLLCYGENRIQALLRSDS--NEILQSWSNDNGQSWTPITGTGILHPNSGIDAVGLSNGV 271
Query: 251 LLLAYNTVSRG--------VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
LL YN +G L VA S DG +W+D L LE++ EFSYPA+I+ + +
Sbjct: 272 KLLVYNPAIKGENWWDGRSRLTVAESV-DGLTWNDILILEKHEKGEFSYPAIIRLKNDDI 330
Query: 303 HITYTYNRTQIKHVVL 318
ITYTYNR +IK+V +
Sbjct: 331 LITYTYNRLKIKYVFI 346
>gi|150005175|ref|YP_001299919.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149933599|gb|ABR40297.1| glycoside hydrolase family 78 [Bacteroides vulgatus ATCC 8482]
Length = 1321
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 936 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 994 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315
>gi|423311882|ref|ZP_17289819.1| hypothetical protein HMPREF1058_00431 [Bacteroides vulgatus
CL09T03C04]
gi|392689261|gb|EIY82539.1| hypothetical protein HMPREF1058_00431 [Bacteroides vulgatus
CL09T03C04]
Length = 1324
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMNVPTALRKANAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|294777784|ref|ZP_06743230.1| BNR/Asp-box repeat protein [Bacteroides vulgatus PC510]
gi|294448372|gb|EFG16926.1| BNR/Asp-box repeat protein [Bacteroides vulgatus PC510]
Length = 1324
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKFRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKENAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|265750680|ref|ZP_06086743.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_33FAA]
gi|263237576|gb|EEZ23026.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_33FAA]
Length = 1321
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 936 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 994 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315
>gi|212694641|ref|ZP_03302769.1| hypothetical protein BACDOR_04172 [Bacteroides dorei DSM 17855]
gi|212663142|gb|EEB23716.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides dorei DSM
17855]
Length = 1324
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|423239192|ref|ZP_17220308.1| hypothetical protein HMPREF1065_00931 [Bacteroides dorei CL03T12C01]
gi|392646979|gb|EIY40685.1| hypothetical protein HMPREF1065_00931 [Bacteroides dorei CL03T12C01]
Length = 1321
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 936 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 994 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315
>gi|319642682|ref|ZP_07997328.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_40A]
gi|345520277|ref|ZP_08799675.1| glycoside hydrolase family 78 [Bacteroides sp. 4_3_47FAA]
gi|254836104|gb|EET16413.1| glycoside hydrolase family 78 [Bacteroides sp. 4_3_47FAA]
gi|317385770|gb|EFV66703.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_40A]
Length = 1324
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKENAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|380695289|ref|ZP_09860148.1| alpha-rhamnosidase [Bacteroides faecis MAJ27]
Length = 1289
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 186/382 (48%), Gaps = 75/382 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 911 AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKAKGAKE 968
Query: 67 WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
W +P +A + +PN + WNPVLF++P L+L
Sbjct: 969 WTAPKLAADGVFALNDPNAEIAGIDTACTPVKDTKGKLTARRKACWNPVLFQIPGGDLIL 1028
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1029 FYKIGLSVADWTGWLVRSKDGGKTWSKREALPQGFLGPIKNKPEYI-NGRIICPSSREGK 1087
Query: 157 NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
W E + D G++W RK G I + ++ +
Sbjct: 1088 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGKGAQPILA 1147
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L+VL R+ N + + + S D G TW+ + + N NSG D V + D
Sbjct: 1148 IQPSILKHKDGRLQVLCRTRNAL--LATAWSSDNGETWNKVTLSNVPNNNSGTDAVTMSD 1205
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG W LTLE++ ++SYP++IQ D
Sbjct: 1206 GRHILIYNNFSTQPGTPKGPRTPLCVAIS-EDGIHWKPVLTLEDSPISQYSYPSIIQGKD 1264
Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
G +H YT+ R +IK+ + P+
Sbjct: 1265 GKLHAIYTWRRQRIKYAEIDPS 1286
>gi|237711094|ref|ZP_04541575.1| glycoside hydrolase family 78 protein [Bacteroides sp. 9_1_42FAA]
gi|229454938|gb|EEO60659.1| glycoside hydrolase family 78 protein [Bacteroides sp. 9_1_42FAA]
Length = 1305
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 127/386 (32%), Positives = 184/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 920 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 977
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 978 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1037
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP ++ G ++C
Sbjct: 1038 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1096
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
SS E W E+ D G++W+ GP+ +P
Sbjct: 1097 PSSTEGDGGWRIHFEILDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1156
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1157 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1214
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1215 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1273
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1274 SIIQGKDGKLHAVYTWRRQRVAYKEL 1299
>gi|187918984|ref|YP_001888015.1| hypothetical protein Bphyt_4268 [Burkholderia phytofirmans PsJN]
gi|187717422|gb|ACD18645.1| conserved hypothetical protein [Burkholderia phytofirmans PsJN]
Length = 366
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 175/325 (53%), Gaps = 22/325 (6%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
+V+ E P + F CHAST V +GH + A+ GS E A DV +W+ F +W
Sbjct: 34 AIVSTEVVNPPQNIAF--CHAST-VAFSQGHLVAAWLAGSKEAANDVGVWVARFSGNQWS 90
Query: 69 SPI-IAD-EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
P+ +AD P+ + + NP+LF L+LFY+ G+ W S + G TW++
Sbjct: 91 PPVRVADGRSPDGEALTVINPILFSPKRGPLMLFYRRGKLPADWHPLRMTSLDGGATWTK 150
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPIYIP 182
L PGI GP+K+KP+ L NG+++ GSS E ++ W E ++D G +W Y + +P
Sbjct: 151 PVALDPGISGPAKDKPVELSNGVVIAGSSTE-YDGWKIHFERSMDGGNTWHVVYPAVGLP 209
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ IQP + L+ L+R+ +G V ++S D G TWS + N NSG+D
Sbjct: 210 --TVQAIQPTILDHRHGQLQALVRTKSGF--VFSTKSSDWGKTWSALARLDIPNSNSGLD 265
Query: 243 GVKLKDGRLLLAYNTV-------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
V L DGR L+ N + R L V +S D ++ D L LE EFSYPAVI
Sbjct: 266 AVTLTDGRDLIVTNPLPYVEGRWDRHKLSVLISADH-QTYRDVLDLENEAGQEFSYPAVI 324
Query: 296 QASDGSVHITYTYNRTQIKHVVLLP 320
Q+ DG VHITYT+ + IKHVVL P
Sbjct: 325 QSPDGMVHITYTWKKIYIKHVVLDP 349
>gi|345515271|ref|ZP_08794777.1| glycoside hydrolase family 78 protein [Bacteroides dorei 5_1_36/D4]
gi|345455881|gb|EEO44380.2| glycoside hydrolase family 78 protein [Bacteroides dorei 5_1_36/D4]
Length = 1324
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE LP G LGP KNKP +++ ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVDD-RIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|224535729|ref|ZP_03676268.1| hypothetical protein BACCELL_00593, partial [Bacteroides
cellulosilyticus DSM 14838]
gi|224522654|gb|EEF91759.1| hypothetical protein BACCELL_00593 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1003
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 127/379 (33%), Positives = 184/379 (48%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 625 AILSDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 682
Query: 67 WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
W SP +A D + + WNPVLF++P L+L
Sbjct: 683 WTSPQLAADGVFSIKDAQATLAGIDSTCTPVTNARGKLVARRKACWNPVLFQIPGGDLIL 742
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG V W+G + RS + G TWS+RE LP G LGP KNKP + NG ++C SS E
Sbjct: 743 FYKIGLNVGDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSREGG 801
Query: 157 NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
W E++ D G++W RK G + + ++ +
Sbjct: 802 KGWRIHFEISDDNGKTWKTTESLAAEFSVLTQHRKKGSVNVDDQEGGEAVKGEGPKPIYA 861
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L+VL R+ N R+ + S D G TWS + N NSG D V L+D
Sbjct: 862 IQPSILMHKDGRLQVLCRTRN--ARIATAWSNDNGDTWSKVTLLDVPNNNSGTDAVTLQD 919
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L VA+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 920 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 978
Query: 300 GSVHITYTYNRTQIKHVVL 318
G +H YT+ R ++K+ +
Sbjct: 979 GRLHAVYTWRRQRVKYAEI 997
>gi|423228041|ref|ZP_17214447.1| hypothetical protein HMPREF1063_00267 [Bacteroides dorei CL02T00C15]
gi|423243304|ref|ZP_17224380.1| hypothetical protein HMPREF1064_00586 [Bacteroides dorei CL02T12C06]
gi|392637077|gb|EIY30952.1| hypothetical protein HMPREF1063_00267 [Bacteroides dorei CL02T00C15]
gi|392645311|gb|EIY39039.1| hypothetical protein HMPREF1064_00586 [Bacteroides dorei CL02T12C06]
Length = 1324
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 184/386 (47%), Gaps = 82/386 (21%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
G+V ++F + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 939 GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996
Query: 67 WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
W +P +A + +P + WNPVLF++
Sbjct: 997 WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056
Query: 90 PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
P L+LFYKIG +V WSG + RS + G TWS+RE L G LGP KNKP ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLLKGFLGPIKNKPEYVD-GRIIC 1115
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
SS E W E++ D G++W+ GP+ +P
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175
Query: 184 --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ + IQP + L+VL R+ N ++ S S D G TWS + N NSG
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233
Query: 242 DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
D V +KDGR +L YN S R L VA+S DDG W + +TLE++ ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
++IQ DG +H YT+ R ++ + L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318
>gi|242241312|ref|YP_002989493.1| hypothetical protein Dd703_3918 [Dickeya dadantii Ech703]
gi|242133369|gb|ACS87671.1| conserved hypothetical protein [Dickeya dadantii Ech703]
Length = 350
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 173/345 (50%), Gaps = 43/345 (12%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
V ++F P + F +CHASTI + G A+F G EG+ D IWL ++ G+W +P
Sbjct: 6 VKQQFVLPTDHRYFGNCHASTIAALPDGRLRAAWFAGEKEGSGDTAIWLANYQQGQWHAP 65
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
+ E +P WNPVL + L LFYK+G +V W + S + G++WS L PG
Sbjct: 66 VRLAWEDGLPHWNPVLHR-QDGALWLFYKVGADVHHWQTRVMVSGDDGLSWSAPRMLVPG 124
Query: 131 I---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL- 186
GP KNK +++ NG L SVE W ++++++ D G W+ PI + +++
Sbjct: 125 DSAPRGPVKNKLLVMSNGEWLAPGSVEDDRDWDAFVDISGDRGEHWQAT-PIPLAHQAPG 183
Query: 187 ----------------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
VIQP + +A + VLMRS G + S+
Sbjct: 184 DGQHDALWQGLEHAALWENDLTRVFQWDGVIQPSAWESAPGQVHVLMRSTRGA--IYRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
S D G W A T L N NSGID V L GRL+L YN V SR L A S D+G+
Sbjct: 242 SPDYGRRWCDAYATALPNNNSGIDVVHLGAGRLVLVYNPVTGNWRSRYPLTAAYSADNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
W + + LE+ EFSYPA+I D ++H+TYT+NR I + L
Sbjct: 302 HWENLIDLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCAL 344
>gi|440751400|ref|ZP_20930633.1| hypothetical protein C943_3326 [Mariniradius saccharolyticus AK6]
gi|436480042|gb|ELP36307.1| hypothetical protein C943_3326 [Mariniradius saccharolyticus AK6]
Length = 348
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 176/319 (55%), Gaps = 16/319 (5%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V EF F + PF CHASTI ++ G +++A+F GS E DV IW K G W +
Sbjct: 25 IVKCEFIF-GDERPFPQCHASTIEKLGNGEYILAWFAGSHEKNDDVAIWTAKGKPGAWTA 83
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P + N P WNPVLF P L+L++K+G+E+ W + S +KG TWS ++L
Sbjct: 84 PKKLAKVRNDPHWNPVLFSAPDGRLVLYFKVGKEIDDWETWVMESLDKGETWSAPKELVA 143
Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
G GP +N ++L +G L +SVE W ++++ + DAG+SW++ + + +
Sbjct: 144 GDRGGRGPVRNHILVLSDGTWLAPASVEKNRVWNAFVDFSTDAGKSWQRTDTLALDRHLV 203
Query: 187 ---SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
VIQP + + + +L+RS G++ +ES D G TWS T L N NSGID
Sbjct: 204 LGEGVIQPALWESEPGHIHMLLRS--STGKIARAESKDFGRTWSPVYLTDLPNNNSGIDL 261
Query: 244 VKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQA 297
VK+ G + L YN VS R +++ALS D+G +W +E EFSYP++I+
Sbjct: 262 VKMPSGVIALVYNPVSGNWGKRYPMQLALSLDNGVTWPIQQEIEAGSGEDEFSYPSIIE- 320
Query: 298 SDGSVHITYTYNRTQIKHV 316
+G + YT+NR +I+ V
Sbjct: 321 DEGDLVFCYTWNRQRIRFV 339
>gi|29346423|ref|NP_809926.1| alpha-rhamnosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|29338319|gb|AAO76120.1| putative alpha-rhamnosidase [Bacteroides thetaiotaomicron VPI-5482]
Length = 1290
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 182/379 (48%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 912 AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969
Query: 67 WQSPIIADE-------------------EPNVPM-----------WNPVLFKLPSNGLLL 96
W +P +A + P V WNPVLF++P L+L
Sbjct: 970 WTAPKLAADGVFSLKDSQAALAGIDSTCTPVVDAKGKLTARRKACWNPVLFQIPGGDLIL 1029
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG V W+G + RS + G TW +RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1030 FYKIGLNVGDWTGWLVRSKDGGKTWGKREALPEGFLGPIKNKPEYI-NGRIICPSSREGK 1088
Query: 157 NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
W E + D G++W RK G I + ++ +
Sbjct: 1089 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGEGAQPILA 1148
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L+VL R+ N ++ S S D G TWS + + N NSG D V + D
Sbjct: 1149 IQPSILKHKDGRLQVLCRTRN--AKIATSWSSDNGETWSKVTLSNVPNNNSGTDAVTMSD 1206
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L +A+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1207 GRHILIYNHFSTLPGTPKGPRTPLCIAIS-EDGINWKPILTLEDSPISQYSYPSIIQGKD 1265
Query: 300 GSVHITYTYNRTQIKHVVL 318
G +H YT+ R +IK+ +
Sbjct: 1266 GKLHAIYTWRRQRIKYTEI 1284
>gi|383122653|ref|ZP_09943345.1| hypothetical protein BSIG_0602 [Bacteroides sp. 1_1_6]
gi|382984354|gb|EES70329.2| hypothetical protein BSIG_0602 [Bacteroides sp. 1_1_6]
Length = 1290
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 182/379 (48%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
++++EF + A F CH +TIVE+ G + ++FGG+ E PD IW+ G
Sbjct: 912 AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969
Query: 67 WQSPIIADE-------------------EPNVPM-----------WNPVLFKLPSNGLLL 96
W +P +A + P V WNPVLF++P L+L
Sbjct: 970 WTAPKLAADGVFSLKDSQAALAGIDSTCTPVVDAKGKLTARRKACWNPVLFQIPGGDLIL 1029
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
FYKIG V W+G + RS + G TW +RE LP G LGP KNKP + NG ++C SS E
Sbjct: 1030 FYKIGLNVGDWTGWLVRSKDGGKTWGKREALPEGFLGPIKNKPEYI-NGRIICPSSREGK 1088
Query: 157 NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
W E + D G++W RK G I + ++ +
Sbjct: 1089 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGEGAQPILA 1148
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + L+VL R+ N ++ S S D G TWS + + N NSG D V + D
Sbjct: 1149 IQPSILKHKDGRLQVLCRTRN--AKIATSWSSDNGETWSKVTLSNVPNNNSGTDAVTMSD 1206
Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
GR +L YN S R L +A+S +DG +W LTLE++ ++SYP++IQ D
Sbjct: 1207 GRHILIYNHFSTLPGTPKGPRTPLCIAIS-EDGINWKPILTLEDSPISQYSYPSIIQGKD 1265
Query: 300 GSVHITYTYNRTQIKHVVL 318
G +H YT+ R +IK+ +
Sbjct: 1266 GKLHAIYTWRRQRIKYTEI 1284
>gi|404451851|ref|ZP_11016803.1| hypothetical protein A33Q_21047 [Indibacter alkaliphilus LW1]
gi|403762440|gb|EJZ23505.1| hypothetical protein A33Q_21047 [Indibacter alkaliphilus LW1]
Length = 347
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 175/319 (54%), Gaps = 16/319 (5%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V+E F F + PF CHAST+ + G F++A+F G+ E A DV IW+ +DG W +
Sbjct: 26 VVSEGFVF-GDERPFDQCHASTLESLGDGQFMIAWFAGTEEKADDVGIWISKGRDGDWTA 84
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P + + WNPVLF P L L++K+G+E+ W + S + G TWSE +L
Sbjct: 85 PELVAKVREDAHWNPVLFNAPDGKLYLYFKVGKEIDHWETWVMESRDNGSTWSEPVELVE 144
Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
G GP +N ++L +G L +S+E W ++++ + D G++W + + E +
Sbjct: 145 GDRGGRGPVRNHVLVLFDGTWLAPASLEKNRVWNAFVDRSEDEGKTWVNSETLILDREKI 204
Query: 187 ---SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
VIQP + +A + +L+R+ G++ S+S DGGLTWS + T L N NSGID
Sbjct: 205 PGEGVIQPALWESAALYVHMLLRT--SAGKIGRSDSHDGGLTWSPVELTDLPNNNSGIDI 262
Query: 244 VKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEE-NLAMEFSYPAVIQA 297
+ L L YN V R +++A+S D+G +W +EE N EFSYP+++
Sbjct: 263 AHISGDTLALIYNPVGENWGKRYPMELAVSYDNGKTWPLKQVIEEGNGQDEFSYPSILY- 321
Query: 298 SDGSVHITYTYNRTQIKHV 316
+G + YT+NR +IK V
Sbjct: 322 DNGELVFCYTWNREKIKFV 340
>gi|271502586|ref|YP_003335612.1| hypothetical protein Dd586_4080 [Dickeya dadantii Ech586]
gi|270346141|gb|ACZ78906.1| conserved hypothetical protein [Dickeya dadantii Ech586]
Length = 348
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 176/346 (50%), Gaps = 43/346 (12%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
++ P F +CHAST+V + G VA+F G EG+ D IWL + + GRW P
Sbjct: 6 TTRQYVLPEGHDYFGNCHASTVVALPGGRLRVAWFAGEKEGSGDTAIWLASHEQGRWSPP 65
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
+ E + WNPVL + L LFYK+G +V W ++ S ++G +WS L PG
Sbjct: 66 VRVAWEDGLAHWNPVL-HWQAGTLWLFYKVGADVHHWQTRVQLSTDEGASWSAPRPLVPG 124
Query: 131 I---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN---- 183
GP KNK +++ NG L +SVE W ++++++ D G+ W+ PI + +
Sbjct: 125 DSAPRGPVKNKLLVMSNGEWLAPASVEDDRHWDAFVDLSADQGQRWQT-APIPLAHRLPG 183
Query: 184 ----ESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
E+L VIQP + ++ + VLMRS G + S+
Sbjct: 184 EHGREALWSGLEQAALWENDLTRVFQWDGVIQPSAWESSPGRVHVLMRSTRGA--LYRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G W A L N NSG+D V L GRL+L YN V+ R L VA S D+G+
Sbjct: 242 SDDYGRHWCEAYAVDLPNNNSGVDLVHLGAGRLVLVYNPVTGNWHRRYPLAVACSTDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
W DAL LE+ EFSYPA+I D ++H+TYT+NR I + L+
Sbjct: 302 HWQDALCLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCALI 345
>gi|420156344|ref|ZP_14663187.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
gi|394757642|gb|EJF40659.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
Length = 351
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 177/343 (51%), Gaps = 38/343 (11%)
Query: 1 MKKDCGIKGLVA--EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW 58
MK D I GL E+ + PF+SCHAS++ + G F +F GS EG DV IW
Sbjct: 1 MKTD--IAGLFRQIEKESILGEQPPFESCHASSLAILPNGDFFCVWFAGSREGEDDVAIW 58
Query: 59 LQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
G WQ P I ++ +P WNPVLF + LF+KIG+E+ W S++
Sbjct: 59 GARRSGGSWQKPQIL-QQNGLPHWNPVLFVQKDGSVWLFFKIGREISSWRTWASVSHDGC 117
Query: 119 VTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
TWS +L PG + GP +NK I+L +G L +S+E+ W S+ + + D G++W K
Sbjct: 118 HTWSPARELVPGDVGGRGPVRNKLIVLSSGRWLAPASLEN-GQWRSFADRSDDEGKTWSK 176
Query: 176 YGPIYIP-----NESLS---------------VIQPVPFHTANRTLRVLMRSFNGIGRVC 215
++IP +SL+ IQP + +A + +LMR+ G +
Sbjct: 177 SSEVFIPLAGTGTKSLTYQEIPVSEQSFTGRGAIQPTLWESAPGKVHMLMRTSEGF--IA 234
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKD 270
S+S DGG TW T L N NSGID + DGRL L YN V R L +ALS+D
Sbjct: 235 RSDSQDGGETWCEGYATSLPNNNSGIDLDRTADGRLFLVYNPVGESWGPRFPLHLALSED 294
Query: 271 DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
+G +W D TLEE E+SYP+V SD S+ +TYT R +
Sbjct: 295 NGQTWEDIYTLEEAEG-EYSYPSV-NYSDNSLFVTYTNCRKDV 335
>gi|307128843|ref|YP_003880859.1| hypothetical protein Dda3937_02530 [Dickeya dadantii 3937]
gi|306526372|gb|ADM96302.1| Putative cytoplasmic protein [Dickeya dadantii 3937]
Length = 374
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 122/347 (35%), Positives = 175/347 (50%), Gaps = 43/347 (12%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+ ++F P F +CHAST+ + +G VA+F G EG+ D IWL + GRW
Sbjct: 31 MTTQQFVLPEGHDYFGNCHASTVAVLPEGRLRVAWFAGEKEGSGDTAIWLACAEQGRWLQ 90
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P+ E V WNPVL + L LFYK+G +V W + S + G +WS L P
Sbjct: 91 PVRVAWEDGVAHWNPVL-HWQAGTLWLFYKVGADVHHWQTRVAVSTDDGASWSAPRLLVP 149
Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN--- 183
G GP KNK +++ NG L +SVE W ++++++ D GRSW+ PI + +
Sbjct: 150 GDSAPRGPVKNKLLVMSNGEWLAPASVEDDRHWDAFVDLSGDRGRSWQT-APIPLTHRTP 208
Query: 184 -----ESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
E+L VIQP + ++ + VLMRS G + S
Sbjct: 209 GERDGEALWQGLEQAALWENDLTRVFQWDGVIQPSAWESSPGQVHVLMRSTRGA--LYRS 266
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G W A L N NSG+D L GRL+L YN V+ R L VA S D+G
Sbjct: 267 DSDDYGRRWREAYAIDLPNNNSGVDLAHLGAGRLVLVYNPVTGNWRHRYPLTVACSTDNG 326
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
+ W +++ LE+ EFSYPA+I D ++H+TYT+NR I + L+
Sbjct: 327 EHWENSIDLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCALI 371
>gi|168467297|ref|ZP_02701139.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Newport str. SL317]
gi|418761271|ref|ZP_13317416.1| hypothetical protein SEEN185_01701 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|418768828|ref|ZP_13324872.1| hypothetical protein SEEN199_18734 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|418769767|ref|ZP_13325794.1| hypothetical protein SEEN539_09873 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|418775994|ref|ZP_13331943.1| hypothetical protein SEEN953_12207 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|418780335|ref|ZP_13336224.1| hypothetical protein SEEN188_02327 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|418786235|ref|ZP_13342055.1| hypothetical protein SEEN559_06356 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|418802426|ref|ZP_13358053.1| hypothetical protein SEEN202_07479 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
gi|419787803|ref|ZP_14313510.1| hypothetical protein SEENLE01_16150 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|419792175|ref|ZP_14317818.1| hypothetical protein SEENLE15_23177 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|195630395|gb|EDX49021.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Newport str. SL317]
gi|392619298|gb|EIX01683.1| hypothetical protein SEENLE01_16150 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 1]
gi|392619559|gb|EIX01943.1| hypothetical protein SEENLE15_23177 [Salmonella enterica subsp.
enterica serovar Newport str. Levine 15]
gi|392730828|gb|EIZ88068.1| hypothetical protein SEEN199_18734 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35199]
gi|392739213|gb|EIZ96352.1| hypothetical protein SEEN539_09873 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21539]
gi|392740889|gb|EIZ98004.1| hypothetical protein SEEN185_01701 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35185]
gi|392746627|gb|EJA03633.1| hypothetical protein SEEN953_12207 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 33953]
gi|392749249|gb|EJA06227.1| hypothetical protein SEEN559_06356 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 21559]
gi|392749385|gb|EJA06362.1| hypothetical protein SEEN188_02327 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35188]
gi|392777439|gb|EJA34122.1| hypothetical protein SEEN202_07479 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 35202]
Length = 347
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 171/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G CEG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G V W S ++G TWS L
Sbjct: 64 TPQRVVAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
G + GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I A G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344
>gi|406662497|ref|ZP_11070592.1| putative neuraminidase (sialidase) [Cecembia lonarensis LW9]
gi|405553570|gb|EKB48775.1| putative neuraminidase (sialidase) [Cecembia lonarensis LW9]
Length = 350
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 175/322 (54%), Gaps = 16/322 (4%)
Query: 7 IKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR 66
+ G+ EF F PF+ CHAS+I + G +++ +F G+ E DV IW+ + G
Sbjct: 29 VPGIEHFEFVFEG-EMPFEQCHASSIEVLGDGRYMMVWFAGTHEKNDDVGIWMAQGRPGN 87
Query: 67 WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ 126
W P + + N WNPVLF+ P+ L L++K+G+E+ W S + G TWS E+
Sbjct: 88 WSFPKLLVKVRNDAHWNPVLFQNPAGRLFLYFKVGKEIDDWETWYIHSDDLGNTWSNPEE 147
Query: 127 LPPGI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
L PG GP +N ++L NG L +S+E W ++++ + D G++W K + +
Sbjct: 148 LMPGDQGGRGPVRNHVLVLSNGTWLAPASIEKNRVWNAFVDASSDYGKTWVKSDTLELDR 207
Query: 184 ESLS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ ++ VIQP + + + +L+R+ G++ S+S D G TWS T L N NSG
Sbjct: 208 QLINGEGVIQPALWESCPGQVHMLLRT--STGKIGRSDSNDYGETWSPVYLTDLPNNNSG 265
Query: 241 IDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLE-ENLAMEFSYPAV 294
ID + L LAYN V SR + +A+S+D+GDSW + +E EFSYP++
Sbjct: 266 IDVAHMGGKSLALAYNPVGKNWGSRYPMTLAISEDNGDSWPFKIDIEVGKEGNEFSYPSL 325
Query: 295 IQASDGSVHITYTYNRTQIKHV 316
+ + G + + YT+NRT I+ V
Sbjct: 326 LYEA-GHLVLCYTWNRTNIRFV 346
>gi|167553718|ref|ZP_02347464.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Saintpaul str. SARA29]
gi|418846517|ref|ZP_13401286.1| hypothetical protein SEEN443_16062 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|418858518|ref|ZP_13413132.1| hypothetical protein SEEN470_02240 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|418864008|ref|ZP_13418544.1| hypothetical protein SEEN536_15791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|418868680|ref|ZP_13423121.1| hypothetical protein SEEN176_02779 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
gi|205321879|gb|EDZ09718.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Saintpaul str. SARA29]
gi|392810390|gb|EJA66410.1| hypothetical protein SEEN443_16062 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19443]
gi|392831857|gb|EJA87484.1| hypothetical protein SEEN536_15791 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19536]
gi|392832489|gb|EJA88109.1| hypothetical protein SEEN470_02240 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 19470]
gi|392837370|gb|EJA92940.1| hypothetical protein SEEN176_02779 [Salmonella enterica subsp.
enterica serovar Newport str. CVM 4176]
Length = 347
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 171/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G CEG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|161503647|ref|YP_001570759.1| hypothetical protein SARI_01730 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160864994|gb|ABX21617.1| hypothetical protein SARI_01730 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 347
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/346 (35%), Positives = 172/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + PA+ A CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQKIIPASRATPFQCHASTLVCLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
P WNPVLF PS+ L LFYK+G +V W S ++G TWS +L
Sbjct: 64 KPQRVAACEGEAHWNPVLFS-PSDRLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPTELV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW-------RKYGP 178
G + GP+KNK +L NG + S+E+ W ++++ + D G+ W +
Sbjct: 123 KGDILPRGPAKNKLLLASNGAWIAPGSIENAEHWEAFVDRSCDEGKHWDISFVPLEPHNM 182
Query: 179 IYIPNESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
I N +L VIQP + ++ + +L+RS G V S
Sbjct: 183 ISAKNVALWEGIKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--VFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S D+G
Sbjct: 241 DSTDYGATWSVARATALPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASCDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE N E+SYPA+I +G VHITYT+NR I + L
Sbjct: 301 TSWLPLLELESNRG-EYSYPAIIN-EEGVVHITYTWNRKNIVYCRL 344
>gi|168239713|ref|ZP_02664771.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
gi|194736503|ref|YP_002114266.1| hypothetical protein SeSA_A1340 [Salmonella enterica subsp.
enterica serovar Schwarzengrund str. CVM19633]
gi|194712005|gb|ACF91226.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. CVM19633]
gi|197287621|gb|EDY27013.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Schwarzengrund str. SL480]
Length = 347
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 173/346 (50%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + F CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI----Y 180
+ GP KNK +L NG + S+ES W ++++ + D G+ W + P+
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADNA 182
Query: 181 IPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
IP +++ VIQP + ++ + +L+RS G + S
Sbjct: 183 IPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V + DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|29141600|ref|NP_804942.1| hypothetical protein t1130 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|213053004|ref|ZP_03345882.1| hypothetical protein Salmoneentericaenterica_08861 [Salmonella
enterica subsp. enterica serovar Typhi str. E00-7866]
gi|213421059|ref|ZP_03354125.1| hypothetical protein Salmonentericaenterica_26257 [Salmonella
enterica subsp. enterica serovar Typhi str. E01-6750]
gi|213622722|ref|ZP_03375505.1| hypothetical protein SentesTyp_36516 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-2068]
gi|213646491|ref|ZP_03376544.1| hypothetical protein SentesTy_03656 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
gi|213855360|ref|ZP_03383600.1| hypothetical protein SentesT_15367 [Salmonella enterica subsp.
enterica serovar Typhi str. M223]
gi|289828543|ref|ZP_06546385.1| hypothetical protein Salmonellentericaenterica_18887 [Salmonella
enterica subsp. enterica serovar Typhi str. E98-3139]
gi|378959292|ref|YP_005216778.1| BNR/Asp-box repeat domain-containing protein [Salmonella enterica
subsp. enterica serovar Typhi str. P-stx-12]
gi|29137228|gb|AAO68791.1| hypothetical protein t1130 [Salmonella enterica subsp. enterica
serovar Typhi str. Ty2]
gi|374353164|gb|AEZ44925.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Typhi str. P-stx-12]
Length = 347
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G CEG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 344
>gi|224584240|ref|YP_002638038.1| hypothetical protein SPC_2491 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|375114152|ref|ZP_09759322.1| Neuraminidase [Salmonella enterica subsp. enterica serovar
Choleraesuis str. SCSA50]
gi|224468767|gb|ACN46597.1| hypothetical protein SPC_2491 [Salmonella enterica subsp. enterica
serovar Paratyphi C strain RKS4594]
gi|322714298|gb|EFZ05869.1| Neuraminidase [Salmonella enterica subsp. enterica serovar
Choleraesuis str. SCSA50]
Length = 347
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I A G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344
>gi|167771446|ref|ZP_02443499.1| hypothetical protein ANACOL_02812 [Anaerotruncus colihominis DSM
17241]
gi|167666086|gb|EDS10216.1| BNR/Asp-box repeat protein [Anaerotruncus colihominis DSM 17241]
Length = 398
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 116/343 (33%), Positives = 172/343 (50%), Gaps = 49/343 (14%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
+ ++ P + F +CH+STIV+V K +L AY G CEG PD+ IWL DG W P
Sbjct: 56 INQQLIMPEDQTWFGNCHSSTIVKVGKNEYLCAYMAGECEGKPDMAIWLSRCVDGIWLQP 115
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
P WNPVL+ + L +K+G V W + +S + G TWS + PG
Sbjct: 116 EKIMGVYRFPHWNPVLY-FDGTTVTLIFKVGPSVPLWYSMISQSKDLGKTWSVPREAVPG 174
Query: 131 ILGP---SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP-NESL 186
P S+NK I NG L S+E+ W S+++++ D GR+W K+ IP N L
Sbjct: 175 DYQPRITSRNKIIRGSNGYLYGPCSIETEKYWDSYIDISKDNGRTWSKHT---IPFNHDL 231
Query: 187 ------------------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
+IQP + ++ L+RS G R+
Sbjct: 232 RQEDKNGGSWNGLAQGALWENDLATVLKWDGIIQPTIWESSAGIFHALLRSTRG--RIYR 289
Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
++S D G TW A PT + N NSGID K+ +G L+LAYN +S R + +++S+D+
Sbjct: 290 TDSTDYGDTWCEAYPTDIPNNNSGIDIAKMDNGTLVLAYNPISGNWSARSPISLSISEDN 349
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGS-VHITYTYNRTQI 313
G+++ + LE EFSYPA++ +DGS +++TYTY R I
Sbjct: 350 GNTFSQPVHLETKDG-EFSYPAIL--ADGSHIYMTYTYKRKSI 389
>gi|417334319|ref|ZP_12117570.1| hypothetical protein LTSEALA_2795 [Salmonella enterica subsp.
enterica serovar Alachua str. R6-377]
gi|353575510|gb|EHC38228.1| hypothetical protein LTSEALA_2795 [Salmonella enterica subsp.
enterica serovar Alachua str. R6-377]
Length = 347
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRKVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAASEGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I A G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344
>gi|161614362|ref|YP_001588327.1| hypothetical protein SPAB_02110 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
gi|161363726|gb|ABX67494.1| hypothetical protein SPAB_02110 [Salmonella enterica subsp.
enterica serovar Paratyphi B str. SPB7]
Length = 347
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G V W S ++G TWS L
Sbjct: 64 TPQRVVAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
G + GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKRVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|423139679|ref|ZP_17127317.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. houtenae
str. ATCC BAA-1581]
gi|379052233|gb|EHY70124.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. houtenae
str. ATCC BAA-1581]
Length = 347
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 175/346 (50%), Gaps = 43/346 (12%)
Query: 10 LVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + + PF+ CHAST+V + G + A+F G EG+ D IWL + W
Sbjct: 5 LVNRQVILPESGNEPFQ-CHASTLVRLPCGTLVAAWFAGLWEGSEDTAIWLSRYDHSIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
P WNPVLF PS+ L LFYK+G +V W S ++G TWS +L
Sbjct: 64 KPQRVTVREGKAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSPPTELV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
G + GP KNK +L NG+ + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGVWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWEGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G+TWS A+ T L N NSGID V ++DG L+LA N V+ R L + +S ++G
Sbjct: 241 DSVDYGVTWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGRRYPLSLIVSHNNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 KSWLPLLDLESDRG-EYSYPAII-SEGGIVHITYTWNRKNIVYCRL 344
>gi|62179834|ref|YP_216251.1| hypothetical protein SC1264 [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
gi|62127467|gb|AAX65170.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Choleraesuis str. SC-B67]
Length = 347
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 169/345 (48%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W +++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRVFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SETNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I A G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344
>gi|200391019|ref|ZP_03217630.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
gi|199603464|gb|EDZ02010.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Virchow str. SL491]
Length = 347
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGKLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLVRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|168264223|ref|ZP_02686196.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
gi|205347291|gb|EDZ33922.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Hadar str. RI_05P066]
Length = 347
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLITSQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|168244341|ref|ZP_02669273.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|194449596|ref|YP_002045253.1| hypothetical protein SeHA_C1371 [Salmonella enterica subsp.
enterica serovar Heidelberg str. SL476]
gi|386591110|ref|YP_006087510.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|419728984|ref|ZP_14255945.1| hypothetical protein SEEH1579_06336 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|419734605|ref|ZP_14261494.1| hypothetical protein SEEH1563_08194 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|419738478|ref|ZP_14265240.1| hypothetical protein SEEH1573_12664 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|419744710|ref|ZP_14271363.1| hypothetical protein SEEH1566_17357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|419747768|ref|ZP_14274271.1| hypothetical protein SEEH1565_01244 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|421570695|ref|ZP_16016380.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00322]
gi|421575919|ref|ZP_16021525.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00325]
gi|421580796|ref|ZP_16026350.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00326]
gi|421586419|ref|ZP_16031900.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00328]
gi|194407900|gb|ACF68119.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL476]
gi|205336718|gb|EDZ23482.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Heidelberg str. SL486]
gi|381297283|gb|EIC38377.1| hypothetical protein SEEH1563_08194 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41563]
gi|381297687|gb|EIC38773.1| hypothetical protein SEEH1579_06336 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41579]
gi|381302276|gb|EIC43320.1| hypothetical protein SEEH1573_12664 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41573]
gi|381307908|gb|EIC48756.1| hypothetical protein SEEH1566_17357 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41566]
gi|381316479|gb|EIC57228.1| hypothetical protein SEEH1565_01244 [Salmonella enterica subsp.
enterica serovar Heidelberg str. 41565]
gi|383798154|gb|AFH45236.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. B182]
gi|402519291|gb|EJW26654.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00326]
gi|402519872|gb|EJW27227.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00325]
gi|402523275|gb|EJW30593.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00322]
gi|402527818|gb|EJW35076.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
serovar Heidelberg str. CFSAN00328]
Length = 347
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|207857245|ref|YP_002243896.1| hypothetical protein SEN1799 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|421358257|ref|ZP_15808555.1| hypothetical protein SEEE3139_09409 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|421364478|ref|ZP_15814710.1| hypothetical protein SEEE0166_17747 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|421366733|ref|ZP_15816935.1| hypothetical protein SEEE0631_06073 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|421373445|ref|ZP_15823585.1| hypothetical protein SEEE0424_17144 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|421377170|ref|ZP_15827269.1| hypothetical protein SEEE3076_13088 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|421381669|ref|ZP_15831724.1| hypothetical protein SEEE4917_12838 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|421385349|ref|ZP_15835371.1| hypothetical protein SEEE6622_08654 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|421390323|ref|ZP_15840298.1| hypothetical protein SEEE6670_10927 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|421393583|ref|ZP_15843527.1| hypothetical protein SEEE6426_04619 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|421398371|ref|ZP_15848279.1| hypothetical protein SEEE6437_06551 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|421404183|ref|ZP_15854027.1| hypothetical protein SEEE7246_13025 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|421409694|ref|ZP_15859484.1| hypothetical protein SEEE7250_18127 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|421413417|ref|ZP_15863171.1| hypothetical protein SEEE1427_14046 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|421418729|ref|ZP_15868430.1| hypothetical protein SEEE2659_18131 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|421422405|ref|ZP_15872073.1| hypothetical protein SEEE1757_13914 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|421426358|ref|ZP_15875986.1| hypothetical protein SEEE5101_11107 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|421432891|ref|ZP_15882459.1| hypothetical protein SEEE8B1_21287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|421434896|ref|ZP_15884442.1| hypothetical protein SEEE5518_08095 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|421442212|ref|ZP_15891672.1| hypothetical protein SEEE1618_22209 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|421444503|ref|ZP_15893933.1| hypothetical protein SEEE3079_10672 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|421448192|ref|ZP_15897587.1| hypothetical protein SEEE6482_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|436615807|ref|ZP_20514272.1| hypothetical protein SEE22704_12964 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|436661040|ref|ZP_20517144.1| hypothetical protein SEE30663_03338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|436802173|ref|ZP_20525236.1| hypothetical protein SEECHS44_18315 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|436809155|ref|ZP_20528535.1| hypothetical protein SEEE1882_12004 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|436815089|ref|ZP_20532640.1| hypothetical protein SEEE1884_09883 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|436844512|ref|ZP_20538270.1| hypothetical protein SEEE1594_15593 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|436854157|ref|ZP_20543791.1| hypothetical protein SEEE1566_20694 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|436857445|ref|ZP_20545965.1| hypothetical protein SEEE1580_09000 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|436864618|ref|ZP_20550585.1| hypothetical protein SEEE1543_09785 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|436873818|ref|ZP_20556542.1| hypothetical protein SEEE1441_17432 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|436877984|ref|ZP_20558839.1| hypothetical protein SEEE1810_06327 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|436888475|ref|ZP_20564804.1| hypothetical protein SEEE1558_13714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|436895742|ref|ZP_20568498.1| hypothetical protein SEEE1018_09447 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|436901624|ref|ZP_20572534.1| hypothetical protein SEEE1010_07259 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|436912337|ref|ZP_20578166.1| hypothetical protein SEEE1729_13185 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|436922269|ref|ZP_20584494.1| hypothetical protein SEEE0895_22380 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|436926994|ref|ZP_20586820.1| hypothetical protein SEEE0899_11154 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|436936288|ref|ZP_20591728.1| hypothetical protein SEEE1457_13246 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|436943478|ref|ZP_20596424.1| hypothetical protein SEEE1747_14387 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|436951034|ref|ZP_20600089.1| hypothetical protein SEEE0968_10029 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|436961641|ref|ZP_20605015.1| hypothetical protein SEEE1444_12060 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|436970765|ref|ZP_20609158.1| hypothetical protein SEEE1445_10221 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|436982073|ref|ZP_20613569.1| hypothetical protein SEEE1559_09954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|436994712|ref|ZP_20618980.1| hypothetical protein SEEE1565_14509 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|437001793|ref|ZP_20621072.1| hypothetical protein SEEE1808_02374 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|437020015|ref|ZP_20627166.1| hypothetical protein SEEE1811_10403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|437033926|ref|ZP_20632810.1| hypothetical protein SEEE0956_16111 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|437045625|ref|ZP_20637923.1| hypothetical protein SEEE1455_19166 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|437053838|ref|ZP_20642637.1| hypothetical protein SEEE1575_20376 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|437058808|ref|ZP_20645655.1| hypothetical protein SEEE1725_13019 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|437070369|ref|ZP_20651547.1| hypothetical protein SEEE1745_20038 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|437076498|ref|ZP_20654861.1| hypothetical protein SEEE1791_13902 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|437081140|ref|ZP_20657592.1| hypothetical protein SEEE1795_05026 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|437091697|ref|ZP_20663297.1| hypothetical protein SEEE6709_11337 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|437101910|ref|ZP_20666359.1| hypothetical protein SEEE9058_03884 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|437121633|ref|ZP_20671827.1| hypothetical protein SEEE0816_08842 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|437131102|ref|ZP_20677232.1| hypothetical protein SEEE0819_13345 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|437138854|ref|ZP_20681336.1| hypothetical protein SEEE3072_11262 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|437145709|ref|ZP_20685616.1| hypothetical protein SEEE3089_10037 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|437156786|ref|ZP_20692322.1| hypothetical protein SEEE9163_21197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|437159253|ref|ZP_20693767.1| hypothetical protein SEEE151_05614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|437166082|ref|ZP_20697867.1| hypothetical protein SEEEN202_03741 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|437178723|ref|ZP_20704841.1| hypothetical protein SEEE3991_16458 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|437187795|ref|ZP_20710076.1| hypothetical protein SEEE3618_20478 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|437198442|ref|ZP_20711381.1| hypothetical protein SEEE1831_04112 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|437258728|ref|ZP_20716648.1| hypothetical protein SEEE2490_04544 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|437268297|ref|ZP_20721767.1| hypothetical protein SEEEL909_07913 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|437277135|ref|ZP_20726654.1| hypothetical protein SEEEL913_09775 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|437293445|ref|ZP_20732160.1| hypothetical protein SEEE4941_15102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|437312214|ref|ZP_20736322.1| hypothetical protein SEEE7015_13535 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|437328108|ref|ZP_20740890.1| hypothetical protein SEEE7927_13718 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|437343667|ref|ZP_20745835.1| hypothetical protein SEEECHS4_15995 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|437373562|ref|ZP_20749657.1| hypothetical protein SEEE2558_14484 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22558]
gi|437404159|ref|ZP_20752017.1| hypothetical protein SEEE2217_01700 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|437443993|ref|ZP_20758159.1| hypothetical protein SEEE4018_10092 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|437460590|ref|ZP_20761544.1| hypothetical protein SEEE6211_04232 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|437481393|ref|ZP_20768925.1| hypothetical protein SEEE4441_18971 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|437494631|ref|ZP_20772574.1| hypothetical protein SEEE4647_14758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|437503932|ref|ZP_20774946.1| hypothetical protein SEEE9845_04038 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|437540090|ref|ZP_20782264.1| hypothetical protein SEEE9317_18535 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|437560982|ref|ZP_20786266.1| hypothetical protein SEEE0116_15785 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|437577678|ref|ZP_20791027.1| hypothetical protein SEEE1117_16834 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|437586346|ref|ZP_20793307.1| hypothetical protein SEEE1392_05599 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|437601775|ref|ZP_20798007.1| hypothetical protein SEEE0268_06583 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|437612714|ref|ZP_20801340.1| hypothetical protein SEEE0316_00512 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|437632540|ref|ZP_20806510.1| hypothetical protein SEEE0436_03904 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|437658095|ref|ZP_20811426.1| hypothetical protein SEEE1319_05243 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|437669255|ref|ZP_20815457.1| hypothetical protein SEEE4481_02719 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|437697048|ref|ZP_20822711.1| hypothetical protein SEEE6297_16475 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|437709448|ref|ZP_20826116.1| hypothetical protein SEEE4220_10960 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|437815168|ref|ZP_20842438.1| hypothetical protein SEEE3944_23353 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|437869314|ref|ZP_20848301.1| hypothetical protein SEEE5621_06740 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|437982824|ref|ZP_20853362.1| hypothetical protein SEEE5646_05468 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|438087086|ref|ZP_20859233.1| hypothetical protein SEEE2625_08452 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|438099815|ref|ZP_20863559.1| hypothetical protein SEEE1976_07464 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|438110647|ref|ZP_20868045.1| hypothetical protein SEEE3407_07431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|445169197|ref|ZP_21395136.1| hypothetical protein SEE8A_002845 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|445198050|ref|ZP_21400936.1| hypothetical protein SE20037_15191 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|445314862|ref|ZP_21411932.1| hypothetical protein SEE436_024495 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|445328324|ref|ZP_21412930.1| hypothetical protein SEE18569_010557 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|445353163|ref|ZP_21421055.1| hypothetical protein SEE13_020140 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|445363842|ref|ZP_21424765.1| hypothetical protein SEE23_013035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
gi|206709048|emb|CAR33379.1| hypothetical protein SEN1799 [Salmonella enterica subsp. enterica
serovar Enteritidis str. P125109]
gi|395983967|gb|EJH93157.1| hypothetical protein SEEE0166_17747 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639016-6]
gi|395988561|gb|EJH97717.1| hypothetical protein SEEE3139_09409 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 622731-39]
gi|395989388|gb|EJH98522.1| hypothetical protein SEEE0631_06073 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 640631]
gi|395996564|gb|EJI05609.1| hypothetical protein SEEE0424_17144 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-0424]
gi|396000792|gb|EJI09806.1| hypothetical protein SEEE3076_13088 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-6]
gi|396001632|gb|EJI10644.1| hypothetical protein SEEE4917_12838 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 485549-17]
gi|396014133|gb|EJI23019.1| hypothetical protein SEEE6670_10927 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-70]
gi|396016786|gb|EJI25653.1| hypothetical protein SEEE6622_08654 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 596866-22]
gi|396017466|gb|EJI26331.1| hypothetical protein SEEE6426_04619 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-26]
gi|396024991|gb|EJI33775.1| hypothetical protein SEEE7250_18127 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-50]
gi|396027263|gb|EJI36027.1| hypothetical protein SEEE7246_13025 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 639672-46]
gi|396031444|gb|EJI40171.1| hypothetical protein SEEE6437_06551 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629164-37]
gi|396038007|gb|EJI46651.1| hypothetical protein SEEE2659_18131 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-2659]
gi|396040505|gb|EJI49129.1| hypothetical protein SEEE1427_14046 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 77-1427]
gi|396041720|gb|EJI50343.1| hypothetical protein SEEE1757_13914 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 78-1757]
gi|396049107|gb|EJI57650.1| hypothetical protein SEEE8B1_21287 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 8b-1]
gi|396053865|gb|EJI62358.1| hypothetical protein SEEE5101_11107 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22510-1]
gi|396059277|gb|EJI67732.1| hypothetical protein SEEE5518_08095 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648905 5-18]
gi|396062889|gb|EJI71300.1| hypothetical protein SEEE1618_22209 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 6-18]
gi|396066934|gb|EJI75294.1| hypothetical protein SEEE3079_10672 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-3079]
gi|396073792|gb|EJI82092.1| hypothetical protein SEEE6482_06556 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 58-6482]
gi|434957577|gb|ELL51206.1| hypothetical protein SEECHS44_18315 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS44]
gi|434966972|gb|ELL59807.1| hypothetical protein SEEE1882_12004 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1882]
gi|434967625|gb|ELL60431.1| hypothetical protein SEE22704_12964 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22704]
gi|434973205|gb|ELL65593.1| hypothetical protein SEEE1884_09883 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1884]
gi|434979098|gb|ELL71090.1| hypothetical protein SEEE1594_15593 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1594]
gi|434982960|gb|ELL74768.1| hypothetical protein SEEE1566_20694 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1566]
gi|434989597|gb|ELL81147.1| hypothetical protein SEEE1580_09000 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1580]
gi|434995653|gb|ELL86969.1| hypothetical protein SEEE1543_09785 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1543]
gi|434998575|gb|ELL89796.1| hypothetical protein SEEE1441_17432 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1441]
gi|435007921|gb|ELL98748.1| hypothetical protein SEEE1810_06327 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1810]
gi|435010185|gb|ELM00971.1| hypothetical protein SEEE1558_13714 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1558]
gi|435015631|gb|ELM06157.1| hypothetical protein SEEE1018_09447 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1018]
gi|435020063|gb|ELM10488.1| hypothetical protein SEE30663_03338 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE30663]
gi|435021058|gb|ELM11447.1| hypothetical protein SEEE1010_07259 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1010]
gi|435024587|gb|ELM14793.1| hypothetical protein SEEE0895_22380 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0895]
gi|435026582|gb|ELM16713.1| hypothetical protein SEEE1729_13185 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1729]
gi|435036835|gb|ELM26654.1| hypothetical protein SEEE0899_11154 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0899]
gi|435039126|gb|ELM28907.1| hypothetical protein SEEE1457_13246 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1457]
gi|435043677|gb|ELM33394.1| hypothetical protein SEEE1747_14387 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1747]
gi|435050780|gb|ELM40284.1| hypothetical protein SEEE1444_12060 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1444]
gi|435051501|gb|ELM41003.1| hypothetical protein SEEE0968_10029 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0968]
gi|435057054|gb|ELM46423.1| hypothetical protein SEEE1445_10221 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1445]
gi|435063943|gb|ELM53090.1| hypothetical protein SEEE1565_14509 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1565]
gi|435066327|gb|ELM55415.1| hypothetical protein SEEE1559_09954 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1559]
gi|435074664|gb|ELM63488.1| hypothetical protein SEEE1808_02374 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1808]
gi|435076816|gb|ELM65598.1| hypothetical protein SEEE0956_16111 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_0956]
gi|435079652|gb|ELM68347.1| hypothetical protein SEEE1811_10403 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1811]
gi|435079913|gb|ELM68607.1| hypothetical protein SEEE1455_19166 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1455]
gi|435088852|gb|ELM77307.1| hypothetical protein SEEE1575_20376 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1575]
gi|435090340|gb|ELM78742.1| hypothetical protein SEEE1745_20038 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1745]
gi|435094621|gb|ELM82960.1| hypothetical protein SEEE1725_13019 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1725]
gi|435105795|gb|ELM93832.1| hypothetical protein SEEE1791_13902 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1791]
gi|435111759|gb|ELM99647.1| hypothetical protein SEEE1795_05026 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CDC_2010K_1795]
gi|435112603|gb|ELN00468.1| hypothetical protein SEEE6709_11337 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 576709]
gi|435123889|gb|ELN11380.1| hypothetical protein SEEE9058_03884 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 635290-58]
gi|435125076|gb|ELN12532.1| hypothetical protein SEEE0819_13345 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-19]
gi|435125720|gb|ELN13157.1| hypothetical protein SEEE0816_08842 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-16]
gi|435132376|gb|ELN19574.1| hypothetical protein SEEE3072_11262 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607307-2]
gi|435135393|gb|ELN22502.1| hypothetical protein SEEE9163_21197 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 629163]
gi|435137170|gb|ELN24241.1| hypothetical protein SEEE3089_10037 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 607308-9]
gi|435150186|gb|ELN36870.1| hypothetical protein SEEE151_05614 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE15-1]
gi|435152140|gb|ELN38770.1| hypothetical protein SEEE3991_16458 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_56-3991]
gi|435153439|gb|ELN40047.1| hypothetical protein SEEEN202_03741 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_N202]
gi|435156727|gb|ELN43215.1| hypothetical protein SEEE3618_20478 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_76-3618]
gi|435165975|gb|ELN51977.1| hypothetical protein SEEE2490_04544 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_81-2490]
gi|435173321|gb|ELN58831.1| hypothetical protein SEEEL913_09775 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL913]
gi|435174476|gb|ELN59918.1| hypothetical protein SEEEL909_07913 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SL909]
gi|435180884|gb|ELN65989.1| hypothetical protein SEEE4941_15102 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CVM_69-4941]
gi|435183346|gb|ELN68321.1| hypothetical protein SEEE7015_13535 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 638970-15]
gi|435191342|gb|ELN75908.1| hypothetical protein SEEE7927_13718 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 17927]
gi|435191444|gb|ELN76001.1| hypothetical protein SEEECHS4_15995 [Salmonella enterica subsp.
enterica serovar Enteritidis str. CHS4]
gi|435205577|gb|ELN89166.1| hypothetical protein SEEE2217_01700 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 22-17]
gi|435205632|gb|ELN89218.1| hypothetical protein SEEE2558_14484 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 22558]
gi|435210693|gb|ELN93931.1| hypothetical protein SEEE4018_10092 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 40-18]
gi|435217808|gb|ELO00223.1| hypothetical protein SEEE4441_18971 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 4-1]
gi|435220882|gb|ELO03156.1| hypothetical protein SEEE6211_04232 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 1-1]
gi|435222329|gb|ELO04443.1| hypothetical protein SEEE1831_04112 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13183-1]
gi|435227148|gb|ELO08671.1| hypothetical protein SEEE4647_14758 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642046 4-7]
gi|435236459|gb|ELO17194.1| hypothetical protein SEEE9845_04038 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648898 4-5]
gi|435237453|gb|ELO18132.1| hypothetical protein SEEE9317_18535 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648899 3-17]
gi|435238308|gb|ELO18957.1| hypothetical protein SEEE0116_15785 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648900 1-16]
gi|435242620|gb|ELO22924.1| hypothetical protein SEEE1117_16834 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 1-17]
gi|435260304|gb|ELO39515.1| hypothetical protein SEEE1392_05599 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648901 39-2]
gi|435260787|gb|ELO39977.1| hypothetical protein SEEE0268_06583 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648902 6-8]
gi|435265287|gb|ELO44168.1| hypothetical protein SEEE0316_00512 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648903 1-6]
gi|435270790|gb|ELO49275.1| hypothetical protein SEEE1319_05243 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 653049 13-19]
gi|435276636|gb|ELO54638.1| hypothetical protein SEEE6297_16475 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 561362 9-7]
gi|435282233|gb|ELO59857.1| hypothetical protein SEEE4481_02719 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 642044 8-1]
gi|435282893|gb|ELO60492.1| hypothetical protein SEEE0436_03904 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 648904 3-6]
gi|435289569|gb|ELO66522.1| hypothetical protein SEEE4220_10960 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 543463 42-20]
gi|435295413|gb|ELO71875.1| hypothetical protein SEEE3944_23353 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 33944]
gi|435318294|gb|ELO91235.1| hypothetical protein SEEE2625_08452 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 81-2625]
gi|435325413|gb|ELO97278.1| hypothetical protein SEEE1976_07464 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 62-1976]
gi|435331854|gb|ELP02952.1| hypothetical protein SEEE3407_07431 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 53-407]
gi|435335251|gb|ELP05536.1| hypothetical protein SEEE5621_06740 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 6.0562-1]
gi|435337895|gb|ELP07341.1| hypothetical protein SEEE5646_05468 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 50-5646]
gi|444863100|gb|ELX87933.1| hypothetical protein SE20037_15191 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 20037]
gi|444863705|gb|ELX88522.1| hypothetical protein SEE8A_002845 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE8a]
gi|444871931|gb|ELX96317.1| hypothetical protein SEE436_024495 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 436]
gi|444873340|gb|ELX97641.1| hypothetical protein SEE13_020140 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 13-1]
gi|444880101|gb|ELY04185.1| hypothetical protein SEE18569_010557 [Salmonella enterica subsp.
enterica serovar Enteritidis str. 18569]
gi|444883553|gb|ELY07432.1| hypothetical protein SEE23_013035 [Salmonella enterica subsp.
enterica serovar Enteritidis str. PT23]
Length = 347
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|205353019|ref|YP_002226820.1| hypothetical protein SG1872 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|375123847|ref|ZP_09769011.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|445131150|ref|ZP_21381649.1| hypothetical protein SEEG9184_020752 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
gi|205272800|emb|CAR37723.1| hypothetical protein SG1872 [Salmonella enterica subsp. enterica
serovar Gallinarum str. 287/91]
gi|326628097|gb|EGE34440.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Gallinarum str. SG9]
gi|444850990|gb|ELX76086.1| hypothetical protein SEEG9184_020752 [Salmonella enterica subsp.
enterica serovar Gallinarum str. 9184]
Length = 347
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTKSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|421884817|ref|ZP_16316022.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
gi|379985531|emb|CCF88295.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Senftenberg str. SS209]
Length = 347
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|417518819|ref|ZP_12181108.1| hypothetical protein LTSEUGA_2789 [Salmonella enterica subsp.
enterica serovar Uganda str. R8-3404]
gi|353648337|gb|EHC91256.1| hypothetical protein LTSEUGA_2789 [Salmonella enterica subsp.
enterica serovar Uganda str. R8-3404]
Length = 347
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRS--TCGAIFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLITSQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|437725350|ref|ZP_20829766.1| hypothetical protein SEEE1616_06119, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
gi|435292569|gb|ELO69326.1| hypothetical protein SEEE1616_06119, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 648901 16-16]
Length = 351
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 9 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 68
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 69 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 127
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 128 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 187
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 188 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 245
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 246 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 305
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 306 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 348
>gi|197251200|ref|YP_002146789.1| hypothetical protein SeAg_B1929 [Salmonella enterica subsp.
enterica serovar Agona str. SL483]
gi|440761730|ref|ZP_20940799.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH11G1113]
gi|440767598|ref|ZP_20946574.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH08SF124]
gi|440774047|ref|ZP_20952935.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH10GFN094]
gi|197214903|gb|ACH52300.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Agona str. SL483]
gi|436413565|gb|ELP11498.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH10GFN094]
gi|436419507|gb|ELP17382.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH08SF124]
gi|436424510|gb|ELP22281.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Agona str. SH11G1113]
Length = 347
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|194444968|ref|YP_002040506.1| hypothetical protein SNSL254_A1356 [Salmonella enterica subsp.
enterica serovar Newport str. SL254]
gi|417366046|ref|ZP_12138473.1| hypothetical protein LTSEHVI_2715 [Salmonella enterica subsp.
enterica serovar Hvittingfoss str. A4-620]
gi|418795705|ref|ZP_13351406.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|418798443|ref|ZP_13354120.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|418808791|ref|ZP_13364344.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|418812947|ref|ZP_13368468.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|418816972|ref|ZP_13372460.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|418820413|ref|ZP_13375846.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|418824215|ref|ZP_13379586.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
gi|418832840|ref|ZP_13387774.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|418835003|ref|ZP_13389902.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|418839690|ref|ZP_13394522.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|418852273|ref|ZP_13406977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|418855503|ref|ZP_13410159.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19593]
gi|194403631|gb|ACF63853.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. SL254]
gi|353592952|gb|EHC50836.1| hypothetical protein LTSEHVI_2715 [Salmonella enterica subsp.
enterica serovar Hvittingfoss str. A4-620]
gi|392758233|gb|EJA15108.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19449]
gi|392767100|gb|EJA23872.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19567]
gi|392774173|gb|EJA30868.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22513]
gi|392775474|gb|EJA32166.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21550]
gi|392789140|gb|EJA45660.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 21538]
gi|392792682|gb|EJA49136.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Newport str. CVM 22425]
gi|392796910|gb|EJA53238.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM N18486]
gi|392804284|gb|EJA60449.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM N1543]
gi|392812134|gb|EJA68129.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 21554]
gi|392815413|gb|EJA71355.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 37978]
gi|392821561|gb|EJA77385.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 19593]
gi|392824194|gb|EJA79982.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Newport str. CVM 22462]
Length = 347
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|437781881|ref|ZP_20836562.1| hypothetical protein SEEE2651_18221, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 76-2651]
gi|435299905|gb|ELO76014.1| hypothetical protein SEEE2651_18221, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. 76-2651]
Length = 341
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/342 (34%), Positives = 169/342 (49%), Gaps = 41/342 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
SW L LE + E+SYPA+I + G VHITYT+NR I +
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVY 341
>gi|16760641|ref|NP_456258.1| hypothetical protein STY1869 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|25512485|pir||AE0716 hypothetical protein STY1869 [imported] - Salmonella enterica
subsp. enterica serovar Typhi (strain CT18)
gi|16502937|emb|CAD02102.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhi]
Length = 347
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 169/346 (48%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G CEG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ P KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRSPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 344
>gi|16764604|ref|NP_460219.1| hypothetical protein STM1252 [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|167991947|ref|ZP_02573046.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|197265228|ref|ZP_03165302.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|374980252|ref|ZP_09721582.1| hypothetical protein SEE_02317 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|378444682|ref|YP_005232314.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhimurium str. D23580]
gi|378449728|ref|YP_005237087.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|378699142|ref|YP_005181099.1| hypothetical protein SL1344_1188 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|378983810|ref|YP_005246965.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|378988594|ref|YP_005251758.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|379700422|ref|YP_005242150.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|383495967|ref|YP_005396656.1| hypothetical protein UMN798_1299 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|422025401|ref|ZP_16371834.1| hypothetical protein B571_06201 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|422030405|ref|ZP_16376607.1| hypothetical protein B572_06153 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|427548970|ref|ZP_18927144.1| hypothetical protein B576_06301 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|427564625|ref|ZP_18931847.1| hypothetical protein B577_05655 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|427584472|ref|ZP_18936644.1| hypothetical protein B573_05696 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|427606886|ref|ZP_18941458.1| hypothetical protein B574_05724 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|427632073|ref|ZP_18946406.1| hypothetical protein B575_06292 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|427655242|ref|ZP_18951163.1| hypothetical protein B578_05907 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|427660578|ref|ZP_18956069.1| hypothetical protein B579_06527 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|427666430|ref|ZP_18960841.1| hypothetical protein B580_06089 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
gi|16419768|gb|AAL20178.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. LT2]
gi|197243483|gb|EDY26103.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Saintpaul str. SARA23]
gi|205329805|gb|EDZ16569.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar 4,[5],12:i:- str. CVM23701]
gi|261246461|emb|CBG24271.1| hypothetical protein STMMW_12591 [Salmonella enterica subsp.
enterica serovar Typhimurium str. D23580]
gi|267993106|gb|ACY87991.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. 14028S]
gi|301157790|emb|CBW17283.1| hypothetical protein SL1344_1188 [Salmonella enterica subsp.
enterica serovar Typhimurium str. SL1344]
gi|312912238|dbj|BAJ36212.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. T000240]
gi|321223872|gb|EFX48935.1| hypothetical protein SEE_02317 [Salmonella enterica subsp. enterica
serovar Typhimurium str. TN061786]
gi|323129521|gb|ADX16951.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. ST4/74]
gi|332988141|gb|AEF07124.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Typhimurium str. UK-1]
gi|380462788|gb|AFD58191.1| hypothetical protein UMN798_1299 [Salmonella enterica subsp.
enterica serovar Typhimurium str. 798]
gi|414020699|gb|EKT04275.1| hypothetical protein B571_06201 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm1]
gi|414020749|gb|EKT04322.1| hypothetical protein B576_06301 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm8]
gi|414022231|gb|EKT05722.1| hypothetical protein B572_06153 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm2]
gi|414034715|gb|EKT17633.1| hypothetical protein B577_05655 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm9]
gi|414035915|gb|EKT18761.1| hypothetical protein B573_05696 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm3]
gi|414039437|gb|EKT22108.1| hypothetical protein B574_05724 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm4]
gi|414049051|gb|EKT31277.1| hypothetical protein B578_05907 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm10]
gi|414050524|gb|EKT32695.1| hypothetical protein B575_06292 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm6]
gi|414055087|gb|EKT37007.1| hypothetical protein B579_06527 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm11]
gi|414060598|gb|EKT42107.1| hypothetical protein B580_06089 [Salmonella enterica subsp.
enterica serovar Typhimurium str. STm12]
Length = 347
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGIVHITYTWNRKNIVYCRL 344
>gi|375001658|ref|ZP_09725998.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. enterica
serovar Infantis str. SARB27]
gi|353076346|gb|EHB42106.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. enterica
serovar Infantis str. SARB27]
Length = 347
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 174/347 (50%), Gaps = 43/347 (12%)
Query: 9 GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
LV + P + + PF+ CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62
Query: 68 QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS+ L
Sbjct: 63 TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSKPAPL 121
Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDN 181
Query: 185 SLS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
++S VIQP + ++ + +L+RS G +
Sbjct: 182 AISGINVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239
Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
S+S D G TWS A+ T L N NSGID V + DG L+LA N V+ R L + S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSLIASQDN 299
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 300 GESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|416422214|ref|ZP_11690118.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 315996572]
gi|416431233|ref|ZP_11695456.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-1]
gi|416441288|ref|ZP_11701500.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-3]
gi|416446574|ref|ZP_11705164.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-4]
gi|416451993|ref|ZP_11708660.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 515920-1]
gi|416458994|ref|ZP_11713503.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 515920-2]
gi|416467786|ref|ZP_11717635.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 531954]
gi|416479548|ref|ZP_11722357.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. NC_MB110209-0054]
gi|416489424|ref|ZP_11726188.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|416497443|ref|ZP_11729711.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|416507413|ref|ZP_11735361.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. SARB31]
gi|416523558|ref|ZP_11741193.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. ATCC BAA710]
gi|416528495|ref|ZP_11743945.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. LQC 10]
gi|416535613|ref|ZP_11747867.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. SARB30]
gi|416542799|ref|ZP_11751799.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 19N]
gi|416553920|ref|ZP_11757948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 29N]
gi|416558037|ref|ZP_11759980.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 42N]
gi|416571596|ref|ZP_11766830.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 4441 H]
gi|416576252|ref|ZP_11768939.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 81038-01]
gi|416585953|ref|ZP_11775253.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|416590965|ref|ZP_11778140.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 414877]
gi|416599002|ref|ZP_11783353.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 366867]
gi|416607919|ref|ZP_11788913.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 413180]
gi|416611186|ref|ZP_11790616.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 446600]
gi|416624269|ref|ZP_11797925.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 609458-1]
gi|416629135|ref|ZP_11799899.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 556150-1]
gi|416639489|ref|ZP_11804558.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 609460]
gi|416650788|ref|ZP_11810553.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 507440-20]
gi|416653554|ref|ZP_11811923.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 556152]
gi|416665961|ref|ZP_11817112.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB101509-0077]
gi|416685717|ref|ZP_11825075.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|416697216|ref|ZP_11828141.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB110209-0055]
gi|416707208|ref|ZP_11832306.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB111609-0052]
gi|416714504|ref|ZP_11837822.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 2009083312]
gi|416717061|ref|ZP_11839342.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 2009085258]
gi|416725187|ref|ZP_11845557.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 315731156]
gi|416731619|ref|ZP_11849475.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2009159199]
gi|416736115|ref|ZP_11851832.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008282]
gi|416748761|ref|ZP_11858977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008283]
gi|416759859|ref|ZP_11864667.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|416761919|ref|ZP_11865969.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008285]
gi|416767302|ref|ZP_11869799.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008287]
gi|417463557|ref|ZP_12164709.1| hypothetical protein LTSEMON_2682 [Salmonella enterica subsp.
enterica serovar Montevideo str. S5-403]
gi|418485908|ref|ZP_13054890.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 80959-06]
gi|418486692|ref|ZP_13055640.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035278]
gi|418495457|ref|ZP_13061899.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035318]
gi|418499249|ref|ZP_13065658.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035320]
gi|418503128|ref|ZP_13069497.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035321]
gi|418506461|ref|ZP_13072793.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035327]
gi|418527050|ref|ZP_13093007.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008286]
gi|322616641|gb|EFY13550.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 315996572]
gi|322619844|gb|EFY16718.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-1]
gi|322622412|gb|EFY19257.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-3]
gi|322627936|gb|EFY24726.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 495297-4]
gi|322632966|gb|EFY29709.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 515920-1]
gi|322636788|gb|EFY33491.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 515920-2]
gi|322641401|gb|EFY38040.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 531954]
gi|322645176|gb|EFY41705.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. NC_MB110209-0054]
gi|322650117|gb|EFY46531.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. OH_2009072675]
gi|322655691|gb|EFY51993.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CASC_09SCPH15965]
gi|322660015|gb|EFY56254.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 19N]
gi|322665417|gb|EFY61605.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 81038-01]
gi|322667605|gb|EFY63766.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MD_MDA09249507]
gi|322673601|gb|EFY69703.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 414877]
gi|322677527|gb|EFY73591.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 366867]
gi|322679808|gb|EFY75847.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 413180]
gi|322687281|gb|EFY83253.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 446600]
gi|323192398|gb|EFZ77628.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 609458-1]
gi|323199476|gb|EFZ84569.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 556150-1]
gi|323203664|gb|EFZ88686.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 609460]
gi|323214133|gb|EFZ98893.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 556152]
gi|323217769|gb|EGA02484.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB101509-0077]
gi|323218920|gb|EGA03431.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB102109-0047]
gi|323224991|gb|EGA09247.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB110209-0055]
gi|323229572|gb|EGA13695.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. MB111609-0052]
gi|323232795|gb|EGA16891.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 2009083312]
gi|323240167|gb|EGA24211.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 2009085258]
gi|323242846|gb|EGA26867.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 315731156]
gi|323247778|gb|EGA31717.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2009159199]
gi|323254338|gb|EGA38155.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008282]
gi|323256116|gb|EGA39853.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008283]
gi|323259438|gb|EGA43074.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008284]
gi|323267034|gb|EGA50519.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008285]
gi|323272138|gb|EGA55552.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008287]
gi|353631044|gb|EHC78435.1| hypothetical protein LTSEMON_2682 [Salmonella enterica subsp.
enterica serovar Montevideo str. S5-403]
gi|363549711|gb|EHL34044.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. ATCC BAA710]
gi|363553423|gb|EHL37671.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. SARB31]
gi|363553812|gb|EHL38058.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. LQC 10]
gi|363562106|gb|EHL46212.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 29N]
gi|363565821|gb|EHL49845.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. SARB30]
gi|363574126|gb|EHL57999.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 4441 H]
gi|363577604|gb|EHL61424.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 42N]
gi|366055798|gb|EHN20133.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 80959-06]
gi|366059313|gb|EHN23587.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035318]
gi|366071784|gb|EHN35878.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035320]
gi|366072817|gb|EHN36900.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035278]
gi|366074852|gb|EHN38914.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035321]
gi|366083181|gb|EHN47108.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. CT_02035327]
gi|366827670|gb|EHN54568.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. 507440-20]
gi|372204519|gb|EHP18046.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Montevideo str. IA_2010008286]
Length = 347
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|168821866|ref|ZP_02833866.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|409250454|ref|YP_006886265.1| hypothetical protein SENTW_1976 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
gi|205341660|gb|EDZ28424.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Weltevreden str. HI_N05-537]
gi|320086282|emb|CBY96056.1| hypothetical protein SENTW_1976 [Salmonella enterica subsp.
enterica serovar Weltevreden str. 2007-60-3289-1]
Length = 347
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTGSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183
Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
S VIQP + ++ + +L+RS G + S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|204930845|ref|ZP_03221718.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|452120576|ref|YP_007470824.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Javiana str. CFSAN001992]
gi|204320304|gb|EDZ05508.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Javiana str. GA_MM04042433]
gi|451909580|gb|AGF81386.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Javiana str. CFSAN001992]
Length = 347
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 122/347 (35%), Positives = 177/347 (51%), Gaps = 43/347 (12%)
Query: 9 GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
LV + P + + PF+ CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62
Query: 68 QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 63 TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPL 121
Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI---- 179
+ GP KNK +L NG + S+ES W ++++ + D G+ W + P+
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADN 181
Query: 180 YIPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
IP +++ VIQP + ++ + +L+RS G +
Sbjct: 182 AIPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239
Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDN 299
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 300 GESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|418512678|ref|ZP_13078918.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Pomona str. ATCC 10729]
gi|366083506|gb|EHN47427.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Pomona str. ATCC 10729]
Length = 347
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 174/347 (50%), Gaps = 43/347 (12%)
Query: 9 GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
LV + P + + PF+ CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62
Query: 68 QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 63 TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPL 121
Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDN 181
Query: 185 SLS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
++S VIQP + ++ + +L+RS G +
Sbjct: 182 AISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239
Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDN 299
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 300 GESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|437817680|ref|ZP_20842764.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Enteritidis str. SARB17]
gi|435308158|gb|ELO83156.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
Enteritidis str. SARB17]
Length = 347
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 119/346 (34%), Positives = 168/346 (48%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRHVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVTLWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASHDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|343084744|ref|YP_004774039.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342353278|gb|AEL25808.1| hypothetical protein Cycma_2061 [Cyclobacterium marinum DSM 745]
Length = 368
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 165/317 (52%), Gaps = 16/317 (5%)
Query: 10 LVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
L EE+ PF CHASTI + G++L A+F GS E DV IW+ K G W
Sbjct: 45 LTTEEYEHIFGEDRPFAQCHASTITGLGDGNYLAAWFAGSHEKHNDVGIWVSKGKAGDWS 104
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
P + + N P WNPVLF P + L++K+G+E+ W ++ + + G TWSE +L
Sbjct: 105 EPELLVKVRNEPHWNPVLFNAPDGKVYLYFKVGKEIDYWETWVQYTEDGGETWSEARELV 164
Query: 129 PGI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
PG GP +N ++L +G L +S E W +++ + D G++W + +
Sbjct: 165 PGDKGGRGPVRNHMLVLSDGTWLAPASDEKNKVWTVFVDRSEDGGKTWTATEKLDMDRSV 224
Query: 186 LS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++ VIQP + + + +LMRS + G +C S+S D G TWS +L N NSGID
Sbjct: 225 ITGEGVIQPALWESKPGHVHMLMRSSS--GNICRSDSEDYGKTWSKVTEIELPNNNSGID 282
Query: 243 GVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLA-MEFSYPAVIQ 296
++ ++ L YN V+ R + +A+S+D+G +W +E+ E SYPA+
Sbjct: 283 VAHIEGEKIALIYNPVAENWGDRFPISIAVSEDNGKTWPLKFEIEKGEGDDELSYPAMFY 342
Query: 297 ASDGSVHITYTYNRTQI 313
DG + YT+NR +
Sbjct: 343 -EDGHLVACYTWNRETV 358
>gi|238910930|ref|ZP_04654767.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Tennessee str. CDC07-0191]
Length = 347
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 172/347 (49%), Gaps = 43/347 (12%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNLWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISETNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V +++G L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQNGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I S+GS VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII--SEGSVVHITYTWNRKNIVYCRL 344
>gi|412993057|emb|CCO16590.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
Length = 490
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 195/398 (48%), Gaps = 88/398 (22%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVD---------KGHFLVAYFGGSCEGAPDVKIWL 59
+++ +F FP + P S H++TI+ +D + L ++F G+ E APDV I+
Sbjct: 94 AIISSQFIFPVGTYP--SSHSATIIALDNEEEEEEGGRYRLLASWFAGTEENAPDVAIYT 151
Query: 60 QTF--KDGRWQS-PIIA-------DEEPNV------------PMWNPVLFKL-PSNGLLL 96
T+ + WQS P +A D V +WNPVL K +N + L
Sbjct: 152 STYDSRTSSWQSIPDVAVSAFRPDDSSCAVISGGEGCKLRENSLWNPVLVKENENNHVAL 211
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVTWSER-EQLPPGILGPSKNKPILLENGLLLCGSSVES 155
FYK G+ W G +KRS + G TW ++ E+LP I+GP+KNK + + + L L SS E+
Sbjct: 212 FYKAGKHPSIWEGYVKRSKDDGKTWEQQAEKLPRNIIGPAKNKILEVNDRLWLAPSSREA 271
Query: 156 -----WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRS--- 207
+W +E + D G+SW + +P + + IQP + + LR++ R+
Sbjct: 272 NAKDGSRNWKLIVERSRDGGKSWDPEENVEVPFDG-NAIQPSMWRGDDGYLRMVARTATD 330
Query: 208 FNGIGR------------------VCMSESCDG-GLTWSYAKPTQLLNPNSGIDGVKLKD 248
++ GR + S+S D G W AKPT L PNSGID VKL+D
Sbjct: 331 YDENGRRADQVKVKKQTFIRGKKYIIYSKSLDKEGEKWMKAKPTSLPGPNSGIDAVKLRD 390
Query: 249 GRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTL------------------EEN 284
GR+++ YN ++ RG L VA+S DDG +W + T+ +
Sbjct: 391 GRVVIIYNHSWGLKSLGRGKLNVAISHDDGLTWTPSSTVVLESFDDGAISDVNVDKKNQK 450
Query: 285 LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVVLLPN 321
+E SYPAVIQ +HI YTY+R IKHVV+ PN
Sbjct: 451 KPIEMSYPAVIQDPKTDLIHIVYTYDRQTIKHVVINPN 488
>gi|417326873|ref|ZP_12112443.1| hypothetical protein LTSEADE_2802 [Salmonella enterica subsp.
enterica serovar Adelaide str. A4-669]
gi|353572055|gb|EHC35813.1| hypothetical protein LTSEADE_2802 [Salmonella enterica subsp.
enterica serovar Adelaide str. A4-669]
Length = 360
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 171/359 (47%), Gaps = 54/359 (15%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTSIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
G + GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYRYPLSLIASQK 300
Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
R L + S+D+G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 RYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 357
>gi|417475643|ref|ZP_12170405.1| hypothetical protein LTSERUB_3011 [Salmonella enterica subsp.
enterica serovar Rubislaw str. A4-653]
gi|353643625|gb|EHC87773.1| hypothetical protein LTSERUB_3011 [Salmonella enterica subsp.
enterica serovar Rubislaw str. A4-653]
Length = 360
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 173/359 (48%), Gaps = 54/359 (15%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + F CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI----Y 180
+ GP KNK +L NG + S+ES W ++++ + D G+ W + P+
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADNA 182
Query: 181 IPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
IP +++ VIQP + ++ + +L+RS G + S
Sbjct: 183 IPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
+S D G TWS A+ T L N NSGID V + DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSTATGESAK 300
Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
R L + S+D+G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 RYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 357
>gi|402306159|ref|ZP_10825210.1| bacterial alpha-L-rhamnosidase [Prevotella sp. MSX73]
gi|400379926|gb|EJP32755.1| bacterial alpha-L-rhamnosidase [Prevotella sp. MSX73]
Length = 1338
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/267 (38%), Positives = 142/267 (53%), Gaps = 38/267 (14%)
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL 141
WNPVLF +P L LFYK+G V W+G + +S + G TWS+RE LP G +GP KNKP L
Sbjct: 1068 WNPVLFTMPDGELWLFYKVGATVADWTGWLVKSRDGGRTWSDREPLPKGFIGPVKNKPEL 1127
Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---------------- 185
++ G L+CGSS E+ + W +E+ W+ GP+ N +
Sbjct: 1128 VD-GRLVCGSSTEN-DGWRFHVEILDLKTGRWKYVGPVEAENRARTDDVEPGCDMVAPVY 1185
Query: 186 --------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ IQP + L+VLMR+ N ++ S S DGG TWS T++ N
Sbjct: 1186 KQGEGPRPIYSIQPSILRLKDGRLQVLMRTHN--AKLATSFSADGGDTWSPVTLTEVPNN 1243
Query: 238 NSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAME 288
SG D V L+DGR +L YN R + +ALS DDG WH ALTLE++ +
Sbjct: 1244 QSGTDAVTLRDGRHVLIYNDFETLPGTKKGPRTPISIALS-DDGTHWHHALTLEDSPINQ 1302
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKH 315
+SYPA+I+ DG +H YT+ R +I +
Sbjct: 1303 YSYPAIIEGRDGKLHCVYTWRRQRIAY 1329
>gi|168233143|ref|ZP_02658201.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Kentucky str. CDC 191]
gi|194471425|ref|ZP_03077409.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Kentucky str. CVM29188]
gi|194457789|gb|EDX46628.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Kentucky str. CVM29188]
gi|205332708|gb|EDZ19472.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
enterica serovar Kentucky str. CDC 191]
Length = 347
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/346 (34%), Positives = 169/346 (48%), Gaps = 41/346 (11%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARAIFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344
>gi|410031410|ref|ZP_11281240.1| hypothetical protein MaAK2_19509 [Marinilabilia sp. AK2]
Length = 351
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 106/315 (33%), Positives = 169/315 (53%), Gaps = 16/315 (5%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
EF F PF+ CHAS+I + G +++ +F G+ E DV IW+ G W P +
Sbjct: 36 EFVFQG-KMPFEQCHASSIESLGNGQYMMVWFAGTHEKNDDVGIWMAKGSPGSWSVPKLL 94
Query: 74 DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI-- 131
+ + WNPVLF+ + L+LF+K+G+E+ W S + G +WS E+L G
Sbjct: 95 VKVRDDAHWNPVLFQTSAGRLILFFKVGKEIDDWETWWMYSEDAGNSWSNPEELVEGDRG 154
Query: 132 -LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL---S 187
GP +N ++L NG L +S+E W ++++ + D G++W K + + + +
Sbjct: 155 GRGPVRNHMLVLSNGNWLAPASIEKNRVWNAFVDASKDKGKTWVKSDTLQLDRQLVVGEG 214
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+R+ G++ S+S D G TWS T L N NSGID L
Sbjct: 215 VIQPALWESSPGQVHMLLRT--SAGKIGRSDSNDYGETWSPMLLTDLPNNNSGIDVAHLG 272
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEE-NLAMEFSYPAVIQASDGS 301
L LAYN V R + +A+S+D+G SW + +E+ EFSYP++I +G
Sbjct: 273 GRTLALAYNPVGKNWGPRYPMTLAISEDNGHSWPIKIDIEKGEEGNEFSYPSLIH-EEGY 331
Query: 302 VHITYTYNRTQIKHV 316
+ + YT+NRT I+ V
Sbjct: 332 LVLCYTWNRTNIRFV 346
>gi|315607322|ref|ZP_07882322.1| alpha-rhamnosidase [Prevotella buccae ATCC 33574]
gi|315251025|gb|EFU31014.1| alpha-rhamnosidase [Prevotella buccae ATCC 33574]
Length = 1338
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/267 (37%), Positives = 141/267 (52%), Gaps = 38/267 (14%)
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL 141
WNPVLF +P L LFYK+G V W+G + +S + G TWS+RE LP G +GP KNKP L
Sbjct: 1068 WNPVLFTMPDGELWLFYKVGAIVADWTGWLVKSRDGGRTWSDREPLPKGFIGPVKNKPEL 1127
Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---------------- 185
++ G L+CGSS E+ + W +E+ W+ GP+ +
Sbjct: 1128 VD-GRLVCGSSTEN-DGWRFHVEILDLKTGRWKYVGPVEAEGRARTDDVEPGCDMVAPVY 1185
Query: 186 --------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ IQP + L+VLMR+ N ++ S S DGG TWS T++ N
Sbjct: 1186 KQGEGPRPIYSIQPSILRLKDGRLQVLMRTHN--AKLATSFSADGGDTWSPVTLTEVPNN 1243
Query: 238 NSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAME 288
SG D V L+DGR +L YN R + +ALS DDG WH ALTLE++ +
Sbjct: 1244 QSGTDAVTLRDGRHVLIYNDFETLPGTKKGPRTPISIALS-DDGTHWHHALTLEDSPINQ 1302
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKH 315
+SYPA+I+ DG +H YT+ R +I +
Sbjct: 1303 YSYPAIIEGRDGKLHCVYTWRRQRIAY 1329
>gi|323141219|ref|ZP_08076120.1| BNR/Asp-box repeat protein [Phascolarctobacterium succinatutens YIT
12067]
gi|322414362|gb|EFY05180.1| BNR/Asp-box repeat protein [Phascolarctobacterium succinatutens YIT
12067]
Length = 338
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 164/328 (50%), Gaps = 40/328 (12%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP--IIADEEPNVP 80
P CHAST++ + G L +FGG+ EG DV I++ D W P ++ N
Sbjct: 11 PTAFCHASTVLALPDGSLLCCWFGGTHEGESDVGIYISRRTDAGWSEPKLLVNGAAAN-- 68
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL----GPSK 136
WNPVLF LLLFYK GQ++ W + +S + G TWS ++L PG + GP +
Sbjct: 69 -WNPVLFAGADGRLLLFYKQGQQIADWQTWLLQSTDNGETWSGPQELVPGDVSGGRGPVR 127
Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI-------------PN 183
NKP+ L +G +L G+S E W ++ +++ D G SW K + I N
Sbjct: 128 NKPLRLASGRILAGASTEH-GIWKAFADISDDDGASWHKSSAVQIEGLQYQTGEKTADSN 186
Query: 184 ESLS--------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
++S VIQP + +A+ ++ +L+RS G V S+S D G TW A L
Sbjct: 187 IAVSQQSFYGRGVIQPSLWQSADGSVHMLLRSSEGF--VYRSDSADDGETWCSAYALSLP 244
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
N NSG+D V+ G L L N V+ R L + S D+G SW LE A EFS
Sbjct: 245 NNNSGLDLVRTAGGVLYLVCNPVAANWGQRSPLSLFKSTDEGASWQKLFDLETEPA-EFS 303
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YPA+I A + +TYTY R I V L
Sbjct: 304 YPAII-ADGEDLMLTYTYKRCNIACVRL 330
>gi|417358590|ref|ZP_12133452.1| hypothetical protein LTSEGIV_2690 [Salmonella enterica subsp.
enterica serovar Give str. S5-487]
gi|353590824|gb|EHC49250.1| hypothetical protein LTSEGIV_2690 [Salmonella enterica subsp.
enterica serovar Give str. S5-487]
Length = 361
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 169/360 (46%), Gaps = 55/360 (15%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
+S D G TWS A+ T L N NSGID V + DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPQRYPLSLIAG 300
Query: 260 -RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
R L + S+D+G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 301 KRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 358
>gi|323144495|ref|ZP_08079096.1| hypothetical protein HMPREF9444_01764 [Succinatimonas hippei YIT
12066]
gi|322415708|gb|EFY06441.1| hypothetical protein HMPREF9444_01764 [Succinatimonas hippei YIT
12066]
Length = 332
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 172/317 (54%), Gaps = 37/317 (11%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IADEEPNVPMW 82
F CHAS ++++D G + +F GS EG D I++ K G P+ IA + W
Sbjct: 18 FNMCHASNLIKLDDGTIITVFFAGSHEGNSDTSIYMCRAKAGESSEPVKIAGSDE--AHW 75
Query: 83 NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKP 139
NPVLF + L+LFYK+G + W + RSY++G+TWSE E+L PG + GP +NKP
Sbjct: 76 NPVLFAVSDEELVLFYKVGNIIATWRTMIVRSYDRGLTWSEPEELVPGDIGGRGPVRNKP 135
Query: 140 ILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK-----------YGP----IYIPNE 184
+ + +G +LC S+E+ W ++++++ D R+ K + P I + +
Sbjct: 136 VRISSGAILCPGSLEN-GPWRAFLDISYDNLRTLEKSAEISYQEEGNFSPLNKGIEVSEQ 194
Query: 185 SLS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
S S VIQP + ++ + VL+RS G+V ++S DGG T+ P + N NSG+
Sbjct: 195 SFSGKGVIQPTIWED-DKGVHVLLRS--TYGKVIRADSIDGGRTFLKPYPVNMDNNNSGL 251
Query: 242 DGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
D V + +G+L L N V R L + S +DG ++ + L + EFSYP I+
Sbjct: 252 DAVYV-NGKLYLVCNPVGGNWADRTPLTL-FSSEDGINFKEEAVLADGPG-EFSYPC-IR 307
Query: 297 ASDGSVHITYTYNRTQI 313
A D +++I+YTY+R I
Sbjct: 308 AYDNALYISYTYSRKNI 324
>gi|218129082|ref|ZP_03457886.1| hypothetical protein BACEGG_00656 [Bacteroides eggerthii DSM 20697]
gi|217988717|gb|EEC55036.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides eggerthii
DSM 20697]
Length = 1345
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 159/323 (49%), Gaps = 58/323 (17%)
Query: 45 FGGSCEGAPDVKIWLQTFKDGR----WQSPI------IADEEPNVPMWNPVLFKLPSNGL 94
GG+ E AP V + DG+ W +P + DE P +NPVLF++P L
Sbjct: 1029 MGGAREAAPAVPM-----GDGKNYPWWMAPAKNSGKPLVDETWRKPCYNPVLFQIPGGDL 1083
Query: 95 LLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
+L+YKIG V W+G RS N G TW + E L G LGP KNKP+ + NG ++ SS E
Sbjct: 1084 VLYYKIGYGVGDWTGWQVRSKNGGKTWGKPEALAKGFLGPIKNKPVYI-NGRIIAPSSTE 1142
Query: 155 SWNSWGSWMEVTVDAGRSWRKYGPI----YIPN--------------------------E 184
N W E++ D G++W+ GP+ +P +
Sbjct: 1143 G-NGWKFHFEISDDKGKTWKYVGPVDAEYSLPTALRKAGIKKETGDDLEAGEVLTEKGAQ 1201
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+ IQP + L + R+ N ++ ++ S D G TWS + L N NSG D V
Sbjct: 1202 PILCIQPSILQLKDGRLMAIGRTRN--AKLAVTYSSDCGDTWSKVVLSDLPNNNSGTDAV 1259
Query: 245 KLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
L DGR ++ +N + R + +A+S+DDG +W +A+ LE++ ++SYP++I
Sbjct: 1260 TLADGRQVVVFNDFATLPGTPKGVRTPVSIAVSEDDGKTWKNAVVLEDSPISQYSYPSII 1319
Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
Q DG +H YT+ R +I + +
Sbjct: 1320 QGKDGKLHCVYTWRRKRIAYKAI 1342
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDG 65
K +V +EF + A F CH +T+VE++ G + A+FGG+ E PD IW + +
Sbjct: 910 KCIVTDEFLY--EDAGFPQCHGATVVELENGDLVAAFFGGTREKNPDCNIWTCRKPYGSK 967
Query: 66 RWQSPIIA 73
W P A
Sbjct: 968 EWTKPYKA 975
>gi|317474864|ref|ZP_07934134.1| bacterial alpha-L-rhamnosidase [Bacteroides eggerthii 1_2_48FAA]
gi|316909002|gb|EFV30686.1| bacterial alpha-L-rhamnosidase [Bacteroides eggerthii 1_2_48FAA]
Length = 1315
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 159/323 (49%), Gaps = 58/323 (17%)
Query: 45 FGGSCEGAPDVKIWLQTFKDGR----WQSPI------IADEEPNVPMWNPVLFKLPSNGL 94
GG+ E AP V + DG+ W +P + DE P +NPVLF++P L
Sbjct: 999 MGGAREAAPAVPM-----GDGKNYPWWMAPAKNSGKPLVDETWRKPCYNPVLFQIPGGDL 1053
Query: 95 LLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
+L+YKIG V W+G RS N G TW + E L G LGP KNKP+ + NG ++ SS E
Sbjct: 1054 VLYYKIGYGVGDWTGWQVRSKNGGKTWGKPEALAKGFLGPIKNKPVYI-NGRIIAPSSTE 1112
Query: 155 SWNSWGSWMEVTVDAGRSWRKYGPI----YIPN--------------------------E 184
N W E++ D G++W+ GP+ +P +
Sbjct: 1113 G-NGWKFHFEISDDKGKTWKYVGPVDAEYSLPTALRKSGIKKETGDDLEAGEVLTEKGAQ 1171
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+ IQP + L + R+ N ++ ++ S D G TWS + L N NSG D V
Sbjct: 1172 PILCIQPSILQLKDGRLMAIGRTRN--AKLAVTYSSDCGDTWSKVVLSDLPNNNSGTDAV 1229
Query: 245 KLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
L DGR ++ +N + R + +A+S+DDG +W +A+ LE++ ++SYP++I
Sbjct: 1230 TLADGRQVVVFNDFATLPGTPKGVRTPVSIAVSEDDGKTWKNAVVLEDSPISQYSYPSII 1289
Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
Q DG +H YT+ R +I + +
Sbjct: 1290 QGKDGKLHCVYTWRRKRIAYKAI 1312
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDG 65
K +V +EF + A F CH +T+VE++ G + A+FGG+ E PD IW + +
Sbjct: 880 KCIVTDEFLY--EDAGFPQCHGATVVELENGDLVAAFFGGTREKNPDCNIWTCRKPYGSK 937
Query: 66 RWQSPIIA 73
W P A
Sbjct: 938 EWTKPYKA 945
>gi|197286767|ref|YP_002152639.1| hypothetical protein PMI2938 [Proteus mirabilis HI4320]
gi|227355042|ref|ZP_03839453.1| BNR repeat-containing glycosyl hydrolase [Proteus mirabilis ATCC
29906]
gi|194684254|emb|CAR45786.1| conserved hypothetical protein [Proteus mirabilis HI4320]
gi|227164829|gb|EEI49676.1| BNR repeat-containing glycosyl hydrolase [Proteus mirabilis ATCC
29906]
Length = 350
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 169/343 (49%), Gaps = 41/343 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
+F + F CHASTIV V + L A+F G EG+ + IWL + WQ
Sbjct: 9 QFVLDNAHSLFNHCHASTIVRVPNSERLLTAFFAGDKEGSGNTAIWLAIKEGNDWQPAQP 68
Query: 73 ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ P V WNPVL PS G + LFYK G +V W+ S + G +WS +L G
Sbjct: 69 VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128
Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
+ GP KNK +++ NG L SVE W ++++++ D G+ W++ PI
Sbjct: 129 VTPRGPVKNKVLVMSNGEWLAPGSVEDDRYWDAFVDISSDNGQHWQRVDIPIAHHQGGQA 188
Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
+ +++L VIQP + + + +MRS G++ S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWGSQPGHVHAMMRSTR--GKIYRSDSTD 246
Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
G +W A T L N NSGID V DG L L YN S R + V+LS D+G +W
Sbjct: 247 YGRSWCPAYATTLPNNNSGIDVVSFADGLLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
+ L + EFSYPA+I A + ++H+T+T+NR I + L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLI 347
>gi|226327532|ref|ZP_03803050.1| hypothetical protein PROPEN_01403 [Proteus penneri ATCC 35198]
gi|225204058|gb|EEG86412.1| BNR/Asp-box repeat protein [Proteus penneri ATCC 35198]
Length = 350
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 168/343 (48%), Gaps = 41/343 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
+F + F CHASTIV V + L A+F G EG+ + IWL + WQ
Sbjct: 9 QFVLDNAHSLFNHCHASTIVRVPNSERLLTAFFAGDKEGSGNTAIWLVMKEGDHWQHAQP 68
Query: 73 ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ P V WNPVL PS G + LFYK G +V W+ S + G +WS +L G
Sbjct: 69 VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128
Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
+ GP KNK +++ NG L SVE W ++++++ D G+ W + PI
Sbjct: 129 VAPRGPVKNKVLVMSNGEWLAPGSVEDDQFWDAFVDISSDNGQHWLRVDIPIEHHKGGQT 188
Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
+ +++L VIQP + + + +MRS G + S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWESQPGHVHAMMRSTRGT--IYRSDSTD 246
Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
G +W A T L N NSGID V DG+L L YN S R + V+LS D+G +W
Sbjct: 247 YGRSWCPAYATILPNNNSGIDVVSFADGQLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
+ L + EFSYPA+I A + ++H+T+T+NR I + L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLM 347
>gi|425069501|ref|ZP_18472616.1| hypothetical protein HMPREF1311_02686 [Proteus mirabilis WGLW6]
gi|425070956|ref|ZP_18474062.1| hypothetical protein HMPREF1310_00352 [Proteus mirabilis WGLW4]
gi|404596927|gb|EKA97434.1| hypothetical protein HMPREF1311_02686 [Proteus mirabilis WGLW6]
gi|404599781|gb|EKB00234.1| hypothetical protein HMPREF1310_00352 [Proteus mirabilis WGLW4]
Length = 350
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 168/343 (48%), Gaps = 41/343 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
+F + F CHASTI V + L A+F G EG+ + IWL + WQ
Sbjct: 9 QFVLDNAHSLFNHCHASTIARVPNSERLLTAFFAGDKEGSGNTAIWLAIKEGNDWQPAQP 68
Query: 73 ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ P V WNPVL PS G + LFYK G +V W+ S + G +WS +L G
Sbjct: 69 VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128
Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
+ GP KNK +++ NG L SVE W ++++++ D G+ W++ PI
Sbjct: 129 VTPRGPVKNKVLVMSNGEWLAPGSVEDDRYWDAFVDISSDNGQHWQRVDIPIAHHQGGQA 188
Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
+ +++L VIQP + + + +MRS G++ S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWGSQPGHVHAMMRSTR--GKIYRSDSTD 246
Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
G +W A T L N NSGID V DG L L YN S R + V+LS D+G +W
Sbjct: 247 YGRSWCPAYATTLPNNNSGIDVVSFADGLLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
+ L + EFSYPA+I A + ++H+T+T+NR I + L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLI 347
>gi|417421639|ref|ZP_12160118.1| hypothetical protein LTSEMIS_3387 [Salmonella enterica subsp.
enterica serovar Mississippi str. A4-633]
gi|353618915|gb|EHC69466.1| hypothetical protein LTSEMIS_3387 [Salmonella enterica subsp.
enterica serovar Mississippi str. A4-633]
Length = 326
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 160/328 (48%), Gaps = 55/328 (16%)
Query: 41 LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKI 100
+ A+F G CEG+ D IWL ++ W +P WNPVLF PS+ L LFYK+
Sbjct: 1 MAAWFAGLCEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKV 59
Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWN 157
G V W S ++G TWS L G + GP KNK +L NG + S+ES
Sbjct: 60 GSGVHVWKTWFITSSDRGFTWSTPAPLVNGDILPRGPVKNKLLLASNGAWIAPGSIESPE 119
Query: 158 SWGSWMEVTVDAGRSWR-KYGPI----YIPNESLS-----------------------VI 189
W ++++ + D G+ W + P+ IP +++ VI
Sbjct: 120 RWRAFVDRSSDEGKHWNISFVPLEPDNAIPGTNVALWDGVKKGMLWECCLENLLRWDGVI 179
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++DG
Sbjct: 180 QPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDG 237
Query: 250 RLLLAYNTVS-------------------RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
L+LA N V+ R L + S+D+G+SW L LE + E+S
Sbjct: 238 TLILALNPVNGNWGKRYPLSTATGESARKRYPLSLIASQDNGESWLPLLDLESDRG-EYS 296
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YPA+I + G VHITYT+NR I + L
Sbjct: 297 YPAII-SEGGVVHITYTWNRKNIVYCRL 323
>gi|56413761|ref|YP_150836.1| hypothetical protein SPA1594 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|56128018|gb|AAV77524.1| hypothetical protein SPA1594 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
Length = 320
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 153/310 (49%), Gaps = 41/310 (13%)
Query: 45 FGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEV 104
F G EG+ D IWL ++ W +P WNPVLF PS+ L LFYK+G +V
Sbjct: 13 FAGLREGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDV 71
Query: 105 QKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGS 161
W S ++G TWS L + GP KNK +L NG + S+ES W +
Sbjct: 72 HVWKTWFIISSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRA 131
Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLS----------------------------VIQPVP 193
+++ + D G+ W P+ ++S VIQP
Sbjct: 132 FVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTL 191
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++DG L+L
Sbjct: 192 WESSSGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLIL 249
Query: 254 AYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
A N V+ R L + S D+G+SW L LE + E+SYPA+I + G VHITYT+
Sbjct: 250 ALNPVNGNWGKRYPLSLIASHDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTW 307
Query: 309 NRTQIKHVVL 318
NR I + L
Sbjct: 308 NRKNIVYCRL 317
>gi|289810295|ref|ZP_06540924.1| hypothetical protein Salmonellaentericaenterica_40322 [Salmonella
enterica subsp. enterica serovar Typhi str. AG3]
Length = 305
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 151/306 (49%), Gaps = 41/306 (13%)
Query: 49 CEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWS 108
CEG+ D IWL ++ W +P WNPVLF PS+ L LFYK+G V W
Sbjct: 2 CEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWK 60
Query: 109 GCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEV 165
S ++G TWS L + GP KNK +L NG + S+ES W ++++
Sbjct: 61 TWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDR 120
Query: 166 TVDAGRSWRKYGPIYIPNESLS----------------------------VIQPVPFHTA 197
+ D G+ W P+ ++S VIQP + ++
Sbjct: 121 SSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESS 180
Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
+ +L+RS G + S+S D G TWS A+ T L N NSGID V ++DG L+LA N
Sbjct: 181 PGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNP 238
Query: 258 VS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
V+ R L + S+D+G+SW L LE + E+SYPA+I + G VHITYT+NR
Sbjct: 239 VNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKN 296
Query: 313 IKHVVL 318
I + L
Sbjct: 297 IVYCRL 302
>gi|197362683|ref|YP_002142320.1| hypothetical protein SSPA1481 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
gi|197094160|emb|CAR59661.1| hypothetical protein SSPA1481 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. AKU_12601]
Length = 305
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 151/305 (49%), Gaps = 41/305 (13%)
Query: 50 EGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG 109
EG+ D IWL ++ W +P WNPVLF PS+ L LFYK+G +V W
Sbjct: 3 EGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKT 61
Query: 110 CMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVT 166
S ++G TWS L + GP KNK +L NG + S+ES W ++++ +
Sbjct: 62 WFIISSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRS 121
Query: 167 VDAGRSWRKYGPIYIPNESLS----------------------------VIQPVPFHTAN 198
D G+ W P+ ++S VIQP + +++
Sbjct: 122 SDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSS 181
Query: 199 RTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV 258
+ +L+RS G + S+S D G TWS A+ T L N NSGID V ++DG L+LA N V
Sbjct: 182 GHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPV 239
Query: 259 S-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
+ R L + S D+G+SW L LE + E+SYPA+I + G VHITYT+NR I
Sbjct: 240 NGNWGKRYPLSLIASHDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNI 297
Query: 314 KHVVL 318
+ L
Sbjct: 298 VYCRL 302
>gi|406836101|ref|ZP_11095695.1| hypothetical protein SpalD1_30819 [Schlesneria paludicola DSM
18645]
Length = 543
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 110/305 (36%), Positives = 151/305 (49%), Gaps = 24/305 (7%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
H STI E G L ++GGS E A D ++L + G WQ P + P+ N V
Sbjct: 237 HCSTIAEAANGDLLCLWYGGSYESADDQTLFLARRRAGERAWQEPQAILKGPDPLPGNGV 296
Query: 86 LFKLPSNGLLLFYKIGQEV------QKWSGC--MKR-SYNKGVTWS-EREQLPPGILGPS 135
+F + + + + V Q W C M R S + G+TWS + E L +L
Sbjct: 297 IFVDGHKRVWIVWCRMESVRPIGRGQGWDKCRLMARVSTDHGLTWSSDTEFLDQKLLAVP 356
Query: 136 KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
+N PILL +G L+ GS + D G+ WR+ G + P S QP
Sbjct: 357 RNPPILLSSGNLVLPLEAIVDGVEGSIFLIGADGGKQWRRGG--FTPGGS----QPAVIQ 410
Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
+++L LMR R+ ES DGG WS A P+ L NP+SGI +L +G ++ +
Sbjct: 411 RRDKSLFALMRK---APRLTQIESRDGGDNWSQATPSSLRNPDSGISMTQLANGHAVVVF 467
Query: 256 N--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
N ++R L +A S D+G +W L LE N E+SYP VIQ SDG +HITYTY R I
Sbjct: 468 NDTELNRTPLSIARSIDEGHTWETPLHLESNPG-EYSYPCVIQTSDGKIHITYTYRRYSI 526
Query: 314 KHVVL 318
KHV L
Sbjct: 527 KHVEL 531
>gi|227538528|ref|ZP_03968577.1| cytoplasmic protein [Sphingobacterium spiritivorum ATCC 33300]
gi|227241447|gb|EEI91462.1| cytoplasmic protein [Sphingobacterium spiritivorum ATCC 33300]
Length = 309
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 91/225 (40%), Positives = 121/225 (53%), Gaps = 12/225 (5%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
+V ++F F N F CHAST+ E G ++FGG+ EG DV IW F +W +
Sbjct: 1 MVKKDFLFEENRY-FAQCHASTLAETASGEIRASWFGGTHEGNKDVVIWSAAFDGQKWSA 59
Query: 70 PI-----IADEEPNVPMWNPVLFKLPS-NGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
P+ I+ + WNPVLFK N L L+YK+G ++W G +K S + G TWS
Sbjct: 60 PVSVADGISHDTVRYACWNPVLFKTKEDNTLYLYYKVGPNPREWWGAVKTSADNGKTWSS 119
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
LP GILGP KNKP+ L NG++L SSVE + + W +E + D +SW Y PI
Sbjct: 120 ASLLPKGILGPVKNKPLELANGIILSPSSVEVTEDRWMGHVERSDDQQKSWTSY-PI-DH 177
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
N ++IQP A+ L+VL RS G V S S D G WS
Sbjct: 178 NSPFNIIQPSILQHADGRLQVLCRSKEGF--VMTSWSSDEGKNWS 220
>gi|427752421|ref|ZP_18965946.1| hypothetical protein B581_07445, partial [Salmonella enterica
subsp. enterica serovar Typhimurium str. STm5]
gi|414066191|gb|EKT46800.1| hypothetical protein B581_07445, partial [Salmonella enterica
subsp. enterica serovar Typhimurium str. STm5]
Length = 305
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/303 (32%), Positives = 144/303 (47%), Gaps = 39/303 (12%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+ R L + S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300
Query: 273 DSW 275
+SW
Sbjct: 301 ESW 303
>gi|157803702|ref|YP_001492251.1| hypothetical protein A1E_02610 [Rickettsia canadensis str. McKiel]
gi|157784965|gb|ABV73466.1| hypothetical protein A1E_02610 [Rickettsia canadensis str. McKiel]
Length = 363
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 186/362 (51%), Gaps = 62/362 (17%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
EEF N++ K HA+TI D ++++FG S + + IW+ FK
Sbjct: 9 EEFIL--NNSDLKDVHAATIFAFDDQKLIISFFGKTQSESGKSEENNSIWISLGFKKKNH 66
Query: 66 -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
+W +P II+ + + ++P+ NPV+ +N LL+F KIG + WSG
Sbjct: 67 YQWTTPERIISPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
+ RSY++G+TW ++ +L GILGP++NKP++ EN +L S ES S++E + D
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYENN-ILSPCSRESCIDDFSYIEYSSDL- 182
Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
+W PI N + IQP ++N +++M R + + + S
Sbjct: 183 HTWNLSNPILPFNPQIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDSLISTYIHRSI 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
S + G+ WS +P LLNP+S ID + L LLLAYN V SR +L VA S D+G
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDTINLSSDVLLLAYNRVINNHKSRNILSVATSYDEGL 301
Query: 274 SWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVV--L 318
+W +T+ + L+ E+SYPA+I + D + +H+ YT+NR +KH V L
Sbjct: 302 NWR-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHVIYTFNRINLKHKVFQL 360
Query: 319 LP 320
LP
Sbjct: 361 LP 362
>gi|379022881|ref|YP_005299542.1| hypothetical protein RCA_02435 [Rickettsia canadensis str. CA410]
gi|376323819|gb|AFB21060.1| hypothetical protein RCA_02435 [Rickettsia canadensis str. CA410]
Length = 363
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 185/362 (51%), Gaps = 62/362 (17%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
EEF N++ K HA+TI D + ++FG S + + IW+ FK
Sbjct: 9 EEFIL--NNSDLKDVHAATIFAFDDQKLITSFFGKTQSESGKSEENNSIWISLGFKKNNH 66
Query: 66 -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
+W +P II+ + + ++P+ NPV+ +N LL+F KIG + WSG
Sbjct: 67 YQWTTPERIISPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
+ RSY++G+TW ++ +L GILGP++NKP++ EN +L S ES S++E + D
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYENN-ILSPCSRESCIDDFSYIEYSSDL- 182
Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
+W PI N + IQP ++N +++M R + + + S
Sbjct: 183 HTWNLSNPILPFNPQIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDSLISTYIHRSI 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
S + G+ WS +P LLNP+S ID + L LLLAYN V SR +L +A S D+G
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDTINLSSDVLLLAYNRVINNHKSRNILSLATSYDEGL 301
Query: 274 SWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVV--L 318
+W +T+ + L+ E+SYPA+I + D + +H+ YT+NR +KH V L
Sbjct: 302 NWR-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHVIYTFNRINLKHKVFQL 360
Query: 319 LP 320
LP
Sbjct: 361 LP 362
>gi|412987550|emb|CCO20385.1| putative cytoplasmic protein [Bathycoccus prasinos]
Length = 562
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 186/379 (49%), Gaps = 75/379 (19%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEV-----DKGHFLVAYFGGSCEGAPDVKIWL--QT 61
G + ++P CHA+T + + DK LVA+FGGS EGA DV IW +
Sbjct: 163 GATRNQAKLNERTSPTFGCHATTSMHIKNPDTDKLEVLVAWFGGSYEGAEDVGIWTARRD 222
Query: 62 FKDGRWQSPIIADE---------------------EPNV----PMWNPVLF---KLPSNG 93
+ G W P +A + EP P WNPVLF S+G
Sbjct: 223 EETGMWTKPKLAAKKFRNKPYEGKNLKHNSPYNKPEPERKGGEPHWNPVLFCAGAGESDG 282
Query: 94 -----------LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKP 139
++L++K+G ++ W RS + G TWSE ++ G GP KNKP
Sbjct: 283 DGASLGSCEKEIVLYFKLGWKIWFWETWEVRSRDGGNTWSEPKETVKNDKGGRGPVKNKP 342
Query: 140 ILLENGLLLCGSSVE------SWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-LSVIQPV 192
I+L +G L +S+E W ++ ++++D G++W K I +P+E+ IQP
Sbjct: 343 IVLSDGTWLAPASLEGPMPGQKRKQWRAFADISIDLGKTW-KPSSIILPSENGYGNIQPT 401
Query: 193 PFHTANRTLRVLMRSFNGIGRVCM--SESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDG 249
+ + +L+RS G + + S+S DGG TW+ ++ T+L N NSG+D L K
Sbjct: 402 VWESEPGHAHMLLRSSRGRKGMWIWRSDSKDGGKTWTQSRKTKLPNNNSGLDVAYLPKSK 461
Query: 250 RLLLAYNTV----SRGVLKVALSKDDGDSWHDALTLE------ENLAMEFSYPAVI---- 295
L+LAYN V SR +++A+SKD+G++W + L+ N E+SYP+ +
Sbjct: 462 VLVLAYNHVMNVDSRSPIRLAISKDNGENWELQVELDNEKGSVHNAGHEYSYPSCVPWPA 521
Query: 296 -QASDGSVHITYTYNRTQI 313
+ + V +TYT++R ++
Sbjct: 522 KEYGEEGVSVTYTWHRRRV 540
>gi|383312842|ref|YP_005365643.1| hypothetical protein MCE_05845 [Candidatus Rickettsia amblyommii
str. GAT-30V]
gi|378931502|gb|AFC70011.1| hypothetical protein MCE_05845 [Candidatus Rickettsia amblyommii
str. GAT-30V]
Length = 364
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 183/362 (50%), Gaps = 61/362 (16%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
EEF N++ K HA+TI D + ++FG S + + IW+ FK
Sbjct: 9 EEFIL--NNSDLKDVHAATIFAFDDKKLITSFFGKTRSESGKSEENNSIWISLGFKSNNH 66
Query: 66 -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
+W +P I++ + + ++P+ NPV+ +N LL+F KIG + WSG
Sbjct: 67 YQWTTPERIVSPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
+ RSY++G+TW ++ +L GILGP++NKP++ +N +L S ES S++E + D
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYKNNIL-SPCSRESCIDDFSYIEYSSDL- 182
Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
+W PI N + IQP ++N +++M R + + + S
Sbjct: 183 HTWNLSNPILPSNPKIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDALMSTYIHRSI 241
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
S + G+ WS +P LLNP+S ID + L D LLLAYN V SR +L +A+S D+G
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLAYNRVINHLKSRNILSLAISYDEGL 301
Query: 274 SWHDALTLEENL-----------AMEFSYPAVIQAS--DGSVHITYTYNRTQIKHVV--L 318
+W+ + + E+SYPA+I S + +H+ YT+NR +KH V L
Sbjct: 302 NWYPIVIKNSTYPEGDIEYSNITSEEYSYPAIIIMSPDNNEIHVIYTFNRINLKHKVFQL 361
Query: 319 LP 320
LP
Sbjct: 362 LP 363
>gi|417389583|ref|ZP_12153340.1| hypothetical protein LTSEMIN_1521, partial [Salmonella enterica
subsp. enterica serovar Minnesota str. A4-603]
gi|353621404|gb|EHC71229.1| hypothetical protein LTSEMIN_1521, partial [Salmonella enterica
subsp. enterica serovar Minnesota str. A4-603]
Length = 299
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 137/295 (46%), Gaps = 47/295 (15%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + F CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS------ 122
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122
Query: 123 EREQLPPGIL----------GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS 172
+ LP G L GP KNK +L NG + S+ES W ++++ + D G+
Sbjct: 123 NDDILPRGPLVNDDILPLPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKH 182
Query: 173 WRKYGPIYIPNESLS----------------------------VIQPVPFHTANRTLRVL 204
W P+ ++S VIQP + ++ + +L
Sbjct: 183 WNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHML 242
Query: 205 MRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
+RS G + S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 243 LRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 295
>gi|417539863|ref|ZP_12192042.1| hypothetical protein LTSEWAN_3128 [Salmonella enterica subsp.
enterica serovar Wandsworth str. A4-580]
gi|353663716|gb|EHD02332.1| hypothetical protein LTSEWAN_3128 [Salmonella enterica subsp.
enterica serovar Wandsworth str. A4-580]
Length = 305
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 133/282 (47%), Gaps = 34/282 (12%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 282
>gi|417339308|ref|ZP_12120895.1| hypothetical protein LTSEBAI_0535 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
gi|357960520|gb|EHJ84337.1| hypothetical protein LTSEBAI_0535 [Salmonella enterica subsp.
enterica serovar Baildon str. R6-199]
Length = 295
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 133/282 (47%), Gaps = 34/282 (12%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ GP KNK +L NG + S+ES W ++++ + D G+ W P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182
Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+S VIQP + ++ + +L+RS G + S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
+S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 282
>gi|417373939|ref|ZP_12143845.1| hypothetical protein LTSEINV_2977 [Salmonella enterica subsp.
enterica serovar Inverness str. R8-3668]
gi|353601102|gb|EHC56810.1| hypothetical protein LTSEINV_2977 [Salmonella enterica subsp.
enterica serovar Inverness str. R8-3668]
Length = 321
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 128/264 (48%), Gaps = 34/264 (12%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVL 86
CHAST+V + G + A+F G EG+ D IWL ++ W +P WNPVL
Sbjct: 32 CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVL 91
Query: 87 FKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLE 143
F PS+ L LFYK+G +V W S ++G TWS L + GP KNK +L
Sbjct: 92 F-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLAS 150
Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS---------------- 187
NG + S+ES W ++++ + D G+ W P+ ++S
Sbjct: 151 NGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLW 210
Query: 188 ------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L
Sbjct: 211 ECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLP 268
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS 259
N NSGID V ++DG L+LA N V+
Sbjct: 269 NNNSGIDLVSMQDGTLILALNPVN 292
>gi|58267692|ref|XP_571002.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134112085|ref|XP_775578.1| hypothetical protein CNBE2920 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258237|gb|EAL20931.1| hypothetical protein CNBE2920 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227236|gb|AAW43695.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 372
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/343 (31%), Positives = 155/343 (45%), Gaps = 58/343 (16%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLF 87
H STI + G V +FGG+ EG DVKIW +G W P + ++ WNPV F
Sbjct: 22 HCSTI-QAACGVEYVTWFGGTKEGTSDVKIWFSKCTNGCWTPPRVIAGSGDIVHWNPVSF 80
Query: 88 KLPSNG----LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI----LGPSKNKP 139
G L +FYK G + W + + + G TW+E +L PG GP KN P
Sbjct: 81 LPDPKGAPGHLYVFYKTGTPIPTWKTYVIETIDGGNTWTEPRELVPGDHSGGRGPQKNPP 140
Query: 140 ILLENGLLLCGSSVE-----SWNSWGSWMEV--------TVDAGRSWRKYGPIYIPNE-- 184
I+L NG L G+S E W ++ ++ T G W K I +P +
Sbjct: 141 IVLSNGDWLSGASKEVTLEGGKGLWDAFADIAPTPKEGDTFKQGERWVKAEFIKLPADRG 200
Query: 185 -------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+IQP + +A++ V M + IGR+ ++S D G TW+ T L N
Sbjct: 201 VEGGSFPGEGIIQPSLWESADKPGHVHMMCRSSIGRIVRADSEDYGRTWTPGYATDLPNN 260
Query: 238 NSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEEN-------- 284
NSG +L+DGRL+ A N V R L + +S DDG++W TLE+
Sbjct: 261 NSGQCVTRLRDGRLVAAVNDVYKNWGPRTPLVLKVSFDDGETWSPWCTLEDQAPPASFQR 320
Query: 285 -LAM----------EFSYPAVIQASDGS---VHITYTYNRTQI 313
+A+ EFSYP V ++ V ++YT+ R I
Sbjct: 321 VIALETGIVNDGKSEFSYPTVTPTAETDEIGVWVSYTWQRRGI 363
>gi|239947969|ref|ZP_04699722.1| BNR/Asp-box repeat domain protein [Rickettsia endosymbiont of
Ixodes scapularis]
gi|239922245|gb|EER22269.1| BNR/Asp-box repeat domain protein [Rickettsia endosymbiont of
Ixodes scapularis]
Length = 375
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 179/371 (48%), Gaps = 58/371 (15%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK---- 56
M+ I +EF + K HA ++ +++ +V +FG ++
Sbjct: 1 MQNKIPILETTIDEFIL--HDLDLKDVHAVNLLVLNEQELIVTFFGKRKSEPKGLEENNS 58
Query: 57 IW----LQTFKDGRWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFY 98
IW L+ + +W P I++ + + ++P+ NPVL L +N LL+F
Sbjct: 59 IWVCFGLKENRRYQWSIPQRIVSPQYFQDHHIPLKKDNGVISCGNPVL-TLFNNQLLIFS 117
Query: 99 KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNS 158
KIG + WSG + RSY+ GVTW E E L GILGP++NKP++L++ ++L SS ES
Sbjct: 118 KIGPYPRTWSGILSRSYDHGVTWQEPEILH-GILGPTRNKPLILQDNIMLSPSSRESCID 176
Query: 159 WGSWMEVTVDAGRSWRKYGPIYIPNESL------SVIQP--VPF-HTANRTLRVLMRSFN 209
++E + D R W PI N + IQP VP TA + +L+R N
Sbjct: 177 DFPYVESSSDL-RIWHTSSPILTHNPKIFQEGYRGFIQPTLVPLTDTAGHKIIMLVRPRN 235
Query: 210 -----GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----- 259
+ S S + G WS KP LLNP+S ID V L + LLLAYN V
Sbjct: 236 TNPSYSKLHIHRSVSVNKGKDWSNLKPIDLLNPDSAIDAVNLGNNLLLLAYNRVVHYSKF 295
Query: 260 RGVLKVALSKDDGDSWHDA-----------LTLEENLAMEFSYPAVIQASDGS-VHITYT 307
R +L +A+S ++G +W+ + + + E+SYPA++ ++ +H+ YT
Sbjct: 296 RNILSLAVSFNEGLNWYPIKIKYSTFPEGDIEYSKITSQEYSYPAILMDTNNQEIHVVYT 355
Query: 308 YNRTQIKHVVL 318
+NR KH V
Sbjct: 356 FNRVNFKHKVF 366
>gi|414590411|tpg|DAA40982.1| TPA: hypothetical protein ZEAMMB73_041549 [Zea mays]
Length = 99
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/94 (64%), Positives = 74/94 (78%)
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV 262
+L+RS IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDGR+ LAYNTVSRG
Sbjct: 1 MLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDGRVALAYNTVSRGT 60
Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
LKVA+S DDG SW + +TLE EFSYPAVIQ
Sbjct: 61 LKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQ 94
>gi|404399265|ref|ZP_10990849.1| hypothetical protein PfusU_05869 [Pseudomonas fuscovaginae UPB0736]
Length = 372
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 162/363 (44%), Gaps = 77/363 (21%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPIIADEEPNVPM 81
P+ HA+ + E+ G L +F G+ EG D+ + L + +G W P+ E+P
Sbjct: 14 PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRKSPEGEWSEPVKLSEDPTRSE 73
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKN 137
NP+LF+ P L L + + + ++R S++ G+TW + E L PGI +
Sbjct: 74 QNPILFQPPQGPLWLIWTAQVSGNQETAIVRRRLSHDDGLTWGQIETLFDEPGIF--VRQ 131
Query: 138 KPILLENG---------LLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
P++L NG + L G E W + S + ++ DAG SW+++ +P ES
Sbjct: 132 PPVILRNGDWALPVWYCIALPG---EKWVGSHDVSAVMISSDAGHSWQRHD---VP-EST 184
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
++ A+ +L L RS + S S D G TWS PT+L N NS I V+L
Sbjct: 185 GLVHMNIQQMADASLLGLFRS-RWADHIHESRSTDNGRTWSTPVPTELANNNSSIQFVRL 243
Query: 247 KDGRLLLAYNTVS---------------------------------------RGVLKVAL 267
G L L YN V+ R + +A+
Sbjct: 244 ASGELALVYNPVNAEGITERRTSLYDEIEDEGDDRVVPTAKAGGRAAVWGIPRAPMSLAI 303
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S+D+G SW + LE E L E+SYP++IQ +GS+ + +TY R +IKH
Sbjct: 304 SRDEGRSWPVRIDLELGDGFCLTNNSQEKLNREYSYPSIIQTRNGSLQVAFTYFRQRIKH 363
Query: 316 VVL 318
V L
Sbjct: 364 VHL 366
>gi|383764448|ref|YP_005443430.1| hypothetical protein CLDAP_34930 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384716|dbj|BAM01533.1| hypothetical protein CLDAP_34930 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 316
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/310 (31%), Positives = 150/310 (48%), Gaps = 19/310 (6%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIAD 74
FP S + + H +TI E+ G + A+FGGS E APD + + + RW +P
Sbjct: 9 FP-RSDDYPTVHCATIAELPNGDLIAAWFGGSYETAPDQILLTARYDLRQERWSAPRPLA 67
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQL--PP 129
+ PV + + LF+ QE + W + +RS ++G TWS ++L P
Sbjct: 68 VMEGKALGQPVFWVRKDGDIWLFFDAIQE-RDWRSALPFVQRSADEGRTWSAPQRLFDYP 126
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
G++ S+ + G ++ + E N W S M ++ D G+SWR PI P + I
Sbjct: 127 GLMFRSR---LCFLPGRVIVPAYDE--NRWESLMMISDDEGKSWRLTAPIITPQGN---I 178
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P H L +R+ G + + S DGG W +PT L NPNSGID ++L+
Sbjct: 179 HPCLVHLQGGKLLAYLRTGGHGGVIWRTLSFDGGECWEKPEPTALPNPNSGIDLLRLRSD 238
Query: 250 RLLLAYNTVSRGVLKVALSKDD-GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
RLLLAYN + +AL+ + W LE++ E+SYP ++Q + +H+ YTY
Sbjct: 239 RLLLAYNPSAERRTPLALAVASLQEDWRPVRILEQD-DHEYSYPTLLQTQNSDIHLVYTY 297
Query: 309 NRTQIKHVVL 318
R I+H
Sbjct: 298 RRQHIQHACF 307
>gi|390455324|ref|ZP_10240852.1| hypothetical protein PpeoK3_15016 [Paenibacillus peoriae KCTC 3763]
Length = 395
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 157/360 (43%), Gaps = 77/360 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G L +F G+ EG PD+ I+L G W PI ++P NPV
Sbjct: 36 HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWTEPIKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + ++ S ++G TW + L PG +PI
Sbjct: 96 LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C ++ E W N S ++++ D GR+W + +P +SL +
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTEAS---VP-DSLGCVHMN 208
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L L RS + SES D G TWS +PT L N NS I KL +G L
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSSPQPTSLPNNNSSIQFTKLDNGHLA 267
Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
L +N +S R + VA+SKD
Sbjct: 268 LVFNNISANESMERRASLYDEIEDEEEEAVVEPVTVSAAEERQAFWGAPRAPMTVAISKD 327
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+G +W + +E E L E+SYP++ Q+ DG +HI +TY R IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSREKLNREYSYPSIKQSKDGKIHIAFTYFRQAIKYVVI 387
>gi|375306733|ref|ZP_09772026.1| hypothetical protein WG8_0550 [Paenibacillus sp. Aloe-11]
gi|375081120|gb|EHS59335.1| hypothetical protein WG8_0550 [Paenibacillus sp. Aloe-11]
Length = 395
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 158/360 (43%), Gaps = 77/360 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G L +F G+ EG PD+ I+L G W P+ ++P NPV
Sbjct: 36 HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWTEPVKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + ++ S ++G TW + L PG +PI
Sbjct: 96 LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C ++ E W N S ++++ D GR+W + + +P +SL +
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTE---VSVP-DSLGCVHMN 208
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L L RS + SES D G TWS +PT L N NS I KL +G L
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSSPQPTSLPNNNSSIQFTKLDNGHLA 267
Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
L +N +S R + VA+S+D
Sbjct: 268 LVFNNISANESMERRASLYDEIEDEEEEAVVEPVTAPAAEERQAFWGAPRAPMTVAISQD 327
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+G +W + +E E L E+SYP++ Q+ DG +HI +TY R IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSREKLNREYSYPSIKQSKDGKIHIAFTYFRQAIKYVVI 387
>gi|420156329|ref|ZP_14663172.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
gi|394757627|gb|EJF40644.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
Length = 387
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 155/336 (46%), Gaps = 47/336 (13%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
PF +CH+ ++E+ G L +F G+ EG DV I G RW P+ +P
Sbjct: 31 GPFATCHSPAMLELPGGDMLCCWFAGTYEGDADVHIVCARLPKGADRWTEPVAVSGDPRR 90
Query: 80 PMWNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGI 131
NP LF P + Y G++ +++ ++ +S++ G+TWS+ E + P
Sbjct: 91 SEQNPSLFYGPDGAVWCMYTAQLDRVAGKDNMQFTSQIRCQKSFDGGLTWSDYETVFP-R 149
Query: 132 LGPSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
G +PI +L+NG LC S E + S +++ D G++WR + +PN S
Sbjct: 150 EGSFCRQPIQVLKNGRWIFGNWLCTDSEEGLSGDPSVFQLSDDQGKTWRM---VEMPN-S 205
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P N L MRS V SES D G TWS K T L N NS I ++
Sbjct: 206 RGRVHPTVVELDNGHLVAFMRSREA-DNVYRSESLDWGETWSEPKSTPLPNNNSSISAIR 264
Query: 246 LKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEE 283
K GR+ +AYN R + VALS+D G ++ + EE
Sbjct: 265 TKSGRVAVAYNPTCTPSPVPGKAAWPGLRCPVAVALSEDGGLTFPLIRWMERGEGYIGEE 324
Query: 284 NLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
N ++ YP ++Q++DG +H+TY Y R IK+V
Sbjct: 325 NKTNNKQYEYPYLMQSTDGRLHLTYAAYTRKCIKYV 360
>gi|333898278|ref|YP_004472151.1| hypothetical protein Psefu_0070 [Pseudomonas fulva 12-X]
gi|333113543|gb|AEF20057.1| BNR/Asp-box repeat-containing protein [Pseudomonas fulva 12-X]
Length = 374
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 155/357 (43%), Gaps = 68/357 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L G W P E+
Sbjct: 15 PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDATRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NP+LF+ P+ L L + + + ++R S + G+TW E L + ++
Sbjct: 75 EQNPILFQAPNGPLWLIWTAQISGNQETAIVRRRLSEDGGLTWGPIETLFDEVGTFVRHP 134
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
P +L NG + C + E W N S + ++ D GR+W ++ +P + +V
Sbjct: 135 PAVLSNGDWVFPVWYCITQPGEKWIGNHDVSAVMISCDQGRTWSRHD---VPQSTGAVHM 191
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
V A+ TL L RS + S S D G TWS PT+L N NS I ++L G
Sbjct: 192 NV-HQLADGTLLGLFRS-RWADHIYASRSTDLGRTWSVPTPTELPNNNSSIQYIRLASGE 249
Query: 251 LLLAYNTVS---------------------------------------RGVLKVALSKDD 271
L L YN ++ R + +A+S+D+
Sbjct: 250 LALVYNPINAEGIEQRRASLYDEIEDEGDERVTPTVNADGRGAVWGIPRAPMSLAISRDE 309
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G SW L +E E L EFSYP++IQA DGS+H+ +TY R +IKHV
Sbjct: 310 GRSWPLRLDIELGDGFCMTNNSQEKLNREFSYPSIIQARDGSLHVAFTYFRQKIKHV 366
>gi|374321886|ref|YP_005075015.1| hypothetical protein HPL003_10155 [Paenibacillus terrae HPL-003]
gi|357200895|gb|AET58792.1| hypothetical protein HPL003_10155 [Paenibacillus terrae HPL-003]
Length = 395
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 156/360 (43%), Gaps = 77/360 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G L +F G+ EG PD+ I+L G W P+ ++P NPV
Sbjct: 36 HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWMEPVKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + ++ S ++G TW + L PG +PI
Sbjct: 96 LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C ++ E W N S ++++ D GR+W + + +P SL +
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTE---VSVPG-SLGCVHMN 208
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L L RS + SES D G TWS PT L N NS I KL +G L
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSVPIPTSLPNNNSSIQFTKLSNGHLA 267
Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
L +N +S R + VA+SKD
Sbjct: 268 LVFNNISANESMDRRASLYDEIEDEEEETVVESAAPPAVEERQAFWGAPRAPMTVAISKD 327
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+G +W + +E E L E+SYP++ Q+ DG +HI +TY R IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSKEKLNREYSYPSIKQSRDGKIHIAFTYFRQAIKYVVI 387
>gi|298157616|gb|EFH98696.1| glycosyl hydrolase, family 43 [Pseudomonas savastanoi pv.
savastanoi NCPPB 3335]
Length = 374
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+ +
Sbjct: 15 PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDASRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWSRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYRSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|71734525|ref|YP_275595.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|71555078|gb|AAZ34289.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. phaseolicola
1448A]
Length = 395
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+ +
Sbjct: 36 PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 95
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 96 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDKPGTF--VR 153
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 154 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 210
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 211 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 268
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 269 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 328
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 329 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 388
Query: 318 L 318
L
Sbjct: 389 L 389
>gi|289627728|ref|ZP_06460682.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. NCPPB 3681]
gi|289647409|ref|ZP_06478752.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 2250]
gi|422581538|ref|ZP_16656680.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
gi|422596133|ref|ZP_16670416.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|330866387|gb|EGH01096.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
gi|330986433|gb|EGH84536.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
lachrymans str. M301315]
Length = 374
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+ +
Sbjct: 15 PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|422608475|ref|ZP_16680456.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
mori str. 301020]
gi|330892098|gb|EGH24759.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
mori str. 301020]
Length = 374
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+ +
Sbjct: 15 PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSSSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAPRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVP 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|422681666|ref|ZP_16739934.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
tabaci str. ATCC 11528]
gi|331011008|gb|EGH91064.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
tabaci str. ATCC 11528]
Length = 374
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 153/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+
Sbjct: 15 PYAQNHAANMHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDATRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSIDRGRSWSAPTPTELPNNNSSIQFVKLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|416015500|ref|ZP_11563074.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. B076]
gi|320325058|gb|EFW81127.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. B076]
Length = 374
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L + G W P E+ +
Sbjct: 15 PYAQNHAANLHELADGTLLCTWFAGTQEGMVDIFVLLSRRDPATGVWSEPQKMSEDSSRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDKPGTF--VR 132
Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG L E W N S + ++ D G SW ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V + +L L RS + S S D G +WS PT+L N NS I VKL
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L LE E L EFSYP++IQA+DGS+H+ +TY R +IKHV
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|116669604|ref|YP_830537.1| hypothetical protein Arth_1043 [Arthrobacter sp. FB24]
gi|116609713|gb|ABK02437.1| conserved hypothetical protein [Arthrobacter sp. FB24]
Length = 414
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 155/380 (40%), Gaps = 91/380 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ ++ + G +FGG+ EG PD+ IW + G +W P +
Sbjct: 31 APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSSQWSEPQQLSNDSTR 90
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPP-----GIL 132
NP+LF P+ L L Y + + + ++R S + G TW E E L P G+
Sbjct: 91 SEQNPILFTAPNGALWLLYTAQKAGNQDTAEVRRRTSMDGGRTWGEVETLFPANETGGVF 150
Query: 133 GPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ P++L +G L+ + E W NS S + ++ DAG +W ++ +P
Sbjct: 151 --VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAVMISDDAGATWTEH---VLPGS 205
Query: 185 SLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
V IQPV A+ TL L RS + S S D G TWS PT+L N NS I
Sbjct: 206 LGCVHMNIQPV----ADGTLLALFRS-RWADSIYESRSTDDGSTWSEPVPTELPNNNSSI 260
Query: 242 DGVKLKDGRLLLAYNT-------------------------------------------- 257
L DGRL L YN
Sbjct: 261 QFTALADGRLALVYNHSRAEASTERRLSLYDEIDDDGLAEEQGQVAEPDASAFSEDDGSR 320
Query: 258 -----VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDG 300
R + +A+S+D G SW D L N L E+SYP++ Q DG
Sbjct: 321 KAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLSNNSRDGLNREYSYPSIHQGPDG 380
Query: 301 SVHITYTYNRTQIKHVVLLP 320
S++I YTY R IK V + P
Sbjct: 381 SLNIAYTYFRQAIKFVRVDP 400
>gi|440746234|ref|ZP_20925519.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae
BRIP39023]
gi|440371361|gb|ELQ08204.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae
BRIP39023]
Length = 374
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L G W P E+
Sbjct: 15 PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPVTGVWSEPRKMSEDSTRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPDGPLWLIWTAQISGNQETAIIRRRLSLDHGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG + C + E W N S + ++ D G +W ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWVLPVWYCITGPGEKWVGNHDVSAVMISTDQGHNWSRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V A+ +L L RS + S S D G TWS PT+L N NS I V+L
Sbjct: 190 HMNV-HQLADGSLLGLFRS-RWADFIYSSRSSDRGRTWSVPAPTELPNNNSSIQFVRLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSADGHAQRRGSLYDEIEDEGDERITPGPREDGKAAVWGIPRAPMSLAVSR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G +W L LE E L EFSYP++IQA+DGS+H+ +TY R +IKH+
Sbjct: 308 DGGGTWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHIH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|422641302|ref|ZP_16704726.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae Cit 7]
gi|330953690|gb|EGH53950.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae Cit 7]
Length = 374
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 72/361 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
P+ HA+ + E+ G L +F G+ EG D+ + L G W P E+
Sbjct: 15 PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPVTGVWSEPRKMSEDSTRS 74
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
NP+LF+ P L L + + + ++R S + G +W + L PG +
Sbjct: 75 EQNPILFQAPDGPLWLIWTAQISGNQETAIIRRRLSLDHGQSWGPIDTLFDEPGTF--VR 132
Query: 137 NKPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
P++L+NG + C + E W N S + ++ D G +W ++ +P+ + SV
Sbjct: 133 QPPVVLDNGDWVLPVWYCITGPGEKWVGNHDVSAVMISTDQGHNWSRHD---VPDSTGSV 189
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V A+ +L L RS + S S D G TWS PT+L N NS I V+L
Sbjct: 190 HMNV-HQLADGSLLGLFRS-RWADFIYSSRSSDRGRTWSVPAPTELPNNNSSIQFVRLAT 247
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
L L YN VS R + +A+S+
Sbjct: 248 DELALVYNPVSADGHAQRRASLYDEIEDEGDERITPGPREDGKAAVWGIPRAPMSLAVSR 307
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G +W L LE E L EFSYP++IQA+DGS+H+ +TY R +IKH+
Sbjct: 308 DGGGTWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHIH 367
Query: 318 L 318
L
Sbjct: 368 L 368
>gi|359776238|ref|ZP_09279555.1| hypothetical protein ARGLB_037_02350 [Arthrobacter globiformis NBRC
12137]
gi|359306678|dbj|GAB13384.1| hypothetical protein ARGLB_037_02350 [Arthrobacter globiformis NBRC
12137]
Length = 415
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 94/402 (23%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
+ D +K +F + AP HA+ ++ + G +FGG+ EG PD+ IW
Sbjct: 11 ISPDGAVKRADGADFAYLP--APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFS 68
Query: 61 TFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYN 116
T + G +W ++P NP+LF P+ L L Y + + + ++R S +
Sbjct: 69 TLEPGSDQWSPAQQLSDDPTRSEQNPILFAAPNGALWLLYTAQKAGNQDTAEVRRRTSTD 128
Query: 117 KGVTWSEREQLPP-----GILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWM 163
G TW E L P G+ + P+LL +G L+ + E W NS S +
Sbjct: 129 SGRTWGPVETLFPANETGGVF--VRQLPVLLPSGRLIIPIFRCITTPGEKWVGNSDDSAV 186
Query: 164 EVTVDAGRSWRKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
++ D G +W ++ +P V IQPV A+ +L L RS + S S
Sbjct: 187 MISDDGGATWSEH---ILPGSLGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRST 238
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------------------------ 256
D G TWS PT+L N NS I L DGRL L YN
Sbjct: 239 DDGTTWSEPVPTELPNNNSSIQFAALADGRLALVYNHSRAEENTERRLSLYDEIDDDGLA 298
Query: 257 --------------------------TVSRGVLKVALSKDDGDSW--------HDALTLE 282
R + +A+S+D G SW D L
Sbjct: 299 AEQGQLTEPDPAAGAGTDDGGRRAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLS 358
Query: 283 EN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
N L E+SYP+V Q DG+++I YTY R IK+V + P
Sbjct: 359 NNSRDGLNREYSYPSVHQGPDGALNIAYTYFRQAIKYVRVDP 400
>gi|302877447|ref|YP_003846011.1| BNR/Asp-box repeat-containing protein [Gallionella
capsiferriformans ES-2]
gi|302580236|gb|ADL54247.1| BNR/Asp-box repeat protein [Gallionella capsiferriformans ES-2]
Length = 432
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 159/390 (40%), Gaps = 84/390 (21%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
M+ +K A T A+S F HA++ +E+ G +F GS EGA DV I
Sbjct: 34 MQATVPVKVTSARYETHFASSKLFTQVHAASSIELKDGRIRAFWFSGSREGAKDVAIHSA 93
Query: 61 TF--KDGRWQSPIIADEEPNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG 109
F G+W + +IA + NP++ ++ S+G L Y + + W+G
Sbjct: 94 VFDPATGQWSAELIAATREQTQSTLHRYIAKLGNPIVGRM-SDGRLRMYYVAVSLGGWAG 152
Query: 110 ---CMKRSYNKGVTWSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSW 162
S ++G TW+ +L I K P L +G L E + +G
Sbjct: 153 SSITTMTSSDEGTTWTTPRRLITSPFMNISTLVKGAPFLYSDGTLGLPVYHEFISKFGEM 212
Query: 163 MEVTVDAGRSWRKYGPIYIPNESLS-----VIQPVPFHTANRTLRVLMR--SFNGIGRVC 215
+ ++ + + + + L+ +QPV + +VLMR S +G RV
Sbjct: 213 LHLSQEG---------VVLDKQRLAAGGQGTLQPVVLMQNEHSAKVLMRYSSASGPHRVV 263
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGD 273
+ D GL WS + T L NP++ + GV L DGR+L N + R L + LS+D G
Sbjct: 264 AVSTQDAGLHWSAPEKTALRNPDAAVTGVTLPDGRMLAVLNDQELGRDTLSLVLSQDGGV 323
Query: 274 SWHDALTLEE---------------NLAM------------------------------- 287
+W LE+ N A
Sbjct: 324 TWRVVQLLEDQQQASSQPDEANFIKNAAQMIESSDAISTSRLEPAVESARRTVCHDGHCR 383
Query: 288 -EFSYPAVIQASDGSVHITYTYNRTQIKHV 316
EFSYP +IQAS G H+ YT+NRT IKHV
Sbjct: 384 YEFSYPYLIQASGGDFHLVYTWNRTFIKHV 413
>gi|50122687|ref|YP_051854.1| hypothetical protein ECA3765 [Pectobacterium atrosepticum SCRI1043]
gi|49613213|emb|CAG76664.1| conserved hypothetical protein [Pectobacterium atrosepticum
SCRI1043]
Length = 395
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 153/361 (42%), Gaps = 79/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KD-GRWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ KD G W E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKDSGSWSKAAKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
+LENG + C E W N S ++++ D G++WR+Y +PN + V +
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWREYP---VPNSTGCVHMNI 209
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
P TL L RS + S S DGG TWS T L N NS I LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSEPVATDLPNNNSSIQVTTLKNGHL 266
Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
L +N +S R + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW +E E L EFSYP+V QA DG +H+ +TY R IKHVV
Sbjct: 327 DGGKSWPYQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHVAFTYFRQAIKHVV 386
Query: 318 L 318
+
Sbjct: 387 V 387
>gi|392379261|ref|YP_004986420.1| glycosyl hydrolase (modular protein) [Azospirillum brasilense
Sp245]
gi|356881628|emb|CCD02617.1| glycosyl hydrolase (modular protein) [Azospirillum brasilense
Sp245]
Length = 724
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 156/376 (41%), Gaps = 84/376 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G L A+FGG+ EG PDV I + G RW++P+ ++P
Sbjct: 342 SPCVQNHAANLALLPNGDLLCAWFGGTQEGIPDVSIHVARLPAGGDRWEAPVKLSDDPGR 401
Query: 80 PMWNPVLFKLPSNGLLLFYKI---GQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGP 134
NPVLF P L L Y G + C + S ++G +W L G G
Sbjct: 402 SEQNPVLFPAPDGTLWLMYTAQVSGHQDTAIVRC-RVSADQGRSWGPVRTLFEAEGHTGV 460
Query: 135 SKNKPIL-LENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+P++ L+NG LC G+ +W + S + ++ D GR+W ++ +P+ +
Sbjct: 461 FIRQPVVVLDNGDWLLPTFLCHGTPGRAWVGDHDTSAVRISSDGGRTWTEHA---VPDST 517
Query: 186 LSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
V + VP A+ L RS V S S DGG +W+ +PT+L N NS I
Sbjct: 518 GCVHMNIVPL--ADGRLAAFYRS-RWADHVYRSLSSDGGRSWTAPEPTELPNNNSSIQVT 574
Query: 245 KLKDGRLLLAYNTVS--------------------------------------------- 259
L DGRL + YN S
Sbjct: 575 ALSDGRLAIVYNHASAADATGRRLGLYDDIEEAGETANPAAQPAAEPAVEPPAEGRTAFW 634
Query: 260 ---RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHI 304
R L +ALS D G +W LE E L EFSYPA++Q DG + I
Sbjct: 635 GAPRAPLTLALSSDGGVTWPLRRDLEVGDGYCMTNNSKEKLNREFSYPAILQTPDGRLRI 694
Query: 305 TYTYNRTQIKHVVLLP 320
+TY R IK+ L P
Sbjct: 695 AFTYFRQAIKYTELPP 710
>gi|302882514|ref|XP_003040166.1| hypothetical protein NECHADRAFT_44471 [Nectria haematococca mpVI
77-13-4]
gi|256721036|gb|EEU34453.1| hypothetical protein NECHADRAFT_44471 [Nectria haematococca mpVI
77-13-4]
Length = 379
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 158/359 (44%), Gaps = 74/359 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++++ L A+FGGS EG PD+ IWL + G W SP E N NPV
Sbjct: 18 HASNLLQLPDKTLLCAWFGGSQEGLPDISIWLSRLEPGSSSWASPKKISYEENRSCQNPV 77
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGILGPSKNKPI-L 141
LF++P +G + Q+ G +++S ++G+TWSE QL PG G +PI +
Sbjct: 78 LFRVPDSGEIWLLHTSQDAGNQDGAYVLVRKSSDQGITWSEANQLLPGKTGIFTRQPIVI 137
Query: 142 LENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L++G + C G+ W N S + + D G++W + P+ S +V +
Sbjct: 138 LKDGTWVLPVFYCRGTPGHRWIGNDDISGVFYSKDGGKTWNEK---QFPDSSGAVHMNIV 194
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ ++L V + V S S + GL W+ + L NPN GI +L+ G+L +
Sbjct: 195 PPASEKSLWVALYRSRWADNVYRSTSTN-GLDWAKPEALTLPNPNRGICAARLRSGKLAI 253
Query: 254 AYNTVS--------------------------------------RGVLKVALSKDDGDSW 275
+N + R L V +S DDG +W
Sbjct: 254 VFNRSAASPDSLRREGLYDDITPEDDKRPNQVAVGQKSAIWGTPRKALTVGVSDDDGLTW 313
Query: 276 HDAL-------------TLEENLAMEFSYPAV-IQASDGS--VHITYTYNRTQIKHVVL 318
+ + T +EN E SYP++ IQ DG HI +T++R IKHV +
Sbjct: 314 TERVLEDGDGFCGTNSSTGKEN--RELSYPSILIQDGDGPEVTHIAFTFHRQFIKHVRI 370
>gi|337748984|ref|YP_004643146.1| hypothetical protein KNP414_04746 [Paenibacillus mucilaginosus
KNP414]
gi|379721953|ref|YP_005314084.1| hypothetical protein PM3016_4157 [Paenibacillus mucilaginosus 3016]
gi|386724696|ref|YP_006191022.1| hypothetical protein B2K_21565 [Paenibacillus mucilaginosus K02]
gi|336300173|gb|AEI43276.1| hypothetical protein KNP414_04746 [Paenibacillus mucilaginosus
KNP414]
gi|378570625|gb|AFC30935.1| hypothetical protein PM3016_4157 [Paenibacillus mucilaginosus 3016]
gi|384091821|gb|AFH63257.1| hypothetical protein B2K_21565 [Paenibacillus mucilaginosus K02]
Length = 354
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 149/328 (45%), Gaps = 47/328 (14%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
HA+ ++ +D G L +F GS EG PD + L G RW +P +P NP
Sbjct: 30 SHAANLLSLDNGDLLCTWFTGSGEGNPDTNVVLSRLPAGSDRWTTPEQLSSDPERSEQNP 89
Query: 85 VLFKLPSNGLLLFYKIGQ-EVQKWSGCMKR-SYNKGVTWSEREQLP--PGILGPSKNKPI 140
VLF+ P L L + + QK S + R S ++G+TW E + PGI ++ P+
Sbjct: 90 VLFQAPDGRLWLLHTSNEPHNQKTSRIVTRLSEDRGLTWGPAEVITDRPGIF--LRHPPV 147
Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRT 200
+ +G + + + S + + D GR+W + +P S+ +Q ++ +
Sbjct: 148 VTGSGDWVLPAYYCRLDGHYSVVLRSTDQGRTWTE---TEVPG-SIHRVQMNIVPRSDGS 203
Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS- 259
L + RS R+ MSES D G TWS + L N NS +L DGR+ L YN +
Sbjct: 204 LLAMFRSRQA-DRIYMSESVDAGRTWSPPVKSPLANNNSSTQAARLTDGRIALIYNDSTM 262
Query: 260 -------------------RGVLKVALSKDDGDSWHDALTLEENLAME----------FS 290
R L +++S+D+G +W L N+ M +S
Sbjct: 263 ERDQFRWVQRKGETRKKPLRTPLTLSISEDEGRTW----PLTRNVQMADLEYKESEVGYS 318
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YP++I DG +HI ++Y R IK+V +
Sbjct: 319 YPSIIAPGDGRLHIAFSYLRKGIKYVCV 346
>gi|405120802|gb|AFR95572.1| BNR/Asp-box repeat family protein [Cryptococcus neoformans var.
grubii H99]
Length = 354
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 136/307 (44%), Gaps = 57/307 (18%)
Query: 64 DGRWQSPIIADEEPNVPMWNPVLFKLPSNG----LLLFYKIGQEVQKWSGCMKRSYNKGV 119
DGRW P + ++ WNPV F G L +FYK G + W + + + G
Sbjct: 39 DGRWTPPRVIAGRGDIVHWNPVSFLPDPKGAPGHLYVFYKTGTPIPTWKTYVIETVDGGT 98
Query: 120 TWSEREQLPPGI----LGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEV----- 165
TW+E +L PG GP KN PI+L NG L G+S E W ++ ++
Sbjct: 99 TWTEPRELVPGDHSGGRGPQKNPPIVLSNGDWLSGASKEVTLGGGKGLWDAFADIAPSPK 158
Query: 166 ---TVDAGRSWRKYGPIYIPNE---------SLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
T G W K I +P + +IQP + +A + V M + IGR
Sbjct: 159 EGDTFKQGERWVKAEFIKLPADRGVEGGSFPGEGIIQPSLWESAEKPGHVHMMCRSSIGR 218
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALS 268
+ ++S D G TW+ T L N NSG +L+DGRL++A N V R L + +S
Sbjct: 219 IVRADSEDYGRTWTPGYATDLPNNNSGQCVTRLRDGRLVIAVNDVYTNWGPRTPLVLKVS 278
Query: 269 KDDGDSWHDALTLEEN---------LAM----------EFSYPAVIQASDGS---VHITY 306
DDG +W TLE+ +A+ EFSYP V ++ V ++Y
Sbjct: 279 SDDGKTWTSWCTLEDQAPPASFQKVIALETGIVNDGKSEFSYPTVTPTAETDEIGVWVSY 338
Query: 307 TYNRTQI 313
T+ R I
Sbjct: 339 TWQRRGI 345
>gi|253689902|ref|YP_003019092.1| hypothetical protein PC1_3540 [Pectobacterium carotovorum subsp.
carotovorum PC1]
gi|251756480|gb|ACT14556.1| conserved hypothetical protein [Pectobacterium carotovorum subsp.
carotovorum PC1]
Length = 395
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDNWSKAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G +W E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDFGASWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
+L NG + C E W N+ S ++++ D G++WR+Y +PN + V +
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDQGKTWREYP---VPNSTGCVHMNI 209
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
P TL L RS + S S DGG TWS PT L N NS I LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTDLPNNNSSIQVTTLKNGHL 266
Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
L +N +S R + +A+S+
Sbjct: 267 ALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGSRSAFWGAPRAPMTLAISE 326
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW +E E L EFSYP+V QA DG +H+ +TY R IKHVV
Sbjct: 327 DGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKHVV 386
Query: 318 L 318
+
Sbjct: 387 V 387
>gi|255954997|ref|XP_002568251.1| Pc21g12200 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211589962|emb|CAP96117.1| Pc21g12200 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 392
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 147/357 (41%), Gaps = 73/357 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
CHAS ++ + G L +FGG+ EG PD+ I+L G W + + NP
Sbjct: 37 CHASNLLRLPNGDLLCTWFGGNMEGKPDISIYLSRLPSGGQTWGEAVKVTHDDGRSEQNP 96
Query: 85 VLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI 140
VLF+ PS L L Y Q + S +KR S + G TWS L PG + I
Sbjct: 97 VLFRHPSGELWLLYTSQQGGNQDSAVVKRVISSDNGATWSGPTILFDDPGTF--IRQPVI 154
Query: 141 LLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+LENG+ + G W N S + + D GR+W + + +P ES +
Sbjct: 155 ILENGVFVVPTFKCRGEPGAKWIGNDDISVIRTSADQGRTWSE---VAVP-ESTGCVHME 210
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N + L RS + +S S D GL WS +PT L NPN+GI L GR++
Sbjct: 211 IQRLKNGSYLALYRS-RWADYIHLSTSSD-GLDWSAPQPTSLPNPNAGICFDVLSSGRVV 268
Query: 253 LAYNTVS----------------------------------------RGVLKVALSKDDG 272
YN S R L VA S DDG
Sbjct: 269 AVYNHSSRKNAEARREGLYDEITEEGQDTRPNQKDRTDGKEAFWGAPRAPLCVAWSDDDG 328
Query: 273 DSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW D L E+ E SYP+++ DG++HI +T+ R IK+V +
Sbjct: 329 KSWQHRTLEEGDGFCLTNNSEQKRNRELSYPSMVVGEDGTIHIAFTFWRQTIKYVQI 385
>gi|290509605|ref|ZP_06548976.1| glycosyl hydrolase, BNR repeat [Klebsiella sp. 1_1_55]
gi|289778999|gb|EFD86996.1| glycosyl hydrolase, BNR repeat [Klebsiella sp. 1_1_55]
Length = 388
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 156/356 (43%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G + +FGGS EG D+ IW G RW + ++P+ NPV
Sbjct: 33 HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P N L L + + + ++ S++ G +W + L PG +PI
Sbjct: 93 LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ +G + C + E W N+ S ++++ D G+SWR + +P ESL + +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMI 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L RS + S+S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATTLDNGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G+SW
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + E+ L EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380
>gi|221065827|ref|ZP_03541932.1| BNR repeat-containing glycosyl hydrolase [Comamonas testosteroni
KF-1]
gi|220710850|gb|EED66218.1| BNR repeat-containing glycosyl hydrolase [Comamonas testosteroni
KF-1]
Length = 384
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 149/362 (41%), Gaps = 67/362 (18%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ ++ +D G +FGGS EG D+ +++ G +W PI +
Sbjct: 22 SPCVQAHAANLMVLDDGTLGCVWFGGSMEGRSDISVFMSRLDPGAAQWSEPIQLSHDAER 81
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NPVLF P L L + Q + + ++R S ++G++W E L G
Sbjct: 82 SEQNPVLFPAPGGELWLLHTAQQSGHQNTSVVRRRLSRDQGLSWEPTETLADAPAGTFVR 141
Query: 138 KPILLENG----LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
+PI + L + + +W S + + D GRSW++ I +P SL
Sbjct: 142 QPIHVHTDGSWLLPVFHCRAQPGQAWDGSHDDSGVLRSADQGRSWQR---IAVPG-SLGC 197
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ ++ L RS V S S D GL+W + T+L N NS I ++L D
Sbjct: 198 VHMNIVQASDGGLLAFFRS-RWADHVYRSRSDDCGLSWQEPEATELPNNNSSIQALRLAD 256
Query: 249 GRLLLAYNTVS-------------------------------------RGVLKVALSKDD 271
GRL + +N S R + +ALS DD
Sbjct: 257 GRLAMIFNASSAADATQRRESLYDELGDSSATPAKAAAGERRAFWGAPRAPMTLALSADD 316
Query: 272 ------------GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
GD W + E E+SYP++ Q++DG++H+ YT R I+HV +
Sbjct: 317 GLSWPWQRNLEVGDGWCMSNDSEHGRNREYSYPSIRQSADGALHLAYTVFRQHIRHVRVQ 376
Query: 320 PN 321
P+
Sbjct: 377 PD 378
>gi|365856121|ref|ZP_09396145.1| BNR/Asp-box repeat protein [Acetobacteraceae bacterium AT-5844]
gi|363718349|gb|EHM01690.1| BNR/Asp-box repeat protein [Acetobacteraceae bacterium AT-5844]
Length = 402
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 153/358 (42%), Gaps = 72/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G +FGG+ EG PD+ + + G RW +P+ ++ NP+
Sbjct: 40 HAANLMPLPDGSLGCVWFGGTQEGVPDISAYFSRLEKGGTRWSAPVRLSDDDTRSEQNPL 99
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL---PPGILGPSKNKPI 140
LF PS L L + + + +++ S + G +W E L PG + +
Sbjct: 100 LFPAPSGELWLIWTAQISGNQDTAIVRKRISRDNGRSWGPIETLFGPRPGGGTFMRQPVV 159
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+LENG + +C S+ + W + S + ++ D GRSW + + +P ESL +
Sbjct: 160 VLENGDWLLPVWICASTPGKKWVGDDDTSAVMISSDRGRSWTE---VAVP-ESLGCVHMS 215
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL RS V +S S D G +WS +PT+L N NS I +L +G L
Sbjct: 216 IVDLRNGTLAAFYRS-RWADHVYVSRSTDNGRSWSAPQPTELPNNNSSIQVTRLANGHLA 274
Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
+ +N V R + VA+S+DDG
Sbjct: 275 MVFNESSAEDATERRLSLYDDIEDDTDDGKVVAPIDTSRRTTFWGVPRAPMTVAISEDDG 334
Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW LE E L EFSYP+++Q +G +HI YT R IK+V +
Sbjct: 335 QSWPFRRNLETGDGYCMTNNSREGLNREFSYPSIVQGPEGDLHIAYTVYRQAIKYVTV 392
>gi|444307837|ref|ZP_21143550.1| hypothetical protein G205_22151 [Arthrobacter sp. SJCon]
gi|443479848|gb|ELT42830.1| hypothetical protein G205_22151 [Arthrobacter sp. SJCon]
Length = 413
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 155/381 (40%), Gaps = 91/381 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ ++ + G +FGG+ EG PD+ IW + G +W SP ++
Sbjct: 30 APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSSQWSSPEQLSDDSTR 89
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPP-----GIL 132
NP+LF P L L Y + + + ++R S + G TW E E L P G+
Sbjct: 90 SEQNPILFTAPDKSLWLLYTAQKAGNQDTAEVRRRISTDSGRTWGEVETLFPANETGGVF 149
Query: 133 GPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ P++L +G L+ + E W NS S + ++ D G +W ++ +P
Sbjct: 150 --VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAVMISEDGGATWTEH---VLPGS 204
Query: 185 SLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
V IQPV A+ +L L RS + S S D G TWS PT+L N NS I
Sbjct: 205 LGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRSTDDGSTWSEPVPTELPNNNSSI 259
Query: 242 DGVKLKDGRLLLAYNT-------------------------------------------- 257
L DGRL L YN
Sbjct: 260 QFTALADGRLALVYNHSRAGEGTERRLSLYDEIDDDGLADEQGQVAEPDAAAFSEDDGVK 319
Query: 258 -----VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDG 300
R + +A+S+D G +W D L N L E+SYP++ Q DG
Sbjct: 320 RAFWGTPRSPMTLAISEDSGRTWPIRRNLDVGDGYCLSNNSRDGLNREYSYPSIHQGPDG 379
Query: 301 SVHITYTYNRTQIKHVVLLPN 321
+++I YTY R IK V + P
Sbjct: 380 ALNIAYTYFRQAIKFVRVDPQ 400
>gi|302540315|ref|ZP_07292657.1| BNR/Asp-box repeat protein [Streptomyces hygroscopicus ATCC 53653]
gi|302457933|gb|EFL21026.1| BNR/Asp-box repeat protein [Streptomyces himastatinicus ATCC 53653]
Length = 401
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 156/373 (41%), Gaps = 81/373 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ + + G +FGG+ EG PD+ +W G W P+ ++P
Sbjct: 32 APAVQNHAANLTVLPGGDLGCVWFGGTQEGVPDISVWFSRLAPGADTWSEPVRLSDDPAR 91
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGILGPSK- 136
NP+LF P+ L L Y Q + + ++ S + G TW L P G
Sbjct: 92 SEQNPLLFPTPAGELWLLYTAQQAGNQDTAEVRVRVSGDNGDTWDAPRTLFPATGGGGVF 151
Query: 137 -NKPI-LLENGLLL-----CGSSVES-W--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+PI +L++G L C S+ + W + S + V+ D GR+WR+ +P +
Sbjct: 152 IRQPIAVLDSGRWLLPVFHCVSTPGAKWVGDHDTSAVMVSDDEGRTWRER---PVPGSTG 208
Query: 187 SV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
V + P+P + TL L RS V S S DGG +WS +PT+L N NS +
Sbjct: 209 CVHMNVHPLP----DGTLLALFRS-RWADAVYRSHSTDGGESWSEPEPTELPNNNSSVQY 263
Query: 244 VKLKDGRLLLAYNTVS-------------------------------------------R 260
V L DGRL L YN S R
Sbjct: 264 VPLGDGRLALVYNHSSRADATARRVSLYDEIDDDGEAGDAPAAAPSQREDAPGAFWGAPR 323
Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
+ +ALS D G +W L+ + L E SYP + Q DG++HI YTY
Sbjct: 324 APMTLALSADGGRTWPVRRDLDVGDGHCLTNNSRDRLNRELSYPTIRQTPDGTLHIAYTY 383
Query: 309 NRTQIKHVVLLPN 321
+R IK+V + P+
Sbjct: 384 HRQAIKYVRVSPD 396
>gi|417305704|ref|ZP_12092654.1| conserved hypothetical protein, secreted, partial [Rhodopirellula
baltica WH47]
gi|327538010|gb|EGF24704.1| conserved hypothetical protein, secreted [Rhodopirellula baltica
WH47]
Length = 239
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 99/202 (49%), Gaps = 17/202 (8%)
Query: 5 CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
G LV+ F P K HASTIVE G + A+F G+ E PDV I + ++
Sbjct: 28 AGEGALVSSRQIFDLADKPTKESHASTIVETPNG-LVAAWFAGTRERDPDVGIRVSRHEN 86
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W + + P WNPVLF+ L+LFYK+G ++W G + S + G
Sbjct: 87 GQWTDSVEVVSGVQSSMLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146
Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
TWS +L +LGP KNKP+ L +G +LC SS E + W EVT D
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPVELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206
Query: 170 GRSWRKYGPIYIPNESLSVIQP 191
G++W GPI E+ IQP
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQP 227
>gi|407978690|ref|ZP_11159518.1| hypothetical protein BA1_05802 [Bacillus sp. HYC-10]
gi|407414721|gb|EKF36351.1| hypothetical protein BA1_05802 [Bacillus sp. HYC-10]
Length = 391
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 165/380 (43%), Gaps = 75/380 (19%)
Query: 7 IKGLVAEEFTFPANSAPF--KSC---HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
I G++ + P F +C HA+ ++E+D G L +FGG+ EG PD+ I++
Sbjct: 10 IDGILRQNKQDPVRKEAFLPTNCVQNHAANLIELDNGDVLCVWFGGTQEGIPDISIYMSR 69
Query: 62 FKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNK 117
K G +W + + NPVLF+ S L L Y + + ++ S ++
Sbjct: 70 LKQGSDQWTEAVKLSNDSTRSEQNPVLFQEKSGRLWLLYTAQLSGNQDTAIVRYRVSEDR 129
Query: 118 GVTWSEREQL--PPGILGPSKNKPILLENG-----LLLCGS--SVE-SWNSWGSWMEVTV 167
G TW E L PG + ++L+N + C + V+ + N S ++++
Sbjct: 130 GETWGEIGTLFDQPGTF--IRQPIVVLDNDDWLLPVFYCKTIPGVKWTGNRDVSAVKISS 187
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W + + +PN + V + A+ TL L RS + MS S D G TWS
Sbjct: 188 DQGKTWEE---VIVPNSTGCVHMNIG-KCADGTLIALFRS-RFADSIYMSRSVDHGRTWS 242
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------------------------- 259
K T+L N NS I L++G L L YN ++
Sbjct: 243 EPKATELPNNNSSIQFTVLQNGHLALVYNHINADDQTERRASLYDEIEDEGDTRTAVDTE 302
Query: 260 ---------RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQAS 298
R + +A+SKDDG +W LE + L E+SYP++ +
Sbjct: 303 ARPAFWGTPRAPMTLAISKDDGATWPVRRNLEIGDGYAMTNNSKDKLNREYSYPSITEGK 362
Query: 299 DGSVHITYTYNRTQIKHVVL 318
DG +HI +T+ R IK+V +
Sbjct: 363 DGKLHIAFTFYRQAIKYVCV 382
>gi|321261720|ref|XP_003195579.1| hypothetical protein CGB_H0050C [Cryptococcus gattii WM276]
gi|317462053|gb|ADV23792.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 394
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 158/366 (43%), Gaps = 65/366 (17%)
Query: 10 LVAEEFTFPANS-APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV EF + AP C T + V G +FGGS EGA D +IW DG+W
Sbjct: 23 LVTREFVSKDDERAPSIHC---TNIAVAGGVRYSTWFGGSKEGATDNQIWFSKNADGKWT 79
Query: 69 SP--IIADEE-PNVPMWNPVLF---KLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
P I + E +V WNPVLF + + +F+K + W S + G TWS
Sbjct: 80 EPRAIAGNREGEDVVYWNPVLFIPDRSQPKTIHVFFKKHTPIPVWVTFWMDSDDGGDTWS 139
Query: 123 EREQLPPGI-----LGPSKNKPILLENGLLLCGSSVESWNSWGS------WMEVTVD--- 168
E +L PG GP KN PI+L NG L S E N GS W +V +
Sbjct: 140 EPRELVPGPDGARGRGPQKNPPIVLSNGDWLSAGSYEVTNPPGSGAAGDSWADVALKPGP 199
Query: 169 -----AGRSWRKYGPIYIPNE---------SLSVIQPVPFHTANRTLRVLMRSFNGIGRV 214
G W + I +P++ VIQP + + ++ M + +G V
Sbjct: 200 GDDFKQGDKWVRSKLIELPSDRGKSDGGFPGEGVIQPGIWESKDQPGHCHMMMRSSVGCV 259
Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSK 269
++S D G TW A T L N NSG+ L+DGR++ A N + R L +A+S+
Sbjct: 260 IQADSVDYGRTWGPAFRTSLPNNNSGLCITTLRDGRVVWAGNYQTQNWGPRTPLCLAISE 319
Query: 270 DDGDSWHDALTLEE-------------------NLAMEFSYPAVI---QASDGSVHITYT 307
DDG +W TLE+ + EFSYP +I + V +++T
Sbjct: 320 DDGKTWKLWATLEDAPPPEDFKRVIALETGIVNDGRSEFSYPCLIPTENDDEDGVWMSWT 379
Query: 308 YNRTQI 313
+ R I
Sbjct: 380 WQRRGI 385
>gi|301060837|ref|ZP_07201651.1| BNR/Asp-box repeat protein [delta proteobacterium NaphS2]
gi|300445086|gb|EFK09037.1| BNR/Asp-box repeat protein [delta proteobacterium NaphS2]
Length = 398
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 48/345 (13%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG---R 66
+ +F P + H +I E+ G A++GG+ EGA DV I+ T G
Sbjct: 51 FIKHDFINPGTAGAM--VHVGSICEMTDGQLAAAWYGGTREGAKDVAIFFSTKPPGTNTS 108
Query: 67 WQSP-IIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRS 114
W P +I D + NP+LF P L L Y + V WSG +K S
Sbjct: 109 WSPPKVIVDRAKATRELKRYIRKVGNPILFSGPGKQLWLIY-VSIAVGGWSGSSINVKTS 167
Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENG---LLLCGS-------SVESWNSWGS--- 161
+ G W+E ++L L P N L++ L+ G+ +V ++ +
Sbjct: 168 RDGGERWTESQRL---TLSPFFNISELVKGRPLPLITMGARMNQDRFAVPIYHEFLGNFP 224
Query: 162 ---WMEVTVDAGRSWRKYGPIYIPNESL---SVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
W+ D+GR +Y + S IQP RS + RV
Sbjct: 225 EILWISFCKDSGRV------VYEKSRMAGGKSFIQPSIAALNTHFATAFYRSVSTDKRVA 278
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGD 273
+ + D G W+ L NP+S +D + L+ +LLA+N SR ++++A SKD G
Sbjct: 279 AATTEDCGQNWTVPDYLDLPNPDSAVDALPLQRNCILLAFNDSRNSREIMQLAFSKDGGK 338
Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W TLE + EFSYP +I+ +G +H+ YT+ R +IKH+V
Sbjct: 339 NWTRIHTLENSPNEEFSYPYMIRGRNGLIHVVYTWKRKRIKHLVF 383
>gi|291521032|emb|CBK79325.1| Predicted neuraminidase (sialidase) [Coprococcus catus GD/7]
Length = 386
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 153/334 (45%), Gaps = 47/334 (14%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
+ + HA ++E++ G L A+F GS EG+ D+ I +++G W P+ +
Sbjct: 33 YSTAHAPALLELENGDMLCAWFAGSFEGSKDISIVCARWENGSDHWSEPVQVSYDSERSE 92
Query: 82 WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
NP LFK P + Y G++ +++ ++ +S++ G TW E E + P G
Sbjct: 93 QNPSLFKGPDGAVWCMYTAQLDRMEGKDNMQFTSIIRCQKSFDGGKTWGEAEVIFPE-EG 151
Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+PI +L NG + C SV + ++ D GR+W+K + +P + +
Sbjct: 152 SFCRQPIQVLSNGRWIFGNWICTDSVNGLEGDPTAFRISDDEGRTWKK---VDMPESNGA 208
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
V V + + L F + +SES D G TW+ PT L N NS I +KL
Sbjct: 209 VHANVVELEGGKLVCFLRSRF--ADNIYISESKDWGDTWTKPVPTVLPNNNSSISALKLA 266
Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
GR+ +AYN R + VALS+D G +W + T +EN
Sbjct: 267 SGRIAIAYNPTHAPHPVYGKVAWPGLRCPVAVALSEDGGKTWPMIRVMERGEGFTGDENT 326
Query: 286 A--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
++ YP ++Q DG +H+ + Y NR IK++
Sbjct: 327 TNNKQYEYPYLMQGKDGRLHLAFAYKNRIGIKYM 360
>gi|261822947|ref|YP_003261053.1| hypothetical protein Pecwa_3711 [Pectobacterium wasabiae WPP163]
gi|261606960|gb|ACX89446.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163]
gi|385873395|gb|AFI91915.1| BNR/Asp-box repeat protein [Pectobacterium sp. SCC3193]
Length = 395
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSKAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
+LENG + C E W N+ S ++++ D G++WR+Y +PN + V +
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDRGKTWREYS---VPNSTGCVHMNI 209
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
P TL L RS + S S DGG TWS T L N NS I L++G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLQNGHL 266
Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
L +N +S R + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW +E E L EFSYP+V QA DG +H+ +TY R IKHVV
Sbjct: 327 DGGKSWPWQRNVEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHLAFTYFRQAIKHVV 386
Query: 318 L 318
+
Sbjct: 387 V 387
>gi|403059973|ref|YP_006648190.1| hypothetical protein PCC21_035340 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
gi|402807299|gb|AFR04937.1| hypothetical protein PCC21_035340 [Pectobacterium carotovorum
subsp. carotovorum PCC21]
Length = 395
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 83/363 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSQAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
+L NG + C E W N S ++++ D G++W++Y +PN + V I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P+ + TL L RS + S S DGG TWS PT+L N NS I L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTELPNNNSSIQVTTLENG 264
Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
L L +N +S R + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGGRSAFWGAPRAPMTLAI 324
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S+D G SW +E E L EFSYP+V QA DG +H+ +TY R IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384
Query: 316 VVL 318
VV+
Sbjct: 385 VVV 387
>gi|397168581|ref|ZP_10492019.1| BNR/Asp-box repeat family protein [Enterobacter radicincitans DSM
16656]
gi|396090116|gb|EJI87688.1| BNR/Asp-box repeat family protein [Enterobacter radicincitans DSM
16656]
Length = 385
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 153/363 (42%), Gaps = 72/363 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ I+ + G L +FGGS EG D+ ++ G RW + ++P
Sbjct: 26 SPCPQNHAANILPLPDGTLLCVWFGGSQEGVADISVYGSRLAPGAQRWSDAVKLSDDPTR 85
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPS 135
NPVLF P N L L + + + + ++S + G +W E L PG
Sbjct: 86 SEQNPVLFLAPDNTLWLLWTAQISGNQDTAIVRYRQSADFGASWGEIGTLLDKPGTF--- 142
Query: 136 KNKPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+PI +LENG + C + E W N S ++++ D G+SWR + +P +SL
Sbjct: 143 IRQPITVLENGNWLLPVFYCRTQPGEKWVGNDDISAVKISADRGKSWRD---VDVP-QSL 198
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
+ TL L RS + S S DGG +WS +PT L N NS I L
Sbjct: 199 GCVHMNITALPGGTLVALFRS-RWADNIYYSHSTDGGESWSVPEPTVLPNNNSSIQVTTL 257
Query: 247 KDGRLLLAYNTVS-------------------------------------RGVLKVALSK 269
DG L L +N +S R + VA+S
Sbjct: 258 TDGTLALVFNHMSAAGATERRASLYDEIDDGDGRKEPTVTSGRSAFWGAPRAPMTVAISP 317
Query: 270 DDGDSWHDALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW L +E L EFSYP++ Q +DG +HI YT+ R IK+V
Sbjct: 318 DGGKSWPWQRNLDEGDGYCMTNNSQEKLNREFSYPSIKQGADGDLHIAYTWFRQAIKYVR 377
Query: 318 LLP 320
+ P
Sbjct: 378 VSP 380
>gi|424933506|ref|ZP_18351878.1| Glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
subsp. pneumoniae KpQ3]
gi|407807693|gb|EKF78944.1| Glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
subsp. pneumoniae KpQ3]
Length = 388
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G + +FGGS EG D+ IW G RW + ++P+ NPV
Sbjct: 33 HAANLLPLDDGSLMCVWFGGSLEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P N L L + + + ++ S++ G +W + L PG +PI
Sbjct: 93 LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ +G + C + E W N+ S ++++ D G+SWR + +P ESL +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L RS + S+S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G+SW
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + E+ L EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380
>gi|288935569|ref|YP_003439628.1| glycoside hydrolase family protein [Klebsiella variicola At-22]
gi|288890278|gb|ADC58596.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
variicola At-22]
Length = 388
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 155/357 (43%), Gaps = 71/357 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G + +FGGS EG D+ IW G RW + ++P+ NPV
Sbjct: 33 HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P N L L + + + ++ S++ G +W + L PG +PI
Sbjct: 93 LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ +G + C + E W N+ S ++++ D G+SWR + +P ESL +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L RS + S+S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATTLDNGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G+SW
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
D + E+ L EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WLRHLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSPQ 381
>gi|227115492|ref|ZP_03829148.1| hypothetical protein PcarbP_21165 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length = 395
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 83/363 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSQAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDFGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
+L NG + C E W N S ++++ D G++W++Y +PN + V I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P+ + TL L RS + S S DGG TWS PT+L N NS I L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTELPNNNSSIQVTTLENG 264
Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
L L +N +S R + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGGRSAFWGAPRAPMTLAI 324
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S+D G SW +E E L EFSYP+V QA DG +H+ +TY R IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384
Query: 316 VVL 318
VV+
Sbjct: 385 VVV 387
>gi|330006452|ref|ZP_08305616.1| BNR/Asp-box repeat protein [Klebsiella sp. MS 92-3]
gi|386034753|ref|YP_005954666.1| glycosyl hydrolase family protein [Klebsiella pneumoniae KCTC 2242]
gi|424830554|ref|ZP_18255282.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
subsp. pneumoniae Ecl8]
gi|425091448|ref|ZP_18494533.1| hypothetical protein HMPREF1308_01708 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|328535850|gb|EGF62282.1| BNR/Asp-box repeat protein [Klebsiella sp. MS 92-3]
gi|339761881|gb|AEJ98101.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae KCTC
2242]
gi|405612507|gb|EKB85258.1| hypothetical protein HMPREF1308_01708 [Klebsiella pneumoniae subsp.
pneumoniae WGLW5]
gi|414707981|emb|CCN29685.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
subsp. pneumoniae Ecl8]
Length = 388
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G + +FGGS EG D+ IW G RW + ++P+ NPV
Sbjct: 33 HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P N L L + + + ++ S++ G +W + L PG +PI
Sbjct: 93 LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ +G + C + E W N+ S ++++ D G+SWR + +P ESL +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L RS + S+S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G+SW
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + E+ L EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380
>gi|354723546|ref|ZP_09037761.1| hypothetical protein EmorL2_11843 [Enterobacter mori LMG 25706]
Length = 388
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 154/358 (43%), Gaps = 74/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + ++ NPV
Sbjct: 33 HAANLLPLPDGALMCVWFGGTQEGIADISVWGSRLTQGSQQWSDAVKLSDDATRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + ++S + G TW E + L PG + ++
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSLDLGQTWGEIDTLLDKPGTF--IRQPIVV 150
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP-- 191
LENG + C + E W N S ++++ D G++WR + +P ESL +
Sbjct: 151 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDRGKTWRD---VVVP-ESLGCVHMNI 206
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
P H N TL L RS G + S+S DGG +WS + T L N NS I L G L
Sbjct: 207 TPLH--NGTLVALFRSRWG-DNIYYSQSTDGGESWSVPETTALPNNNSSIQVTTLSSGEL 263
Query: 252 LLAYNTVS-------------------------------------RGVLKVALSKDDGDS 274
L +N +S R + VA+S D G S
Sbjct: 264 ALVFNHMSAAGALERRASLYDEIDDGDDRKEPVVTRGRAAFWGAPRAPMTVAISPDGGKS 323
Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
W L+E L EFSYP++ Q S+G++HI YTY R IK+V + P
Sbjct: 324 WPWQRNLDEGDGYCMTNNSMKKLNREFSYPSIKQGSEGNIHIAYTYFRQAIKYVRVTP 381
>gi|421082558|ref|ZP_15543441.1| BNR/Asp-box repeat family protein [Pectobacterium wasabiae CFBP
3304]
gi|401702795|gb|EJS93035.1| BNR/Asp-box repeat family protein [Pectobacterium wasabiae CFBP
3304]
Length = 395
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSKAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P + L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDDVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
+LENG + C E W N+ S ++++ D G++WR+Y +PN + V +
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDRGKTWREYP---VPNSTGCVHMNI 209
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
P TL L RS + S S DGG TWS T L N NS I LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLKNGHL 266
Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
L +N +S R + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G SW +E E L EFSYP+V QA DG +H+ +TY R IKHVV
Sbjct: 327 DGGKSWPWQRNVEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHLAFTYFRQAIKHVV 386
Query: 318 L 318
+
Sbjct: 387 V 387
>gi|386335766|ref|YP_006031936.1| hypothetical protein RSPO_m00762 [Ralstonia solanacearum Po82]
gi|334198216|gb|AEG71400.1| conserved hypothetical protein [Ralstonia solanacearum Po82]
Length = 392
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 151/360 (41%), Gaps = 70/360 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L A+FGG+ EG PDV I+L G WQ + ++P
Sbjct: 32 PCVQNHAANLHALPNGDLLCAWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P L L Y + + ++R S ++G +W E L +
Sbjct: 92 EQNPVLFTTPDGALWLIYTAQLSGHQNTAIVRRRISTDRGRSWGPIETLFDQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + LC + E W N S + ++ D GR+W P +P +S+ +
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHTVP-DSVGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L L RS V S S D G WS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRD-GRAWSAPQPTALPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTRGTAVWGAPRAPLSLAISED 325
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W LE + E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385
>gi|325962498|ref|YP_004240404.1| neuraminidase (sialidase) [Arthrobacter phenanthrenivorans Sphe3]
gi|323468585|gb|ADX72270.1| putative neuraminidase (sialidase) [Arthrobacter phenanthrenivorans
Sphe3]
Length = 419
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 161/400 (40%), Gaps = 92/400 (23%)
Query: 1 MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
+ D +K +F + AP HA+ ++ + G +FGG+ EG PD+ IW
Sbjct: 12 ITPDGAVKRADGADFAYLP--APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFS 69
Query: 61 TFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYN 116
+ G +W P ++ NP+LF L L Y + + + ++R S +
Sbjct: 70 ALEPGSSQWSEPQQLSDDSTRSEQNPILFTNTDGALWLLYTAQKAGNQDTAEVRRRISTD 129
Query: 117 KGVTWSEREQLPP-----GILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWM 163
G TW E E L P G+ + P++L +G L+ + E W NS S +
Sbjct: 130 SGRTWGEVETLFPANETGGVF--VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAV 187
Query: 164 EVTVDAGRSWRKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
++ DAG +W ++ +P V IQPV A+ +L L RS + S S
Sbjct: 188 MISDDAGATWTEH---VLPGSLGCVHMNIQPV----ADGSLLALFRS-RWADAIYESRST 239
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------------------- 257
D G TWS PT+L N NS I L DGRL L YN
Sbjct: 240 DDGSTWSEPVPTELPNNNSSIQFTALADGRLALVYNHSRAQENTERRLSLYDEIDDDGLA 299
Query: 258 -------------------------VSRGVLKVALSKDDGDSW--------HDALTLEEN 284
R + +A+S+D G +W D L N
Sbjct: 300 EEQGQLAEPDAAAVADDGSRKAFWGTPRSPMTLAISEDSGRTWPIRRNLDVGDGYCLSNN 359
Query: 285 ----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L E+SYP++ Q DGS++I YTY R IK V + P
Sbjct: 360 SRDGLNREYSYPSIHQGPDGSLNIAYTYFRQAIKFVRVDP 399
>gi|168700772|ref|ZP_02733049.1| arylsulfatase A [Gemmata obscuriglobus UQM 2246]
Length = 861
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 148/362 (40%), Gaps = 65/362 (17%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQT 61
K G G A E FP + + HA IVE G LV+++ GS E A DV ++
Sbjct: 485 KKEGKDGSFAAELIFPLHK---QHNHAPGIVECPNGDLLVSWYRGSGERSADDVAVYGAR 541
Query: 62 FKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLF---------------YKIGQEV 104
K G W + + P P N ++ L LF Y++ +
Sbjct: 542 KKAGADTWSDAFLMADTPGFPDCNTTMWVDKDGKLWLFWPLILANSWESCLTSYRVSSDY 601
Query: 105 Q-------KWSGC--MKRSYNKGVTWSEREQLPPGILGPSKN------------------ 137
Q +W G +K +GV E E + SK
Sbjct: 602 QTGGAPKWEWQGTIPLKPKDFEGVMLREFEAWKKQVAAASKMPFEPDAAPLKKKVGDKLL 661
Query: 138 ---------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
KP++L G +L +++++ M ++ D G++W P+
Sbjct: 662 SRLGWQPRCKPVVLGTGRVLLPLYSDTYDA--GLMALSDDGGKTWTASQPL----AGFGS 715
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
IQP + TL MR +V ES D G +W T L+NP SG+D V+L
Sbjct: 716 IQPAVLERKDGTLVAYMRENGVFKKVRACESKDRGESWGTVYSTDLINPGSGLDAVRLAS 775
Query: 249 GRLLLAYNTVSRG--VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
G L+ YN +G L V+LS D+G +W LE++ ++ YPAVIQ+ DG+VH Y
Sbjct: 776 GNWLIVYNDTVKGRSRLAVSLSDDEGKTWKWTRHLEKHETGQYHYPAVIQSKDGAVHAVY 835
Query: 307 TY 308
+Y
Sbjct: 836 SY 837
>gi|429859179|gb|ELA33968.1| glycosyl hydrolase [Colletotrichum gloeosporioides Nara gc5]
Length = 379
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 151/356 (42%), Gaps = 70/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HAS ++++ L +FGGS EG PD+ IWL + G W SP + N NPV
Sbjct: 19 HASNLLQLPDKTLLCTWFGGSQEGLPDISIWLSRQEPGSDTWSSPQKISSDTNRSCQNPV 78
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
LF+ P G++ + Q+ GC + S + GVTWSE + G+ G +P ++
Sbjct: 79 LFRAPKTGVIWLFHTSQDAGNQDGCYIMSRTSSDNGVTWSEARKPLKGVTGAFVRQPLVV 138
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQPV 192
L +G + C S+ + W N+ S + + D G +W + +P+ +V + P+
Sbjct: 139 LADGTWILPVFHCRSTPGQRWIGNNDISAVFYSRDDGETWTES---VVPDSVGAVHMNPI 195
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
RS V S S DG L W+ KPT L NPNSGI +L G L
Sbjct: 196 IPSKEGSDYVAFYRS-RWADNVYRSTSSDG-LNWTAPKPTSLPNPNSGICAARLPSGNLA 253
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
+ +N + R L V +S D+G +
Sbjct: 254 IVFNRSAAEVGMAKREGLYDDITPEDDKRPNQAAVNGKTAIWGTPRKTLTVGISSDEGLT 313
Query: 275 WHDALTLE-----------ENLAMEFSYPAV-IQASDGSVHITYTYNRTQIKHVVL 318
W + + E E E SYP++ + A + VH+ +T++R IK+V +
Sbjct: 314 WKERVLEEGDGFCMTNNSIEKTNRELSYPSIFVDAEESVVHVAFTFHRQHIKYVRI 369
>gi|378978714|ref|YP_005226855.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae HS11286]
gi|419974388|ref|ZP_14489807.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|419979991|ref|ZP_14495279.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|419985417|ref|ZP_14500558.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|419990995|ref|ZP_14505963.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|419997124|ref|ZP_14511922.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|420003333|ref|ZP_14517979.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|420008981|ref|ZP_14523467.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|420015314|ref|ZP_14529615.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|420020612|ref|ZP_14534798.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|420025987|ref|ZP_14539992.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|420032064|ref|ZP_14545881.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
gi|420037597|ref|ZP_14551250.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|420043577|ref|ZP_14557064.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|420049293|ref|ZP_14562602.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|420054797|ref|ZP_14567968.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|420061145|ref|ZP_14574137.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|420066729|ref|ZP_14579527.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|420071224|ref|ZP_14583871.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH21]
gi|420077268|ref|ZP_14589734.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH22]
gi|420085227|ref|ZP_14597461.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH23]
gi|421910221|ref|ZP_16340011.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST258-K26BO]
gi|421915507|ref|ZP_16345110.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST258-K28BO]
gi|428149143|ref|ZP_18996973.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST512-K30BO]
gi|428942990|ref|ZP_19015938.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae VA360]
gi|364518125|gb|AEW61253.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae HS11286]
gi|397345815|gb|EJJ38935.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH1]
gi|397347501|gb|EJJ40608.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH2]
gi|397351811|gb|EJJ44893.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH4]
gi|397363393|gb|EJJ56033.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH6]
gi|397364918|gb|EJJ57545.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH5]
gi|397369701|gb|EJJ62300.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH7]
gi|397376556|gb|EJJ68809.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH9]
gi|397382436|gb|EJJ74597.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH8]
gi|397387607|gb|EJJ79622.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH10]
gi|397396048|gb|EJJ87743.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH11]
gi|397398387|gb|EJJ90050.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH12]
gi|397405162|gb|EJJ96633.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH14]
gi|397413685|gb|EJK04897.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH17]
gi|397413873|gb|EJK05079.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH16]
gi|397422392|gb|EJK13361.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH18]
gi|397429209|gb|EJK19928.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH20]
gi|397431639|gb|EJK22311.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
pneumoniae KPNIH19]
gi|397440548|gb|EJK30950.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH21]
gi|397446150|gb|EJK36373.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH22]
gi|397449437|gb|EJK39573.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae subsp. pneumoniae KPNIH23]
gi|410115885|emb|CCM82636.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST258-K26BO]
gi|410122197|emb|CCM87735.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST258-K28BO]
gi|426297581|gb|EKV60063.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
pneumoniae VA360]
gi|427540908|emb|CCM93111.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
ST512-K30BO]
Length = 388
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ +D G + +FGGS EG D+ IW G RW + ++P+ NPV
Sbjct: 33 HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLCDDPDRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P N L L + + + ++ S++ G +W + L PG +PI
Sbjct: 93 LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ +G + C + E W N+ S ++++ D G+SWR + +P ESL +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L RS + S+S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G+SW
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + E+ L EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380
>gi|300697882|ref|YP_003748543.1| Putative Glycosyl hydrolase, BNR repeat [Ralstonia solanacearum
CFBP2957]
gi|299074606|emb|CBJ54159.1| Putative Glycosyl hydrolase, BNR repeat [Ralstonia solanacearum
CFBP2957]
Length = 392
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 151/360 (41%), Gaps = 70/360 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L +FGG+ EG PDV I+L G WQ + ++P
Sbjct: 32 PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P L L Y + + ++R S + G +W E L +
Sbjct: 92 EQNPVLFTTPDGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFDQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + LC + E W N S + ++ D GR+W ++ +P +S+ +
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISKDQGRTWTQHA---VP-DSVGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L L RS RV S S DG WS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLALYRS-RWADRVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTRGTAFWGAPRAPLSLAISED 325
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W LE + E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385
>gi|451820791|ref|YP_007456992.1| putative neuraminidase [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
gi|451786770|gb|AGF57738.1| putative neuraminidase [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
Length = 398
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 151/366 (41%), Gaps = 82/366 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HAS ++ + G L A+F G+ EG D+ I++ G W P+ E+P NP+
Sbjct: 31 HASNVMPLANGDLLCAWFSGTQEGIADISIFMSRLNKGETVWTEPVKLSEDPTKSEQNPI 90
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P+ L L Y + + + ++ S + G TW E L PG + ++
Sbjct: 91 LFPAPNGDLWLIYTAQKSGNQDTAIVRYRVSSDNGYTWGPIETLFDKPGTF--VRQPIVV 148
Query: 142 LENG-----LLLCGS-SVESWNSWG--SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+N + C + + WN S ++++ D G +W +Y +PN + V V
Sbjct: 149 LDNSEWLIPIFYCYTPQGQKWNGDYDVSAVKISADNGNTWTEYE---VPNSTGCVHMNVE 205
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ TL L RS + MS S D G TW+ PT+L N NS I L +G L L
Sbjct: 206 -KLHDGTLLALYRS-RWADNIYMSRSTDNGRTWTEPVPTELPNNNSSIQFTVLNNGHLAL 263
Query: 254 AYNTVS-------------------------------------------------RGVLK 264
+N ++ R L
Sbjct: 264 IFNDINAEKATERRASLYDEIEDDEEASEAVDNTENSFVPEDNGTERKTAFWGTPRAPLA 323
Query: 265 VALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
+A+S+D G +W +E E + EFSYP+V Q SDG ++IT+TY R
Sbjct: 324 IAISEDGGKTWPYKRNIEVGDGYCMSNNSKEKVNREFSYPSVKQTSDGKINITFTYFRQY 383
Query: 313 IKHVVL 318
IKHV +
Sbjct: 384 IKHVCV 389
>gi|157693727|ref|YP_001488189.1| hypothetical protein BPUM_2975 [Bacillus pumilus SAFR-032]
gi|157682485|gb|ABV63629.1| hypothetical protein BPUM_2975 [Bacillus pumilus SAFR-032]
Length = 391
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 156/354 (44%), Gaps = 70/354 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ IVE+D G L +FGG+ EG PD+ I++ K G +W + +P NPV
Sbjct: 36 HAANIVEMDNGDVLCVWFGGTQEGIPDISIYMSRLKRGSNKWTKAVKLSSDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF+ L L Y + + ++ S ++G TW E + L PG + ++
Sbjct: 96 LFQEKDGRLWLLYTAQLSGNQDTAIVRYRLSEDRGETWGEIDTLFDQPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGS--SVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+N + C + V+ + N S ++++ D G++W + + +PN + V +
Sbjct: 154 LDNQDWLLPVFYCKTIPGVKWTGNRDVSAVKISGDQGKTWEE---VIVPNSTGYVHMNIG 210
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
A+ +L L RS + +S S D G TWS K T+L N NS I L +G L L
Sbjct: 211 -KCADGSLLALFRS-RFADSIYISRSVDHGRTWSDPKATELPNNNSSIQFTVLNNGHLAL 268
Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
YN ++ R + +A+S D+G++W
Sbjct: 269 VYNHINADEQTERRASLYDEIEDEGDTRTAVDTEARPAFWGTPRAPMTLAVSVDNGETWP 328
Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
LE + L E+SYP++ + DG +HI +TY R IK+V +
Sbjct: 329 IRRNLEVGDGYAMTNNSKDKLNREYSYPSITEGKDGKLHIAFTYYRQAIKYVCV 382
>gi|357056123|ref|ZP_09117178.1| hypothetical protein HMPREF9467_04150 [Clostridium clostridioforme
2_1_49FAA]
gi|355381371|gb|EHG28497.1| hypothetical protein HMPREF9467_04150 [Clostridium clostridioforme
2_1_49FAA]
Length = 386
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 159/338 (47%), Gaps = 55/338 (16%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
+K+ H ++E++ G L +F GS EG DV + K G RW+ P++ +P+
Sbjct: 33 YKTAHGPGLLELENGDLLCVWFAGSFEGNADVNVICARLKKGSSRWEEPVLVSHDPDRSE 92
Query: 82 WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
NP LF P + Y G++ +++ ++ +S + G TWS+ E + P G
Sbjct: 93 QNPSLFAGPDGKVWAMYTSQLSRVEGKDNMQFTSVIRCQKSADGGETWSDYETVFPE-EG 151
Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNES-- 185
+PI +LE+G + G+ + + + G + ++ D G++W+ + +P +
Sbjct: 152 SFCRQPIQVLESGRWIFGNWICTDSELGLTGDPTAFRISDDQGKTWKT---VDMPESNGA 208
Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+V++ P H L MRS + SES D G TW+ PT L N NS I
Sbjct: 209 VHANVVELEPGH-----LAAFMRS-RAADHIYRSESMDNGDTWTAPVPTVLPNNNSSISA 262
Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------- 282
VKLK GR+ +AYN R + VALS+D G ++ +E
Sbjct: 263 VKLKSGRIAVAYNPTCTPCPTPGVASWPGLRCPVAVALSEDGGLTFPMIRYMERGEGYMG 322
Query: 283 -ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
EN ++ YP ++Q +DG +H+ + Y +R +K++
Sbjct: 323 AENKTNNRQYEYPYIMQGADGRLHLAFAYKDRLSVKYM 360
>gi|407712890|ref|YP_006833455.1| BNR repeat-containing glucosyll hydrolase [Burkholderia
phenoliruptrix BR3459a]
gi|407235074|gb|AFT85273.1| BNR repeat-containing glucosyll hydrolase [Burkholderia
phenoliruptrix BR3459a]
Length = 395
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 151/357 (42%), Gaps = 78/357 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L A+FGG+ EG PD+ I+L G W P+ ++P NPV
Sbjct: 34 HAANLLALANGDLLCAWFGGTQEGVPDISIYLSRLAQGSSTWSEPVRLSDDPTRSEQNPV 93
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL- 142
LF P L L Y + + ++R S + G TW + L + G +PI++
Sbjct: 94 LFAAPDGKLWLIYTAQLSGHQNTSIVRRRVSGDNGHTWGPIDTL-FDLPGTFVRQPIVVA 152
Query: 143 ENGLLLCG------SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQP 191
+G LC + E W N S + + D G +W + +P V IQP
Sbjct: 153 RDGAWLCPVFLCRVAPGERWSGNDDVSAVMRSTDNGATWTAHD---VPGSVGCVHMNIQP 209
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+P + +L L RS + MS S D G+ WS +P L N NS I V L +G L
Sbjct: 210 LP----DGSLVALFRS-RWADHIHMSRSAD-GIAWSEPEPLALPNNNSSIQFVTLANGHL 263
Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
L +N S R + +A+S D
Sbjct: 264 ALVFNNSSAAQSTGRRASLYDDIEDAEDSGTLATQAASARGTAFWGAPRAPMTLAVSMDG 323
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G +W LE + L EFSYP+++Q++DG +HI YTY R +IK+V
Sbjct: 324 GRTWPVMRNLETGDGYCMTNNSTDKLNREFSYPSIVQSADGRLHIAYTYFRQRIKYV 380
>gi|220911882|ref|YP_002487191.1| hypothetical protein Achl_1110 [Arthrobacter chlorophenolicus A6]
gi|219858760|gb|ACL39102.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6]
Length = 413
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 158/392 (40%), Gaps = 91/392 (23%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQ 68
A+ F AP HA+ ++ + G +FGG+ EG PD+ IW + G +W
Sbjct: 19 TADGADFAYLPAPTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSKQWS 78
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQ 126
+P ++ NP+LF L L Y + + + ++R S + G TW + E
Sbjct: 79 APEQLSDDSTRSEQNPILFTNTDGALWLLYTAQKAGNQDTAEVRRRISLDSGRTWGDVET 138
Query: 127 L-----PPGILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSW 173
L G+ + P++L +G L+ + E W NS S + ++ DAG +W
Sbjct: 139 LFAANETGGVF--VRQLPVVLPSGRLIIPIFRCITTPGEKWVGNSDDSAVMISDDAGATW 196
Query: 174 RKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
+ +P V IQPV A+ +L L RS + S S D G TWS
Sbjct: 197 TE---TVLPGSLGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRSTDDGSTWSEPV 248
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNT--------------------------------- 257
PT+L N NS I V LKDGRL L YN
Sbjct: 249 PTELPNNNSSIQFVALKDGRLALVYNHSRAGEGTERRLSLYDEIDDDGLADEQGQVAEPD 308
Query: 258 ----------------VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEF 289
R + +A+S+D G SW D L N L E+
Sbjct: 309 ASAFSEDDGVKRAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLSNNSRDGLNREY 368
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
SYP++ Q DG+++I YTY R IK V + P
Sbjct: 369 SYPSIHQGPDGALNIAYTYFRQAIKFVRVDPQ 400
>gi|374991679|ref|YP_004967174.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
bingchenggensis BCW-1]
gi|297162331|gb|ADI12043.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
bingchenggensis BCW-1]
Length = 409
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 156/379 (41%), Gaps = 91/379 (24%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ + + G +FGG+ EG D+ +W G W P+ ++
Sbjct: 38 APAVQNHAANLAVLPGGDLGCVWFGGTQEGVADICVWFSRLAPGADTWTDPVRLSDDLTR 97
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQL-----PPGIL 132
NPVLF P+ L L + Q + + ++ S + G TW L G+
Sbjct: 98 SEQNPVLFSAPTGELWLLHTAQQAGNQDTAEVRLRTSADGGATWGPTRTLFAATATGGVF 157
Query: 133 GPSKNKPILLENG--LLLCGSSVES----W--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ P++L++G LL V + W + S + ++ D GR+WR+ +P
Sbjct: 158 --VRQPPVVLDDGRWLLPVFHCVATPGIPWVGDHDTSAVMISDDQGRTWREQ---PVPGS 212
Query: 185 SLSV---IQPVPFHTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+ V I P+P + TL L RS + I R S S DGG TWS T+L N NS
Sbjct: 213 TGCVHMNIHPLP----DGTLLALFRSRWADAIHR---SHSRDGGETWSEPVATELPNNNS 265
Query: 240 GIDGVKLKDGRLLLAYNTVS---------------------------------------- 259
I V L DGRL L YN S
Sbjct: 266 SIQYVPLADGRLALVYNHSSAADATARRVSLYDEIDDDGQAGESPAATAAPEVADGGKTA 325
Query: 260 -----RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSV 302
R + +ALS D GD+W L+ + L E SYP+V Q+ DG++
Sbjct: 326 FWGAPRAPMTLALSSDGGDTWPHRRDLDTGDGHCLTNNSRDRLNRELSYPSVTQSPDGAL 385
Query: 303 HITYTYNRTQIKHVVLLPN 321
HI YTY+R IK+V + P+
Sbjct: 386 HIAYTYHRRAIKYVRVAPD 404
>gi|375263597|ref|YP_005025827.1| hypothetical protein VEJY3_22196 [Vibrio sp. EJY3]
gi|369844024|gb|AEX24852.1| hypothetical protein VEJY3_22196 [Vibrio sp. EJY3]
Length = 385
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 150/356 (42%), Gaps = 70/356 (19%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVPM 81
+ HAS ++ + G L +F G+ EG D+ I + W+ + +P
Sbjct: 32 YPQNHASNLLVLPNGDLLCTWFAGTQEGVSDISILCSRLRKDSNEWEPAVKLSNDPERSE 91
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKN 137
NPVLF P N L L + + + + ++S++ G TW + + L PG +
Sbjct: 92 QNPVLFLDPDNTLWLLWTAQLAGNQDTAIVRYRKSFDLGETWQDIDVLIDKPGTF--IRQ 149
Query: 138 KPILLENG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
++L NG L + +++ W S ++++ D G++WR + +P ESL +
Sbjct: 150 PIVVLPNGNWLLPIFNCTIKPGEKWVGNYDTSSVKISEDKGQTWRD---VDVP-ESLGCV 205
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
N +L + RS + S S D G TWS PT+L N NS I L +G
Sbjct: 206 HMSVALLKNGSLIAVYRS-RWADSIYQSYSHDNGETWSAPVPTELPNNNSSIQLTALNNG 264
Query: 250 RLLLAYNTVS-------------------------------------RGVLKVALSKDDG 272
L + +N +S R + +A+SKDDG
Sbjct: 265 DLAMVFNNMSAEGATDRRSSLYDEIEDDSDAKEPEIIDGKSAFWGAPRAPMSLAISKDDG 324
Query: 273 DSWHDALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
SW L +E L EFSYP + Q+SDG++HI YTY R IK+V
Sbjct: 325 ASWQIVANLDEGDGFCMTNNSQEKLNREFSYPTITQSSDGAIHIAYTYFRQAIKYV 380
>gi|288961203|ref|YP_003451542.1| glycosyl hydrolase [Azospirillum sp. B510]
gi|288913511|dbj|BAI74998.1| glycosyl hydrolase [Azospirillum sp. B510]
Length = 408
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 151/376 (40%), Gaps = 83/376 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ I + G +FGG+ EG PD+ I+ G RW P+ ++P
Sbjct: 35 SPCVQNHAANITVLGNGDLGCVWFGGTQEGIPDISIYFSRLAQGSDRWTDPVKLSDDPTR 94
Query: 80 PMWNPVLFKLPSNGLLLFYKI---GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS- 135
NPVLF P L LFY G + + C + S + G +W L G G
Sbjct: 95 SEQNPVLFPAPDGTLWLFYTAQVSGNQDTAFVRC-RSSADHGHSWGPIRTLFEGTNGDGI 153
Query: 136 --KNKPILLENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+ ++L NG LC G+ E W ++ S + ++ D G SW ++ +P+
Sbjct: 154 FIRQPVVVLPNGDWLMPTFLCHGTPGEKWVGDNDTSAVRISSDQGESWTEHA---VPDSV 210
Query: 186 LSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
V + VP + TL RS V S S D G +WS T+L N NS I
Sbjct: 211 GCVHMSVVPLR--DSTLAAFYRS-RWADHVYRSLSTDNGRSWSAPVATELPNNNSSIQVT 267
Query: 245 KLKDGRLLLAYNTVS--------------------------------------------- 259
L DGRL + +N S
Sbjct: 268 ALADGRLAIVFNQSSAENATGRRASLYDEIEDDVPAGGAAVAVAPAPVKAEALARKAFWG 327
Query: 260 --RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHIT 305
R L +ALS+D G +W LE ++L EFSYP+V Q +DG +H+
Sbjct: 328 APRAPLTLALSEDGGLTWPLRRNLEVGDGYCMTNNSKDSLNREFSYPSVTQTADGDLHVA 387
Query: 306 YTYNRTQIKHVVLLPN 321
+TY R IK+ + P+
Sbjct: 388 FTYFRQAIKYSRVPPD 403
>gi|408675436|ref|YP_006875184.1| neuraminidase (sialidase)-like protein [Emticicia oligotrophica DSM
17448]
gi|387857060|gb|AFK05157.1| neuraminidase (sialidase)-like protein [Emticicia oligotrophica DSM
17448]
Length = 395
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 156/360 (43%), Gaps = 72/360 (20%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
K HAS++V + G +L +F GS E + DVKI + G+ W P + + PN+P
Sbjct: 37 KHTHASSLVNLPNGDYLCVWFYGSGERTSDDVKIMGARLEKGKKTWSEPFLMADTPNIPD 96
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKG--VTWSERE--------- 125
NPVLF L L + I + KW + R Y K WS ++
Sbjct: 97 CNPVLFLNHEKKLFLVW-IAVQANKWEQSILRFRTSTDYQKSGPPVWSWQDNILLKPDDE 155
Query: 126 ----------QLPPGILG------------------PSKN--------KPILLENGLLLC 149
+LPP G P+K KP+LL N ++
Sbjct: 156 FAKEVNKRLKELPPPTAGWAGYAPKYDESINIASNDPTKRSIGWMTRIKPLLLPNNRIVL 215
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR-SF 208
+ +N S M ++ D G SW P+ IQP N L LMR S
Sbjct: 216 PLYSDGFNM--SLMAISDDDGTSWHPSLPLV----GRGPIQPAVIQKKNGNLYALMRDSG 269
Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVAL- 267
+ RV SES D G TW + T++ N + ++ + LKDGR N + G +++L
Sbjct: 270 DDPSRVHESESTDLGETWKASSKTKIPN-TASVELLVLKDGRWAFLGNDIDDGRYRLSLF 328
Query: 268 -SKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTY----NRTQIKHVVLLP 320
S D+G +W LE E FSYP++IQ DG +H+TY+Y ++ IK+V++ P
Sbjct: 329 LSDDEGKTWKWKTKLENVEPNKGSFSYPSLIQTEDGLIHLTYSYHLEKDKKSIKYVIVDP 388
>gi|389574379|ref|ZP_10164443.1| hypothetical protein BAME_30120 [Bacillus sp. M 2-6]
gi|388425987|gb|EIL83808.1| hypothetical protein BAME_30120 [Bacillus sp. M 2-6]
Length = 391
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 155/354 (43%), Gaps = 70/354 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++E+D G L +FGG+ EG PD+ I++ K G +W + ++ NPV
Sbjct: 36 HAANLIELDNGDLLCVWFGGTQEGIPDISIYMSRLKQGSNQWTKAVKLSDDSTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF+ L L Y + + ++ S ++G TW E L PG + ++
Sbjct: 96 LFQEKDGRLWLLYTAQLSGNQDTALVRYRLSEDRGETWGEIGTLFDQPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGS--SVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+N + C + V+ + N S ++++ D G++W + + +PN + V +
Sbjct: 154 LDNDDWLLPVFYCKTIPGVKWTGNRDVSAVKISSDQGKTWEE---VTVPNSTGCVHMNIG 210
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
A+ +L L RS + MS S D G TWS + T+L N NS I L +G L L
Sbjct: 211 -KCADGSLLALFRS-RFADSIYMSRSVDHGRTWSEPRATELPNNNSSIQFTVLHNGHLAL 268
Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
YN ++ R + +A+S DDG +W
Sbjct: 269 VYNHINADDQTERRASLYDEIEDEGDTRTAVDTEARPAFWGTPRAPMTLAISTDDGATWP 328
Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
LE + L E+SYP++ + DGS+HI +T+ R IK+V +
Sbjct: 329 VRRNLEIGDGYAMTNNSKDKLNREYSYPSITEGKDGSLHIAFTFYRQAIKYVCV 382
>gi|227327959|ref|ZP_03831983.1| hypothetical protein PcarcW_11745 [Pectobacterium carotovorum
subsp. carotovorum WPP14]
Length = 395
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 154/363 (42%), Gaps = 83/363 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G W + + E+ NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSNAVKLSEDATRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L Y + + + ++ +S + G TW E L PG +PI
Sbjct: 96 LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
+L NG + C E W N S ++++ D G++W++Y +PN + V I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P+ + TL L RS + S S DGG TWS T L N NS I L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLENG 264
Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
L L +N +S R + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASDAKMPEVQAGGRSAFWGAPRAPMTLAI 324
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S+D G SW +E E L EFSYP+V QA DG +H+ +TY R IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384
Query: 316 VVL 318
VV+
Sbjct: 385 VVV 387
>gi|83749872|ref|ZP_00946839.1| Hypothetical protein RRSL_00197 [Ralstonia solanacearum UW551]
gi|207739581|ref|YP_002257974.1| hypothetical protein RSIPO_04281 [Ralstonia solanacearum IPO1609]
gi|83723462|gb|EAP70673.1| Hypothetical protein RRSL_00197 [Ralstonia solanacearum UW551]
gi|206592961|emb|CAQ59867.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
Length = 392
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 150/360 (41%), Gaps = 70/360 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L +FGG+ EG PDV I+L G WQ + ++P
Sbjct: 32 PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P+ L L Y + + ++R S + G +W E L +
Sbjct: 92 EQNPVLFTTPAGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFEQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + LC + E W N S + ++ D GR+W P +P +S+ +
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHAVP-DSVGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L L RS V S S DG WS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTHGTAFWGAPRAPLSLAISED 325
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W LE + E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385
>gi|421898935|ref|ZP_16329301.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
gi|206590141|emb|CAQ37102.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
Length = 392
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 150/360 (41%), Gaps = 70/360 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L +FGG+ EG PDV I+L G WQ + ++P
Sbjct: 32 PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P+ L L Y + + ++R S + G +W E L +
Sbjct: 92 EQNPVLFTTPAGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFEQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + +C + E W N S + ++ D GR+W P +P +S+ +
Sbjct: 152 IVVLSDGAWACPVFMCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHAVP-DSVGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L L RS V S S DG WS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTHGTAFWGAPRAPLSLAISED 325
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W LE + E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385
>gi|392979080|ref|YP_006477668.1| hypothetical protein A3UG_11160 [Enterobacter cloacae subsp.
dissolvens SDM]
gi|392325013|gb|AFM59966.1| hypothetical protein A3UG_11160 [Enterobacter cloacae subsp.
dissolvens SDM]
Length = 387
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 150/356 (42%), Gaps = 70/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W ++ NPV
Sbjct: 32 HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLSPGSQQWSEAEKLSDDATRSEQNPV 91
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + ++S ++G TW + L PG + ++
Sbjct: 92 LFLAPDNVLWLLWTAQMSGNQDTAIVRYRQSRDRGKTWGDIATLLDKPGTF--IRQPIVV 149
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
LENG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 150 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDGGKTWRD---VEVP-DSLGCVHMNI 205
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
N TL L RS + S+S D G +WS +PT L N NS I L G L L
Sbjct: 206 TPLCNGTLVALFRS-RWADNIYYSQSTDNGESWSVPEPTVLPNNNSSIQVTTLASGELAL 264
Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
+N +S R + VA+S D G SW
Sbjct: 265 VFNFMSAAGAQERRTSLYDEIDDGDGRKEPVVTHGRAAFWGAPRAPMTVAISPDGGKSWP 324
Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DGS+HI YTY R IK+V + P
Sbjct: 325 WQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTYFRQAIKYVRVTP 380
>gi|343084746|ref|YP_004774041.1| neuraminidase (sialidase)-like protein [Cyclobacterium marinum DSM
745]
gi|342353280|gb|AEL25810.1| neuraminidase (sialidase)-like protein [Cyclobacterium marinum DSM
745]
Length = 393
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 163/377 (43%), Gaps = 75/377 (19%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTF--KD 64
K LV+E FP H S+IV + G L A+F GS E A DV+I KD
Sbjct: 25 KVLVSESLIFPLQQ---NHVHGSSIVSLPNGDLLTAWFEGSGERKADDVRIMGARLRKKD 81
Query: 65 GRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SY-NKG 118
+W P + + PN+P NPVLF L G L I + +W + R SY N G
Sbjct: 82 RQWGEPFLMADTPNIPDCNPVLF-LNQEGKLFLVWIAVQANRWEQSILRFRTSTSYENAG 140
Query: 119 V---TWSEREQLPPG-------------------------------ILGPSKN------- 137
+W + L PG I+ SK+
Sbjct: 141 APIWSWQDNILLKPGEEFHQEVAKRFEQLPENNSGWSEYAPQYDQMIIEASKDSKKSSIG 200
Query: 138 -----KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
KP++L +G +L + +N S M ++ D G +W P+ IQP
Sbjct: 201 WMTRIKPVVLTSGRILLPLYSDGFNF--SMMAISDDDGTTWFPSKPLV----GRGPIQPA 254
Query: 193 PFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
N + +MR S + R+ S S D G+TWS ++ T + N S ++ + L +G
Sbjct: 255 LGQRENGEIMAIMRDSGDAPTRIHKSYSQDEGMTWSASEKTSIPNTAS-VELLVLDNGTW 313
Query: 252 LLAYNTVSRGVLKVAL--SKDDGDSWHDALTLEENL--AMEFSYPAVIQASDGSVHITYT 307
L N + G +++L S D+G +W + LE+ FSYP++IQ DG VHITY+
Sbjct: 314 LFLGNDIDDGRYRISLYISDDEGQTWRGKIYLEDEKKDVGGFSYPSLIQDIDGMVHITYS 373
Query: 308 YNRTQ----IKHVVLLP 320
Y+ + IK++ + P
Sbjct: 374 YHLNEKGKSIKYLKIDP 390
>gi|418402771|ref|ZP_12976276.1| hypothetical protein SM0020_21702 [Sinorhizobium meliloti
CCNWSX0020]
gi|359503265|gb|EHK75822.1| hypothetical protein SM0020_21702 [Sinorhizobium meliloti
CCNWSX0020]
Length = 401
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ G +W P +
Sbjct: 34 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 93
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 94 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 147
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + V++D GRSW+
Sbjct: 148 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 203
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 204 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 262
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+DGRL + N V+ R
Sbjct: 263 SSITVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 322
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 323 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 382
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 383 RRAIKYVRLPP 393
>gi|407724134|ref|YP_006843795.1| hypothetical protein BN406_06513 [Sinorhizobium meliloti Rm41]
gi|407324194|emb|CCM72795.1| hypothetical protein BN406_06513 [Sinorhizobium meliloti Rm41]
Length = 395
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ G +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+DGRL + N V+ R
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCAPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|378582298|ref|ZP_09830937.1| hypothetical protein CKS_3144 [Pantoea stewartii subsp. stewartii
DC283]
gi|377815035|gb|EHT98151.1| hypothetical protein CKS_3144 [Pantoea stewartii subsp. stewartii
DC283]
Length = 385
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 153/357 (42%), Gaps = 71/357 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ I+ + G L +FGG+ EG D+ +W G W + ++P NPV
Sbjct: 33 HAANILPLHDGSLLCVWFGGTQEGIADISVWCSRLPAGSHIWSEAVKLSDDPTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW + L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQLSGNQDTAIVRYRQSGDLGKTWGPIDTLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+LENG + C + E W N S ++++ D G+SWR + +P ESL +
Sbjct: 150 VLENGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGQSWRD---VVVP-ESLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS + +S S DGG +WS + T L N NS I L+ G L
Sbjct: 206 ITALQDGSLVALYRS-RWADNIYLSRSTDGGESWSAPEATVLPNNNSSIQVTTLQSGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N++S R + VA+S D G +W
Sbjct: 265 LVFNSMSAAGATERRASLYDEIDDGDGRKEPEATGRSAFWGAPRAPMTVAISADGGKTWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
D + +E L EFSYP++ Q + G++HI YTY R IK+ + P+
Sbjct: 325 WQKNLDEGDGYCMTNNSQEKLNREFSYPSIKQDAQGTLHIAYTYFRQAIKYARVTPD 381
>gi|400289912|ref|ZP_10791939.1| hypothetical protein SRA_03251 [Streptococcus ratti FA-1 = DSM
20564]
gi|399920703|gb|EJN93520.1| hypothetical protein SRA_03251 [Streptococcus ratti FA-1 = DSM
20564]
Length = 390
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 150/335 (44%), Gaps = 45/335 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWN 83
+ HA ++E G L A+F GS EG+ D+ I + + W P I + + N
Sbjct: 37 TAHAPALIETQDGGLLCAWFAGSFEGSGDISIAVSKLDPEKQVWSIPQIVSKGKDRSEQN 96
Query: 84 PVLFKLPSNGLLLFY------KIGQEVQKWSGCM--KRSYNKGVTWSEREQLPPGILGPS 135
P F+ P + L Y + G++ +++ + ++S ++G TW E + L P S
Sbjct: 97 PAFFRAPDGQIWLIYTSQLSRQEGKDNMQFTSVIMVQKSSDEGQTWGEPQILFPEEGTFS 156
Query: 136 KNKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
+ +L NG LC S E + + V+ DAG++W++ + +P+ + V
Sbjct: 157 RQAIQVLSNGRWIFATWLCEDSAEGLTNDPTEFRVSDDAGQTWKR---VRMPDSNGRVHA 213
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
V A + + F V +SES D G +W+ + T L N N+ I +KL+ G
Sbjct: 214 NVIEVEAGHLVAFMRSRF--ADNVYISESYDFGDSWTIPQKTVLPNNNASISAIKLQSGD 271
Query: 251 LLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE--------ENLA-- 286
+ LAYN + R + V++S+D G +W LE EN
Sbjct: 272 IALAYNVNAARHPEFGKVAWPGLRNPVAVSISEDFGKTWPIGRILEPAEGFIGTENKTNN 331
Query: 287 MEFSYPAVIQASDGSVHITYTY-NRTQIKHVVLLP 320
++ YP + Q DG +H+ Y Y NR +K+V P
Sbjct: 332 AQYEYPTLYQTKDGLLHLVYAYRNRLCVKYVRFSP 366
>gi|309780704|ref|ZP_07675445.1| BNR/Asp-box repeat protein [Ralstonia sp. 5_7_47FAA]
gi|404394394|ref|ZP_10986198.1| hypothetical protein HMPREF0989_03334 [Ralstonia sp. 5_2_56FAA]
gi|308920386|gb|EFP66042.1| BNR/Asp-box repeat protein [Ralstonia sp. 5_7_47FAA]
gi|348613456|gb|EGY63041.1| hypothetical protein HMPREF0989_03334 [Ralstonia sp. 5_2_56FAA]
Length = 391
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 69/359 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L +FGG+ EG PDV I+L G WQ E+P
Sbjct: 32 PCVQNHAANLHALPNGDLLCVWFGGTQEGIPDVSIYLSRLPAGTSEWQPATKLSEDPTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P+ L L Y + + ++R S + G +W + L +
Sbjct: 92 EQNPVLFSTPTGELWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIDTLFDQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + LC + E W N S + ++ D GR+W ++ +P +S+ +
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDISAVMISEDQGRTWSRHD---VP-DSIGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L + RS + S S DG TWS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLAVYRS-RWADHIYASRSQDG-RTWSAPQPTTLPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSTGRRPSLYDDIEDAEDHGELRDQQASTRGTAFWGAPRAPLSLAISED 325
Query: 271 DGDSWH-------DALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W D + N A E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGHTWRRRNIEIGDGYCMTNNSADQRNREYSYPSIVQGNDGALHIAFTYFRQRIKYVTV 384
>gi|402082238|gb|EJT77383.1| glycosyl hydrolase [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 393
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 145/363 (39%), Gaps = 85/363 (23%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++ +D G L A+FGGS EG D+ IWL G W P +P NPV
Sbjct: 32 HASNLLVLDSGEVLCAFFGGSMEGTSDIYIWLARLAPGADEWSEPARVSGDPGRSEQNPV 91
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSE-REQLP------------- 128
LF+ P L QE + C+ R S ++G TWSE RE
Sbjct: 92 LFEDPRTKNLWLLYTAQEHGAQNKCLVRRRVSTDQGRTWSEAREAFSDRGVLVRQPVTVL 151
Query: 129 PGILGPSKNKPILLENGLLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P GP ++ +L + C + E W N S + + D G +W++ +P+
Sbjct: 152 PDKEGPGRDVWVLP---VFHCRAAEGEEWVGNDDVSAVRYSADGGETWKES---VVPDSR 205
Query: 186 LSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+V I+P P R + I + S+ C +TW +PT+L NPNSGI
Sbjct: 206 GAVHMNIRPDPGPEGGWVAVFRSRWADNI-YMSRSQDC---ITWFKPQPTELPNPNSGIG 261
Query: 243 GVKLKDGRLLLAYNT--------------------------------------VSRGVLK 264
K GRL L YN R L
Sbjct: 262 MAVTKGGRLALVYNDSRATPDMARREGLYDDLETGKPKLSKQAVIDGKPAIWGTPRNRLT 321
Query: 265 VALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQ 312
+A S D G +W L++ L E SYP++ A+DG++HI YT+ R +
Sbjct: 322 LATSDDGGRTWKTRGLLDDGLGFCLTNNSKGQRNRELSYPSIAVAADGTIHIAYTFWRMR 381
Query: 313 IKH 315
IK+
Sbjct: 382 IKY 384
>gi|365133916|ref|ZP_09343066.1| hypothetical protein HMPREF1032_00862 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363614831|gb|EHL66310.1| hypothetical protein HMPREF1032_00862 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 385
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/337 (29%), Positives = 147/337 (43%), Gaps = 55/337 (16%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P+ + HA ++E G L A+F GS EG+ DV I G RW++P+ +
Sbjct: 32 PWATAHAPALLETPDGSLLCAWFAGSFEGSGDVSIVCARLPKGAARWEAPVQVSHDEKRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMKR--SYNKGVTWSEREQLPPGIL 132
NP LF P + Y G++ +++ ++R S + G TW E P +L
Sbjct: 92 EQNPSLFLAPDGAVWAVYTAQLDRVPGKDNMQFTSVIRRQKSTDGGRTWGE-----PDVL 146
Query: 133 ----GPSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
G +PI +L +G LC S + +++ D G +WR + +P
Sbjct: 147 FAREGSFCRQPIQVLASGRWIFANWLCSDSASGLAGDPTAFQISDDQGCTWRT---VEMP 203
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+ V V A L MRS + SES D G TWS PT L N NS I
Sbjct: 204 GSNGRVHANV-VELAPGRLAAFMRS-RAADFIYRSESLDDGNTWSEPVPTVLPNNNSSIS 261
Query: 243 GVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------ 282
VKL+ GR+ +AYN R + VALS+D G +W +E
Sbjct: 262 AVKLQSGRIAVAYNPTHTPAPQPGMAAWPGLRCPVAVALSEDGGLTWPMIRHMELGEGFV 321
Query: 283 --ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIK 314
EN ++ YP ++QASDG +H+ + Y NR +K
Sbjct: 322 GPENSTNNRQYEYPYLMQASDGRLHLAFAYQNRRGVK 358
>gi|296534630|ref|ZP_06897036.1| BNR/Asp-box repeat protein [Roseomonas cervicalis ATCC 49957]
gi|296265040|gb|EFH11259.1| BNR/Asp-box repeat protein [Roseomonas cervicalis ATCC 49957]
Length = 402
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 159/387 (41%), Gaps = 77/387 (19%)
Query: 2 KKDCGIKGLVAEEFTFPANSAPF-----KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK 56
+ D G GLV PA + HA+ + + G +FGG+ EG PD+
Sbjct: 9 RLDLGEAGLVRPAPEDPARLEAYIPPATVQSHAANLTPLPDGSLGCVWFGGTQEGVPDIC 68
Query: 57 IWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR- 113
W G RW +P+ E+ NP+LF P L L + + + +++
Sbjct: 69 AWFSRLDPGAERWSAPVRLSEDATRSEQNPLLFPAPDGTLWLIWTAQISGNQDTAIVRKR 128
Query: 114 -SYNKGVTWSEREQL--PPGILGPSKNKPIL-LENG-----LLLCGSSV-ESW--NSWGS 161
S++ G +W E L P G +P++ L+NG + +C S+ + W + S
Sbjct: 129 ISHDMGRSWGPVETLFGPRPEGGTFMRQPVVVLDNGDWLLPIWVCKSTPGKKWVGDEDVS 188
Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
++++ D G+SW + I +P +SL + + TL RS V +S S D
Sbjct: 189 AVKLSTDQGKSWTE---IEVP-DSLGCVHMSIVDLRDGTLAAFYRS-RWADNVYVSRSTD 243
Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT------------------------ 257
GG +WS +PT+L N NS I +L +G L + +N
Sbjct: 244 GGRSWSAPEPTELPNNNSSIQVARLSNGHLAMVFNHSSAADATERRLSLYDDIEDDSDDG 303
Query: 258 ----------------VSRGVLKVALSKDDGDSWHDALTLE------------ENLAMEF 289
V R + +A+S+D G SW LE E L EF
Sbjct: 304 KVVAAIDTTRRSTFWGVPRAPMTLAISEDGGRSWPWRRNLETGDGYCMTNNSREGLNREF 363
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHV 316
SYP V + DG +HI YT R IK+V
Sbjct: 364 SYPTVTEGQDGRLHIAYTVYRQAIKYV 390
>gi|332667926|ref|YP_004450714.1| neuraminidase (sialidase)-like protein [Haliscomenobacter hydrossis
DSM 1100]
gi|332336740|gb|AEE53841.1| neuraminidase (sialidase)-like protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 390
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/361 (30%), Positives = 161/361 (44%), Gaps = 75/361 (20%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
K HAS++V + G FL +F GS E A DVK+ + G+ W P + + PN+P
Sbjct: 37 KHVHASSVVYLPNGDFLAVWFYGSGERTADDVKLMGSRLEKGKSTWSEPFLMADTPNIPD 96
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKGV----TWSEREQLPPG-- 130
NPVLF N L L + I + KW + R Y K W + L PG
Sbjct: 97 CNPVLFLNQENKLFLVW-IAVQANKWEQSILRLRTSTDYLKSGPPLWNWQDNILLKPGDT 155
Query: 131 -----------------------------ILGPSKN------------KPILLENGLLLC 149
I+ S++ KP++L N ++L
Sbjct: 156 FTQEVARKMKELPENTAGWAEYAPAYDKMIIVASQDVTKRSFGWMTRIKPLMLGNRIVLP 215
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR-SF 208
S + +N S ++ D G +WR P+ IQP N L MR S
Sbjct: 216 LYS-DGFNF--SLCAISEDRGNTWRPGLPLV----GRGPIQPAIVQKMNGNLMAYMRDSG 268
Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVAL- 267
+ GRV SES D G +W+ A + + N + ++ + L+DG+ N + G +++L
Sbjct: 269 DAPGRVQTSESSDQGESWTAAVKSDIPN-TASVELLVLQDGKWAFVGNDLEDGRYRLSLY 327
Query: 268 -SKDDGDSWHDALTLEENLAM---EFSYPAVIQASDGSVHITYTYNRTQ----IKHVVLL 319
S D+G SW TLE N+A FSYP++IQA DG +H+TY+Y+ Q IK+VV+
Sbjct: 328 LSDDEGKSWKWKTTLE-NVAPGQGSFSYPSLIQAPDGLLHLTYSYHLAQDKKSIKYVVVN 386
Query: 320 P 320
P
Sbjct: 387 P 387
>gi|261406860|ref|YP_003243101.1| hypothetical protein GYMC10_3028 [Paenibacillus sp. Y412MC10]
gi|261283323|gb|ACX65294.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
Length = 362
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/321 (29%), Positives = 140/321 (43%), Gaps = 37/321 (11%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
H++ ++ +D G L +F GS EG PD + L G RW+ PI +P NP
Sbjct: 33 SHSANLLALDNGDLLCVWFSGSGEGNPDTNVLLSRLPAGGERWEEPIEVAADPERSEQNP 92
Query: 85 VLFKLPSNGLLLFYKIGQ-EVQKWSGCMKR-SYNKGVTWSEREQLPPGILGPSKNKPIL- 141
++F+ P N + L + + Q+ S + R S ++G TW E L G G +PIL
Sbjct: 93 LVFQAPDNKVWLLHTSNEPHNQQTSKIVGRISDDRGYTWGEPFVLHEGP-GMFLRQPILA 151
Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL 201
+ NG L S + ++ D G +W +Y SL +Q TL
Sbjct: 152 MSNGEWLLPCYYCKPGGHYSVVLISADQGETWSEYEV----QGSLHRVQMSVVELDGGTL 207
Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-- 259
+ RS + R+ S S D G TWS T L N NS I KL +G L + YN +
Sbjct: 208 FAVFRSRHA-DRIYGSVSKDFGKTWSEPAKTSLPNNNSSIQLAKLANGHLAIIYNDSTME 266
Query: 260 ------------------RGVLKVALSKDDGDSWHDALTLE------ENLAMEFSYPAVI 295
R L VA+S+D G +W ++ + +SYP+++
Sbjct: 267 RDQYRWIERNGEWLKKTLRTPLTVAISEDGGKTWPHVKNVQMSDLEHKEKQTGYSYPSIM 326
Query: 296 QASDGSVHITYTYNRTQIKHV 316
DGS+H Y+Y R IK+V
Sbjct: 327 GTKDGSIHAAYSYLRKAIKYV 347
>gi|350271843|ref|YP_004883151.1| hypothetical protein OBV_34470 [Oscillibacter valericigenes
Sjm18-20]
gi|348596685|dbj|BAL00646.1| hypothetical protein OBV_34470 [Oscillibacter valericigenes
Sjm18-20]
Length = 386
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 145/325 (44%), Gaps = 46/325 (14%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
+ + HA ++E++ G L A+F GS EG+ DV I G +W P+ +P
Sbjct: 33 YATAHAPALLELENGDLLCAWFAGSYEGSADVSIVCARLPKGAEKWSKPVRVSTDPQRSE 92
Query: 82 WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILG 133
NP LF P + Y G++ +++ ++R S + G TWS + L P G
Sbjct: 93 QNPSLFLAPDEEIWAVYTAQLDRAEGKDNMQYTAVVRRQRSNDGGCTWSNYDVLFPE-EG 151
Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNESLS 187
+PI +L NG + G+ + + ++ G S ++ D G +WR+ + IP +S
Sbjct: 152 TFCRQPIQVLSNGRWIFGNWICTDSASGLAGDPSAFRISDDGGNTWRR---VDIP-QSHG 207
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P + L MRS + SES D G +WS PT L N NS I VKL
Sbjct: 208 RVHPNVVELEDGHLAAFMRS-RAADFIYRSESFDNGDSWSVPVPTLLPNNNSSISAVKLS 266
Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
R+ +AYN R + VALS+D G ++ + EEN
Sbjct: 267 SSRIAIAYNPTCAPNPRADEAAWPGLRCPVAVALSEDGGATFPLIRYMELGEGYVGEENS 326
Query: 286 A--MEFSYPAVIQASDGSVHITYTY 308
++ YP ++Q D S+H+ + Y
Sbjct: 327 TNNRQYEYPYLMQGRDDSLHLAFAY 351
>gi|295090883|emb|CBK76990.1| Predicted neuraminidase (sialidase) [Clostridium cf.
saccharolyticum K10]
Length = 386
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 151/338 (44%), Gaps = 55/338 (16%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGR-WQSPIIADEEPNVPM 81
+ + HA +VE+ G L +F G+ EG+ DV I KDG+ W P+ +P
Sbjct: 33 YPTAHAPAMVELPNGDLLCCWFAGTYEGSADVHIVCSRLPKDGQAWLPPVDISSDPTRSE 92
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
NP LF P + + Y + Q+ M+ +S + G+TW E + P G
Sbjct: 93 QNPSLFYGPDSAVWAMYTAQLDRQEGKDNMQFTSVVRCQKSTDGGLTWGPYETIFPE-EG 151
Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
+PI +L NG + C S + + + ++ D G++W+K + +P +
Sbjct: 152 TFCRQPIQILSNGRWIFSNWICTDSADGLSGDPTAFRISDDQGKTWKK---VMMPESNGH 208
Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+V++ P H L MR+ R+ SES D G TWS +PT L N NS I
Sbjct: 209 VHANVVELEPGH-----LAAFMRNREAY-RIHRSESFDWGETWSKPEPTPLPNNNSSISA 262
Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
VKL+ GR+ +AYN R + VALS+D G ++ +
Sbjct: 263 VKLQSGRIAIAYNPTCTPDPQPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGFIG 322
Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
+EN ++ YP ++Q DG +H+ Y R +K+V
Sbjct: 323 DENKTNNKQYEYPYIMQGKDGMIHLAYAARTRQGVKYV 360
>gi|291087517|ref|ZP_06346666.2| BNR/Asp-box repeat protein [Clostridium sp. M62/1]
gi|291074881|gb|EFE12245.1| BNR/Asp-box repeat protein [Clostridium sp. M62/1]
Length = 391
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 151/338 (44%), Gaps = 55/338 (16%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGR-WQSPIIADEEPNVPM 81
+ + HA +VE+ G L +F G+ EG+ DV I KDG+ W P+ +P
Sbjct: 38 YPTAHAPAMVELPNGDLLCCWFAGTYEGSADVHIVCSRLPKDGQAWLPPVDISSDPTRSE 97
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
NP LF P + + Y + Q+ M+ +S + G+TW E + P G
Sbjct: 98 QNPSLFYGPDSAVWAMYTAQLDRQEGKDNMQFTSVVRCQKSTDGGLTWGPYETIFPE-EG 156
Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
+PI +L NG + C S + + + ++ D G++W+K + +P +
Sbjct: 157 TFCRQPIQILSNGRWIFSNWICTDSADGLSGDPTAFRISDDQGKTWKK---VMMPESNGH 213
Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+V++ P H L MR+ R+ SES D G TWS +PT L N NS I
Sbjct: 214 VHANVVELEPGH-----LAAFMRNREAY-RIHRSESFDWGETWSKPEPTPLPNNNSSISA 267
Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
VKL+ GR+ +AYN R + VALS+D G ++ +
Sbjct: 268 VKLQSGRIAIAYNPTCTPDPQPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGFIG 327
Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
+EN ++ YP ++Q DG +H+ Y R +K+V
Sbjct: 328 DENKTNNKQYEYPYIMQGKDGMIHLAYAARTRQGVKYV 365
>gi|323525540|ref|YP_004227693.1| hypothetical protein BC1001_1190 [Burkholderia sp. CCGE1001]
gi|323382542|gb|ADX54633.1| hypothetical protein BC1001_1190 [Burkholderia sp. CCGE1001]
Length = 401
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 150/357 (42%), Gaps = 78/357 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L A+FGG+ EG PD+ I+L G W P+ ++P NPV
Sbjct: 40 HAANLLGLANGDLLCAWFGGTQEGVPDISIYLSRLAQGSSTWSEPLRLSDDPTRSEQNPV 99
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL- 142
LF P L L Y + + ++R S + G TW + L + G +PI++
Sbjct: 100 LFAAPDGKLWLIYTAQLSGHQNTSIVRRRVSGDNGHTWGPIDTL-FDLPGTFVRQPIVVA 158
Query: 143 ENGLLLCG------SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQP 191
+G LC + E W N S + + D G +W + +P V IQP
Sbjct: 159 RDGAWLCPVFLCRVAPGERWSGNDDVSAVMRSTDNGATWTAHD---VPGSVGCVHMNIQP 215
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+ + +L L RS + MS S D G+ WS +P L N NS I V L +G L
Sbjct: 216 L----RDGSLVALFRS-RWADHIHMSRSAD-GIAWSEPEPLALPNNNSSIQFVTLANGHL 269
Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
L +N S R + +A+S D
Sbjct: 270 ALVFNNSSAAQSTGRRASLYDDIEDAEDSGTLATQAASARGTAFWGAPRAPMTLAVSMDG 329
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G +W LE + L EFSYP+++Q++DG +HI YTY R +IK+V
Sbjct: 330 GRTWPVMRNLETGDGYCMTNNSTDKLNREFSYPSIVQSADGRLHIAYTYFRQRIKYV 386
>gi|225389863|ref|ZP_03759587.1| hypothetical protein CLOSTASPAR_03612 [Clostridium asparagiforme
DSM 15981]
gi|225044056|gb|EEG54302.1| hypothetical protein CLOSTASPAR_03612 [Clostridium asparagiforme
DSM 15981]
Length = 386
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 156/338 (46%), Gaps = 55/338 (16%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
+K+ H ++E++ G L +F GS EG DV + K G RW+ P++ +P
Sbjct: 33 YKTAHGPGLLELENGDMLCVWFAGSFEGNADVNVICARLKKGADRWEEPVLVSHDPERSE 92
Query: 82 WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
NP LF P + Y G++ +++ ++ +S + G TWS+ E + P G
Sbjct: 93 QNPSLFSGPDGMVWAMYTSQLSRVEGKDNMQFTSVIRCQKSADGGETWSDYETVFPE-EG 151
Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNES-- 185
+PI +L NG + G+ + + + G + ++ D G++W+ + +P +
Sbjct: 152 SFCRQPIQVLSNGRWIFGNWICTDSELGLTGDPTAFRISDDQGKTWKM---VDMPESNGA 208
Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+V++ P H L MRS + SES D G TW+ PT L N NS I
Sbjct: 209 VHANVVELEPGH-----LAAFMRS-RAADNIYRSESLDYGDTWTKPVPTVLPNNNSSISA 262
Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------- 282
VKL+ GR+ +AYN R + VALS+D G ++ +E
Sbjct: 263 VKLQSGRIAVAYNPTCTPCPTPGIASWPGLRCPVAVALSEDGGLTFPMIRYMERGEGYMG 322
Query: 283 -ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
EN ++ YP ++Q DG +H+ + Y +R +K++
Sbjct: 323 DENKTNNRQYEYPYIMQGRDGRLHLAFAYKDRLSVKYM 360
>gi|312142943|ref|YP_003994389.1| BNR repeat-containing glycosyl hydrolase [Halanaerobium
hydrogeniformans]
gi|311903594|gb|ADQ14035.1| BNR repeat-containing glycosyl hydrolase [Halanaerobium
hydrogeniformans]
Length = 357
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 144/327 (44%), Gaps = 56/327 (17%)
Query: 43 AYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY-- 98
+FGGSCEG D+ I K G W ++ + N NP+L+++ L L Y
Sbjct: 29 TWFGGSCEGKADISIHYSRLKKGEVSWSKAVVLSGDENRSEQNPILYEVKPGHLWLLYTA 88
Query: 99 KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNS 158
+IG + ++RS + G WS+ E L +N PI LEN +L + +
Sbjct: 89 QIGVHQETAVVRIRRSDDYGHNWSKAEDLFEDEGLFVRNPPIKLENDDILLPAYYCQKSE 148
Query: 159 WG------SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIG 212
G S ++++ D G +W++ + IP ES ++ N+ + R+
Sbjct: 149 TGFLGDDYSVVKLSSDGGSTWKE---VSIP-ESKGLVHMSAVELDNKDIVGFFRNRRA-D 203
Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV---------- 262
+ + S D G+TWS +P +L N NS I +KLK G+L L Y+ V++ +
Sbjct: 204 YIYRTFSKDQGITWSVPEPLELPNNNSSIQCLKLKSGKLALVYDDVNKHISPPTVDMPPW 263
Query: 263 ---------------------------LKVALSKDDGDSW--HDALTLEENL--AMEFSY 291
L ++LS D G +W L +E L EFSY
Sbjct: 264 FDKKDMENVGVKEVEKPSAVWGVKRNPLVISLSDDGGRTWPQKKELMTDEGLEGEPEFSY 323
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
P+++Q G +HI YTY R I++V +
Sbjct: 324 PSLVQDDTGLIHIAYTYLREYIRYVTI 350
>gi|187927659|ref|YP_001898146.1| glycosyl hydrolase family protein [Ralstonia pickettii 12J]
gi|187724549|gb|ACD25714.1| glycosyl hydrolase BNR repeat-containing protein [Ralstonia
pickettii 12J]
Length = 391
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 151/359 (42%), Gaps = 69/359 (19%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
P HA+ + + G L +FGG+ EG PDV I+L G WQ E+
Sbjct: 32 PCVQNHAANLHALPNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTSEWQPATKLSEDTTRS 91
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
NPVLF P+ L L Y + + ++R S + G +W + L +
Sbjct: 92 EQNPVLFTTPTGELWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIDTLFDQAGTFVRQP 151
Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
++L +G + LC + E W N S + ++ D GR+W ++ +P +S+ +
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDISAVMISEDQGRTWSRHD---VP-DSIGCVH 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
A+ +L L RS + S S DG TWS +PT L N NS I V L++G
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHIYASRSQDG-RTWSAPQPTTLPNNNSSIQFVALRNGH 265
Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
L L YN R L +A+S+D
Sbjct: 266 LALVYNESDASHSTGRRTSLYDDIEDAEDHGELRDQQASTRGTAFWGAPRAPLSLAISED 325
Query: 271 DGDSWH-------DALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W D + N A E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRNIEIGDGYCMTNNSADQRNREYSYPSIVQGNDGALHIAFTYFRQRIKYVTV 384
>gi|238024875|ref|YP_002909107.1| BNR repeat-containing glycosyl hydrolase [Burkholderia glumae BGR1]
gi|237879540|gb|ACR31872.1| BNR repeat-containing glycosyl hydrolase [Burkholderia glumae BGR1]
Length = 407
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 76/357 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
CHA+ ++ + G L A+FGG+ EG PD+ I+ + G W P+ ++ NP
Sbjct: 39 CHAANLLALANGDLLCAWFGGTQEGVPDISIYCSRLEKGGASWSEPVRLSDDTTRSEQNP 98
Query: 85 VLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI 140
VLF P L L Y + + ++R S ++G TW + L PG +PI
Sbjct: 99 VLFAAPDGDLWLLYTAQLSGHQNTSIVRRRISKDQGRTWGAIDTLFDKPGTF---VRQPI 155
Query: 141 LL-ENGLLLCG---SSVESWNSWGSWMEVTV-----DAGRSWRKYGPIYIPNESLSVIQP 191
++ +G LC V+ W +V+V D G +W ++ +P +S+ +
Sbjct: 156 VVARDGAWLCPVFLCRVQPGERWSGNDDVSVVMRSTDGGATWSEHA---VP-DSVGCVHM 211
Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
A+ TL L RS + S S D GL WS + L N NS I V L +G L
Sbjct: 212 NIQMLADGTLLALYRS-RWADHIYASRSRD-GLAWSAPEALDLPNNNSSIQFVTLANGHL 269
Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
L +N S R + +A+S D
Sbjct: 270 GLVFNASSAAQSTARRASLYDDIEDSEDSGELVAQAASARGTAFWGAPRAPMTLAISLDG 329
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G +W LE + L EFSYP++ Q+ DG +HI YTY R +IK+V
Sbjct: 330 GRTWPVRRNLETGDGYCMTNNSVDKLNREFSYPSIAQSPDGRLHIAYTYFRQRIKYV 386
>gi|433611690|ref|YP_007195151.1| putative neuraminidase (sialidase) [Sinorhizobium meliloti GR4]
gi|429556632|gb|AGA11552.1| putative neuraminidase (sialidase) [Sinorhizobium meliloti GR4]
Length = 395
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141
Query: 140 ILLENGLLLCGSSV----ESW------------NSWGSWME-----VTVDAGRSWRKYGP 178
+ L G + V E+W W + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDEAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+DGRL + N V+ R
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|384533958|ref|YP_005716622.1| hypothetical protein [Sinorhizobium meliloti BL225C]
gi|333816134|gb|AEG08801.1| hypothetical protein SinmeB_4519 [Sinorhizobium meliloti BL225C]
Length = 395
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+DGRL + N V+ R
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|296102356|ref|YP_003612502.1| hypothetical protein ECL_02001 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
gi|295056815|gb|ADF61553.1| hypothetical protein ECL_02001 [Enterobacter cloacae subsp. cloacae
ATCC 13047]
Length = 387
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 149/356 (41%), Gaps = 70/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W ++ NPV
Sbjct: 32 HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLSPGSQQWSEAEKLSDDATRSEQNPV 91
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + ++S + G TW + L PG + ++
Sbjct: 92 LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSRDLGKTWGDIATLLDKPGTF--IRQPIVV 149
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
LENG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 150 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDGGKTWRD---VEVP-DSLGCVHMNI 205
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
N TL L RS + S+S D G +WS +PT L N NS I L G L L
Sbjct: 206 TPLRNGTLVALFRS-RWADNIYYSQSTDNGESWSVPEPTVLPNNNSSIQVTTLASGELAL 264
Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
+N +S R + VA+S D G SW
Sbjct: 265 VFNLMSAAGAQERRTSLYDEIDDGDGRKEPVVTHGRAAFWGAPRAPMTVAISPDGGKSWP 324
Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DGS+HI YTY R IK+V + P
Sbjct: 325 WQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTYFRQAIKYVRVTP 380
>gi|145589428|ref|YP_001156025.1| BNR/Asp-box repeat-containing protein [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
gi|145047834|gb|ABP34461.1| BNR/Asp-box repeat protein [Polynucleobacter necessarius subsp.
asymbioticus QLW-P1DMWA-1]
Length = 417
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 143/321 (44%), Gaps = 35/321 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGRWQSPIIADEEPNV----- 79
S HA++++ + G +F GS EGAPDV I K G+W +P + +
Sbjct: 75 SVHAASLIALKDGGIRAFWFAGSREGAPDVVINTSALDKSGKWSAPAVVMDRVTAEKGLG 134
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGVTWSEREQLPPGIL 132
+ NPV +L L LF+ + + W+G S ++G++W + ++L I
Sbjct: 135 RYIAKLGNPVPSRLADGRLQLFF-VTVSLGGWAGSSISSVISEDEGLSWGKPQRL---IS 190
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS----V 188
P N L+++ L ++ W+ G R G I +S
Sbjct: 191 SPLLNLSTLVKSPALQFTDGRLGLPAYHEWIG---RFGEFLRIEGAQVIDKRRMSSGRSA 247
Query: 189 IQPVPFHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
IQPV F + + R G + +S++ + G +W A ++ NPNS + G+
Sbjct: 248 IQPVVFVNSAQDASAYFRQTRSAGLAKHIPVSQTQNAGQSWQAAGDLEIANPNSAVAGLT 307
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEENLAM------EFSYPAVIQA 297
L +G LLA N + G ++ L D S WH LE + A+ E+SYP ++ A
Sbjct: 308 LSNGARLLALNNIETGRHRLVLMMSDPKSGQWHVVDVLENDEALPDDQRKEYSYPYLLSA 367
Query: 298 SDGSVHITYTYNRTQIKHVVL 318
+ H+ YT++R +I+HV L
Sbjct: 368 NGNDAHLVYTWDRKKIRHVYL 388
>gi|220914973|ref|YP_002490281.1| BNR repeat-containing glycosyl hydrolase [Methylobacterium nodulans
ORS 2060]
gi|219952724|gb|ACL63114.1| BNR repeat-containing glycosyl hydrolase [Methylobacterium nodulans
ORS 2060]
Length = 398
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 148/358 (41%), Gaps = 72/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G +FGG+ EG D+ ++ G W ++P NP+
Sbjct: 37 HAANLMPLANGDLACVWFGGTQEGMADISVYFSRLAKGSDAWSPAEKLSDDPARSEQNPI 96
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPS---KNKPI 140
LF P+ L L + + + ++R S + G +W E L P G + P+
Sbjct: 97 LFPAPTGDLWLIWTAQVSGNQDTAFVRRRLSRDHGRSWGPIETLFPRREGCGTFVRQPPV 156
Query: 141 LLENG-----LLLCGSSVES-W--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C S + W + S ++++ DAGR+W + + +P+ + V +
Sbjct: 157 VLANGDWLLPIFHCPSVPGAKWVGDDDTSAVKISSDAGRTWHE---VAVPDSTGCVHMSI 213
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + S S DGG TWS PT L N NS I +L +G L
Sbjct: 214 -MPLDDGTLLALFRS-RWADAIYESRSTDGGATWSAPVPTALPNNNSSIQATRLANGHLA 271
Query: 253 LAYNTVS----------------------------------------RGVLKVALSKDDG 272
L YN V+ R L +A+S+D G
Sbjct: 272 LVYNAVNAEAATERRASLYDEIEDEAGESAAPLPAAPGKRTAFWGTPRAPLTLAISEDGG 331
Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W +E + E SYP+V Q DG++H+ +TY+R IK+V L
Sbjct: 332 RTWPHRRDVETGDGYCMTNNSRDRTNRELSYPSVAQTPDGALHVAFTYHRQAIKYVRL 389
>gi|16263893|ref|NP_436685.1| hypothetical protein SM_b20145 [Sinorhizobium meliloti 1021]
gi|15140017|emb|CAC48545.1| HYPOTHETICAL PROTEIN SM_b20145 [Sinorhizobium meliloti 1021]
Length = 395
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAVSEGRY 141
Query: 140 ILLENGLLLCGSSV----ESW------------NSWGSWME-----VTVDAGRSWRKYGP 178
+ L G + V E+W W + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDEAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+DGRL + N V+ R
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|373106197|ref|ZP_09520500.1| hypothetical protein HMPREF9623_00164 [Stomatobaculum longum]
gi|371652572|gb|EHO17980.1| hypothetical protein HMPREF9623_00164 [Stomatobaculum longum]
Length = 387
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 148/338 (43%), Gaps = 55/338 (16%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
F + HA +VE+ G L +F G+ EG+ D+ I ++G +W + +P
Sbjct: 34 FPTAHAPAMVELPNGDLLACWFAGTYEGSADIHIVCSALRNGETKWSAVTDLSRDPKKSE 93
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
NP LF P + Y + + M+ +S++ G +W E E + P G
Sbjct: 94 QNPSLFLGPDGAVWAMYTAQDDRESGKDNMQFTAEVRCQKSFDGGRSWGEYETVFPEP-G 152
Query: 134 PSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
+PI +L NG LC S + + ++ D G+SW+ + +P +
Sbjct: 153 TFCRQPIQILSNGRWIFSNWLCTDSPDGLAGDPTGFRISDDQGKSWKL---VMMPKSNGH 209
Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+VI+ P H L MR R+ SES D G +WS PT L N NSGI
Sbjct: 210 VHANVIELSPGH-----LVAFMRHREA-KRIHRSESFDWGESWSEPVPTPLPNNNSGISA 263
Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
VK+K GR+ +AYN S R + VALS+D G ++ +
Sbjct: 264 VKMKSGRIAIAYNPTSAPESQSGKAAWPGLRCPVAVALSEDGGKTFPLIRWMERGEGFIG 323
Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
+EN ++ YP ++Q+ DG +H+ Y R IK++
Sbjct: 324 DENRTNNRQYEYPYLMQSKDGMLHLAYAARTREGIKYL 361
>gi|110640892|ref|YP_668620.1| hypothetical protein ECP_0693 [Escherichia coli 536]
gi|191173952|ref|ZP_03035470.1| hypthetical protein [Escherichia coli F11]
gi|300996592|ref|ZP_07181486.1| BNR/Asp-box repeat protein [Escherichia coli MS 200-1]
gi|422378283|ref|ZP_16458506.1| BNR/Asp-box repeat protein [Escherichia coli MS 60-1]
gi|432470061|ref|ZP_19712114.1| hypothetical protein A15M_00940 [Escherichia coli KTE206]
gi|432712351|ref|ZP_19947401.1| hypothetical protein WCI_00715 [Escherichia coli KTE8]
gi|433076851|ref|ZP_20263415.1| hypothetical protein WIU_00724 [Escherichia coli KTE131]
gi|110342484|gb|ABG68721.1| hypothetical membrane protein [Escherichia coli 536]
gi|190905728|gb|EDV65349.1| hypthetical protein [Escherichia coli F11]
gi|300304489|gb|EFJ59009.1| BNR/Asp-box repeat protein [Escherichia coli MS 200-1]
gi|324010400|gb|EGB79619.1| BNR/Asp-box repeat protein [Escherichia coli MS 60-1]
gi|431000128|gb|ELD16202.1| hypothetical protein A15M_00940 [Escherichia coli KTE206]
gi|431259302|gb|ELF51676.1| hypothetical protein WCI_00715 [Escherichia coli KTE8]
gi|431600571|gb|ELI70239.1| hypothetical protein WIU_00724 [Escherichia coli KTE131]
Length = 389
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTLEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|432552639|ref|ZP_19789370.1| hypothetical protein A1S3_01027 [Escherichia coli KTE47]
gi|431086924|gb|ELD92940.1| hypothetical protein A1S3_01027 [Escherichia coli KTE47]
Length = 389
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ IW G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISIWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|345011191|ref|YP_004813545.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
violaceusniger Tu 4113]
gi|344037540|gb|AEM83265.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
violaceusniger Tu 4113]
Length = 401
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 149/370 (40%), Gaps = 75/370 (20%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
AP HA+ + + G +FGG+ EG PD+ +W G W P+ ++
Sbjct: 32 APAVQNHAANLTVLPGGDLGCVWFGGTQEGVPDISVWFSRLAPGSGTWTEPVRLSDDDTR 91
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGIL--GPS 135
NP+LF P+ L L Y + + + ++ S ++G TW L P G
Sbjct: 92 SEQNPLLFPTPAGELWLLYTAQRAGDQDTAEVRLRVSADEGATWDAPRTLFPATEAGGVF 151
Query: 136 KNKPI-LLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+P+ +L++G L + W + S + ++ D GR+WR+ +P +
Sbjct: 152 IRQPVAVLDSGRWLLPVFHCVATPGVKWVGDHDTSAVMISDDQGRTWRERP---VPGSTG 208
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
V V + T+ L RS V S S DGG TWS T+L N NS + V L
Sbjct: 209 CVHMNV-HQLPDTTVLALFRS-RWADAVYRSHSTDGGETWSEPVRTELPNNNSSVQYVPL 266
Query: 247 KDGRLLLAYNTVS-------------------------------------------RGVL 263
DGRL L YN S R +
Sbjct: 267 ADGRLALVYNHSSRADATARRVSLYDEIDDEGRTGEPPAPAATRDDAAPGAFWGAPRAPM 326
Query: 264 KVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRT 311
+ALS D+G +W L+ + L E SYP + Q DG++HI YTY+R
Sbjct: 327 TLALSSDNGLTWPVRRDLDTGDGHCLTNNSRDRLNRELSYPTIRQGFDGTLHIAYTYHRR 386
Query: 312 QIKHVVLLPN 321
IK+V + P+
Sbjct: 387 AIKYVRVTPD 396
>gi|334319973|ref|YP_004556602.1| hypothetical protein [Sinorhizobium meliloti AK83]
gi|334097712|gb|AEG55722.1| hypothetical protein Sinme_4022 [Sinorhizobium meliloti AK83]
Length = 395
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ G +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAER 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ N S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L+ GRL + N V+ R
Sbjct: 257 SSIAVVRLRGGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E+ E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLERDDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|307132546|ref|YP_003884562.1| hypothetical protein Dda3937_02632 [Dickeya dadantii 3937]
gi|306530075|gb|ADN00006.1| expressed protein [Dickeya dadantii 3937]
Length = 391
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 152/360 (42%), Gaps = 80/360 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I+L G+W + ++P NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVKGSGQWTPAVKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + ++ +S ++G TW + L PG +PI
Sbjct: 96 LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSLDQGYTWGDIGTLLEQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
+L NG + C + E W N S ++++ D G++WR+ +PN + V I
Sbjct: 153 VLPNGDWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P+ + +L L RS + S S D G TWS PT L N NS I L +G
Sbjct: 210 TPL----KDGSLLALFRS-RWADFIYRSHSTDSGATWSEPVPTVLPNNNSSIQVTTLDNG 264
Query: 250 RLLLAYNTVS---------------------------------------RGVLKVALSKD 270
L L +N +S R + +A+S+D
Sbjct: 265 HLALVFNAMSAEGATERRLSLYDEIEDEEETDAKMPEVSSGRSAFWGAPRAPMTLAISED 324
Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G +W LE E L EFSYP++ Q DG +HI +TY R IK+V +
Sbjct: 325 GGKTWPWQRNLEVGDGYCMTNNSTEKLNREFSYPSIKQGPDGKLHIAFTYFRQAIKYVCV 384
>gi|306812905|ref|ZP_07447098.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli NC101]
gi|432380316|ref|ZP_19623273.1| hypothetical protein WCU_00459 [Escherichia coli KTE15]
gi|432386086|ref|ZP_19628984.1| hypothetical protein WCY_01334 [Escherichia coli KTE16]
gi|432512887|ref|ZP_19750123.1| hypothetical protein A17M_00740 [Escherichia coli KTE224]
gi|432610363|ref|ZP_19846535.1| hypothetical protein A1UG_00716 [Escherichia coli KTE72]
gi|432645121|ref|ZP_19880921.1| hypothetical protein A1W5_00867 [Escherichia coli KTE86]
gi|432654919|ref|ZP_19890632.1| hypothetical protein A1WE_01030 [Escherichia coli KTE93]
gi|432697999|ref|ZP_19933166.1| hypothetical protein A31M_00742 [Escherichia coli KTE169]
gi|432744619|ref|ZP_19979319.1| hypothetical protein WGG_00735 [Escherichia coli KTE43]
gi|432903024|ref|ZP_20112641.1| hypothetical protein A13Y_00999 [Escherichia coli KTE194]
gi|432942639|ref|ZP_20139898.1| hypothetical protein A13C_04397 [Escherichia coli KTE183]
gi|432970805|ref|ZP_20159683.1| hypothetical protein A15O_01367 [Escherichia coli KTE207]
gi|432984321|ref|ZP_20173059.1| hypothetical protein A175_00773 [Escherichia coli KTE215]
gi|433037625|ref|ZP_20225241.1| hypothetical protein WIE_00969 [Escherichia coli KTE113]
gi|433081575|ref|ZP_20268049.1| hypothetical protein WIW_00713 [Escherichia coli KTE133]
gi|433100201|ref|ZP_20286311.1| hypothetical protein WK5_00754 [Escherichia coli KTE145]
gi|433143269|ref|ZP_20328436.1| hypothetical protein WKO_00802 [Escherichia coli KTE168]
gi|433187477|ref|ZP_20371596.1| hypothetical protein WGS_00551 [Escherichia coli KTE88]
gi|305853668|gb|EFM54107.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli NC101]
gi|430909592|gb|ELC30958.1| hypothetical protein WCY_01334 [Escherichia coli KTE16]
gi|430911425|gb|ELC32711.1| hypothetical protein WCU_00459 [Escherichia coli KTE15]
gi|431044445|gb|ELD54718.1| hypothetical protein A17M_00740 [Escherichia coli KTE224]
gi|431151181|gb|ELE52216.1| hypothetical protein A1UG_00716 [Escherichia coli KTE72]
gi|431183899|gb|ELE83672.1| hypothetical protein A1W5_00867 [Escherichia coli KTE86]
gi|431194313|gb|ELE93578.1| hypothetical protein A1WE_01030 [Escherichia coli KTE93]
gi|431246487|gb|ELF40751.1| hypothetical protein A31M_00742 [Escherichia coli KTE169]
gi|431294901|gb|ELF85078.1| hypothetical protein WGG_00735 [Escherichia coli KTE43]
gi|431436554|gb|ELH18069.1| hypothetical protein A13Y_00999 [Escherichia coli KTE194]
gi|431453691|gb|ELH34075.1| hypothetical protein A13C_04397 [Escherichia coli KTE183]
gi|431485942|gb|ELH65599.1| hypothetical protein A15O_01367 [Escherichia coli KTE207]
gi|431506064|gb|ELH84668.1| hypothetical protein A175_00773 [Escherichia coli KTE215]
gi|431554941|gb|ELI28805.1| hypothetical protein WIE_00969 [Escherichia coli KTE113]
gi|431605410|gb|ELI74799.1| hypothetical protein WIW_00713 [Escherichia coli KTE133]
gi|431622324|gb|ELI91105.1| hypothetical protein WK5_00754 [Escherichia coli KTE145]
gi|431665665|gb|ELJ32379.1| hypothetical protein WKO_00802 [Escherichia coli KTE168]
gi|431708911|gb|ELJ73413.1| hypothetical protein WGS_00551 [Escherichia coli KTE88]
Length = 389
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|432582750|ref|ZP_19819160.1| hypothetical protein A1SM_01972 [Escherichia coli KTE57]
gi|433119241|ref|ZP_20304949.1| hypothetical protein WKC_00679 [Escherichia coli KTE157]
gi|431119766|gb|ELE22765.1| hypothetical protein A1SM_01972 [Escherichia coli KTE57]
gi|431648466|gb|ELJ15862.1| hypothetical protein WKC_00679 [Escherichia coli KTE157]
Length = 389
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|312965114|ref|ZP_07779351.1| BNR/Asp-box repeat family protein [Escherichia coli 2362-75]
gi|418995571|ref|ZP_13543185.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1A]
gi|312290205|gb|EFR18088.1| BNR/Asp-box repeat family protein [Escherichia coli 2362-75]
gi|377848961|gb|EHU13937.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1A]
Length = 370
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W GR W + + NPV
Sbjct: 14 HAANILPMPDGALMCVWFAGTQEGIADISVWGSRLPAGRTQWSDAVKLSHDDTRSEQNPV 73
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 74 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 130
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 131 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 186
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 187 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 245
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 246 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 305
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 306 PWRRNLDEGDGNCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 362
>gi|215485704|ref|YP_002328135.1| hypothetical protein E2348C_0564 [Escherichia coli O127:H6 str.
E2348/69]
gi|417754437|ref|ZP_12402532.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2B]
gi|419000826|ref|ZP_13548385.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1B]
gi|419006338|ref|ZP_13553794.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1C]
gi|419012203|ref|ZP_13559568.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1D]
gi|419017107|ref|ZP_13564433.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1E]
gi|419022799|ref|ZP_13570041.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2A]
gi|419027608|ref|ZP_13574807.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2C]
gi|419033150|ref|ZP_13580248.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2D]
gi|419038389|ref|ZP_13585448.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2E]
gi|215263776|emb|CAS08112.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
gi|377851139|gb|EHU16094.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1C]
gi|377853473|gb|EHU18372.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1B]
gi|377862968|gb|EHU27775.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1D]
gi|377867081|gb|EHU31845.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1E]
gi|377868437|gb|EHU33181.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2A]
gi|377879383|gb|EHU43956.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2B]
gi|377883569|gb|EHU48087.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2D]
gi|377885491|gb|EHU49986.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2C]
gi|377897990|gb|EHU62353.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2E]
Length = 389
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W GR W + + NPV
Sbjct: 33 HAANILPMPDGALMCVWFAGTQEGIADISVWGSRLPAGRTQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGNCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|26246649|ref|NP_752689.1| hypthetical protein [Escherichia coli CFT073]
gi|222155418|ref|YP_002555557.1| hypothetical protein LF82_075 [Escherichia coli LF82]
gi|227884352|ref|ZP_04002157.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
[Escherichia coli 83972]
gi|300989966|ref|ZP_07179042.1| BNR/Asp-box repeat protein [Escherichia coli MS 45-1]
gi|301046002|ref|ZP_07193183.1| BNR/Asp-box repeat protein [Escherichia coli MS 185-1]
gi|386628221|ref|YP_006147941.1| hypothetical protein i02_0728 [Escherichia coli str. 'clone D i2']
gi|386633141|ref|YP_006152860.1| hypothetical protein i14_0728 [Escherichia coli str. 'clone D i14']
gi|386638042|ref|YP_006104840.1| hypothetical protein ECABU_c07260 [Escherichia coli ABU 83972]
gi|387615950|ref|YP_006118972.1| hypothetical protein NRG857_03035 [Escherichia coli O83:H1 str. NRG
857C]
gi|417288700|ref|ZP_12075985.1| BNR/Asp-box repeat protein [Escherichia coli TW07793]
gi|419915280|ref|ZP_14433646.1| hypothetical protein ECKD1_18892 [Escherichia coli KD1]
gi|422365231|ref|ZP_16445733.1| BNR/Asp-box repeat protein [Escherichia coli MS 153-1]
gi|422382509|ref|ZP_16462669.1| BNR/Asp-box repeat protein [Escherichia coli MS 57-2]
gi|425298838|ref|ZP_18688888.1| sialidase [Escherichia coli 07798]
gi|432410693|ref|ZP_19653375.1| hypothetical protein WG9_01172 [Escherichia coli KTE39]
gi|432430740|ref|ZP_19673185.1| hypothetical protein A13K_01028 [Escherichia coli KTE187]
gi|432435268|ref|ZP_19677669.1| hypothetical protein A13M_00974 [Escherichia coli KTE188]
gi|432455554|ref|ZP_19697755.1| hypothetical protein A15C_01344 [Escherichia coli KTE201]
gi|432503332|ref|ZP_19745069.1| hypothetical protein A17E_00383 [Escherichia coli KTE220]
gi|432522777|ref|ZP_19759916.1| hypothetical protein A17Y_00889 [Escherichia coli KTE230]
gi|432567463|ref|ZP_19803990.1| hypothetical protein A1SE_01039 [Escherichia coli KTE53]
gi|432591742|ref|ZP_19828071.1| hypothetical protein A1SS_01163 [Escherichia coli KTE60]
gi|432606509|ref|ZP_19842704.1| hypothetical protein A1U7_01506 [Escherichia coli KTE67]
gi|432650152|ref|ZP_19885914.1| hypothetical protein A1W7_01151 [Escherichia coli KTE87]
gi|432731355|ref|ZP_19966193.1| hypothetical protein WGK_01192 [Escherichia coli KTE45]
gi|432758434|ref|ZP_19992937.1| hypothetical protein A1S1_00552 [Escherichia coli KTE46]
gi|432782531|ref|ZP_20016717.1| hypothetical protein A1SY_01362 [Escherichia coli KTE63]
gi|432842911|ref|ZP_20076332.1| hypothetical protein A1YS_01061 [Escherichia coli KTE141]
gi|432977349|ref|ZP_20166174.1| hypothetical protein A15S_03260 [Escherichia coli KTE209]
gi|432994420|ref|ZP_20183036.1| hypothetical protein A17A_01501 [Escherichia coli KTE218]
gi|432998838|ref|ZP_20187378.1| hypothetical protein A17K_01174 [Escherichia coli KTE223]
gi|433056984|ref|ZP_20244068.1| hypothetical protein WIM_00763 [Escherichia coli KTE124]
gi|433086298|ref|ZP_20272695.1| hypothetical protein WIY_00744 [Escherichia coli KTE137]
gi|433114575|ref|ZP_20300391.1| hypothetical protein WKA_00763 [Escherichia coli KTE153]
gi|433124234|ref|ZP_20309823.1| hypothetical protein WKE_00731 [Escherichia coli KTE160]
gi|433138253|ref|ZP_20323540.1| hypothetical protein WKM_00536 [Escherichia coli KTE167]
gi|433148081|ref|ZP_20333147.1| hypothetical protein WKQ_00746 [Escherichia coli KTE174]
gi|433206808|ref|ZP_20390506.1| hypothetical protein WI1_00578 [Escherichia coli KTE97]
gi|433211553|ref|ZP_20395168.1| hypothetical protein WI3_00729 [Escherichia coli KTE99]
gi|442606900|ref|ZP_21021695.1| expressed protein [Escherichia coli Nissle 1917]
gi|26107048|gb|AAN79232.1|AE016757_136 Hypthetical protein [Escherichia coli CFT073]
gi|222032423|emb|CAP75162.1| hypthetical protein [Escherichia coli LF82]
gi|227838438|gb|EEJ48904.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
[Escherichia coli 83972]
gi|300301975|gb|EFJ58360.1| BNR/Asp-box repeat protein [Escherichia coli MS 185-1]
gi|300407241|gb|EFJ90779.1| BNR/Asp-box repeat protein [Escherichia coli MS 45-1]
gi|307552534|gb|ADN45309.1| conserved hypothetical protein [Escherichia coli ABU 83972]
gi|312945211|gb|ADR26038.1| hypothetical protein NRG857_03035 [Escherichia coli O83:H1 str. NRG
857C]
gi|315292047|gb|EFU51399.1| BNR/Asp-box repeat protein [Escherichia coli MS 153-1]
gi|324006293|gb|EGB75512.1| BNR/Asp-box repeat protein [Escherichia coli MS 57-2]
gi|355419120|gb|AER83317.1| hypothetical protein i02_0728 [Escherichia coli str. 'clone D i2']
gi|355424040|gb|AER88236.1| hypothetical protein i14_0728 [Escherichia coli str. 'clone D i14']
gi|386247492|gb|EII93665.1| BNR/Asp-box repeat protein [Escherichia coli TW07793]
gi|388384072|gb|EIL45814.1| hypothetical protein ECKD1_18892 [Escherichia coli KD1]
gi|408221841|gb|EKI45774.1| sialidase [Escherichia coli 07798]
gi|430937935|gb|ELC58187.1| hypothetical protein WG9_01172 [Escherichia coli KTE39]
gi|430956362|gb|ELC75038.1| hypothetical protein A13K_01028 [Escherichia coli KTE187]
gi|430966847|gb|ELC84210.1| hypothetical protein A13M_00974 [Escherichia coli KTE188]
gi|430984877|gb|ELD01494.1| hypothetical protein A15C_01344 [Escherichia coli KTE201]
gi|431042334|gb|ELD52826.1| hypothetical protein A17E_00383 [Escherichia coli KTE220]
gi|431054897|gb|ELD64466.1| hypothetical protein A17Y_00889 [Escherichia coli KTE230]
gi|431103296|gb|ELE07966.1| hypothetical protein A1SE_01039 [Escherichia coli KTE53]
gi|431133186|gb|ELE35184.1| hypothetical protein A1SS_01163 [Escherichia coli KTE60]
gi|431140408|gb|ELE42179.1| hypothetical protein A1U7_01506 [Escherichia coli KTE67]
gi|431193523|gb|ELE92859.1| hypothetical protein A1W7_01151 [Escherichia coli KTE87]
gi|431278263|gb|ELF69257.1| hypothetical protein WGK_01192 [Escherichia coli KTE45]
gi|431311398|gb|ELF99561.1| hypothetical protein A1S1_00552 [Escherichia coli KTE46]
gi|431332243|gb|ELG19486.1| hypothetical protein A1SY_01362 [Escherichia coli KTE63]
gi|431397439|gb|ELG80895.1| hypothetical protein A1YS_01061 [Escherichia coli KTE141]
gi|431482453|gb|ELH62156.1| hypothetical protein A15S_03260 [Escherichia coli KTE209]
gi|431509421|gb|ELH87672.1| hypothetical protein A17A_01501 [Escherichia coli KTE218]
gi|431514407|gb|ELH92249.1| hypothetical protein A17K_01174 [Escherichia coli KTE223]
gi|431573971|gb|ELI46760.1| hypothetical protein WIM_00763 [Escherichia coli KTE124]
gi|431609334|gb|ELI78659.1| hypothetical protein WIY_00744 [Escherichia coli KTE137]
gi|431637075|gb|ELJ05194.1| hypothetical protein WKA_00763 [Escherichia coli KTE153]
gi|431649547|gb|ELJ16894.1| hypothetical protein WKE_00731 [Escherichia coli KTE160]
gi|431665222|gb|ELJ31949.1| hypothetical protein WKM_00536 [Escherichia coli KTE167]
gi|431676974|gb|ELJ43057.1| hypothetical protein WKQ_00746 [Escherichia coli KTE174]
gi|431732618|gb|ELJ96069.1| hypothetical protein WI1_00578 [Escherichia coli KTE97]
gi|431736208|gb|ELJ99550.1| hypothetical protein WI3_00729 [Escherichia coli KTE99]
gi|441712207|emb|CCQ07672.1| expressed protein [Escherichia coli Nissle 1917]
Length = 389
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|345299303|ref|YP_004828661.1| hypothetical protein Entas_2142 [Enterobacter asburiae LF7a]
gi|345093240|gb|AEN64876.1| hypothetical protein Entas_2142 [Enterobacter asburiae LF7a]
Length = 388
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 150/357 (42%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + E+ NPV
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLARGSQQWSEAVKLSEDATRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S++ G +W + L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQVSGNQDTAIVRYRQSHDLGNSWGKIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C + E W N S ++++ D G++WR + +P ESL +
Sbjct: 150 VLNNGNWLLPVFYCRTQPGEKWVGNDDISAVKISEDQGKTWRD---VEVP-ESLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N L L RS + S+S D G +WS +PT L N NS + L+ G L
Sbjct: 206 ITPLKNGNLVALFRS-RWADNIYYSQSVDHGESWSVPEPTVLPNNNSSVQVTTLESGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L +N +S R + VA+S D G SW
Sbjct: 265 LVFNHMSAAGALERRASLYDEIDDGDGRKEPVVTSGRAAFWGAPRAPMTVAISPDGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q DG++HI YTY R IK+V + P
Sbjct: 325 PWQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQGQDGALHIAYTYFRQAIKYVRVTP 381
>gi|422369047|ref|ZP_16449450.1| BNR/Asp-box repeat protein [Escherichia coli MS 16-3]
gi|432897502|ref|ZP_20108411.1| hypothetical protein A13U_01159 [Escherichia coli KTE192]
gi|433027672|ref|ZP_20215547.1| hypothetical protein WIA_00762 [Escherichia coli KTE109]
gi|315299228|gb|EFU58482.1| BNR/Asp-box repeat protein [Escherichia coli MS 16-3]
gi|431429548|gb|ELH11476.1| hypothetical protein A13U_01159 [Escherichia coli KTE192]
gi|431545742|gb|ELI20389.1| hypothetical protein WIA_00762 [Escherichia coli KTE109]
Length = 389
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|384539696|ref|YP_005723780.1| hypothetical protein SM11_pD1447 [Sinorhizobium meliloti SM11]
gi|336038349|gb|AEH84279.1| hypothetical protein SM11_pD1447 [Sinorhizobium meliloti SM11]
Length = 395
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ G +W P +
Sbjct: 28 SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L S+ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + V++D GRSW+
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + S+ + P + R +V ES DGG TWS PT + N N
Sbjct: 198 LADLDGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I V+L++GRL + N V+ R
Sbjct: 257 SSIAVVRLREGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
+ V LS DDG ++ D + +E + E SYP +++ DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEGGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLPP 387
>gi|432800913|ref|ZP_20034900.1| hypothetical protein A1W3_01167 [Escherichia coli KTE84]
gi|431350871|gb|ELG37674.1| hypothetical protein A1W3_01167 [Escherichia coli KTE84]
Length = 389
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADIPVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|283785231|ref|YP_003365096.1| hypothetical protein ROD_15131 [Citrobacter rodentium ICC168]
gi|282948685|emb|CBG88277.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
Length = 390
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 152/379 (40%), Gaps = 76/379 (20%)
Query: 8 KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG-- 65
KG + E P+ HA+ ++ + G + +FGG+ EG D+ IW G
Sbjct: 16 KGALVETAMLPS---ACPQNHAANLLPLPDGSLMCVWFGGTQEGVADISIWGSRLSPGSH 72
Query: 66 RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWSGCMKRSYNKGVTWSE 123
+W + +P NPVLF P N L L + +I +RS + G +WS+
Sbjct: 73 QWSEAVRLSHDPTRSEQNPVLFLAPDNVLWLLWTAQIAGNQDTAIVRYRRSTDLGYSWSD 132
Query: 124 REQL--PPGILGPSKNKPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRS 172
L PG +PI +L NG + C + E W NS S ++++ D G+S
Sbjct: 133 TAVLLDTPGTF---IRQPITVLANGNWLLPVFYCRTKPGEKWVGNSDISAVKISSDGGQS 189
Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
WR I +SL + + L L RS + S+S D G +W+ PT
Sbjct: 190 WRD----AIVPQSLGCVHMNITTLPDGRLAALFRS-RWADSIYYSQSDDNGESWTAPTPT 244
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVS--------------------------------- 259
L N NS I L G L L YN +S
Sbjct: 245 ALPNNNSSIQVTTLSSGELALVYNHMSAEGAQERRASLYDEIDGGDSSRTEPEATNGKSA 304
Query: 260 -----RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSV 302
R + VA+S D G SW L+E EFSYP++ Q DG++
Sbjct: 305 FWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYCMTNNSQDKRNREFSYPSINQTPDGTL 364
Query: 303 HITYTYNRTQIKHVVLLPN 321
HI YT+ R IK+V + P+
Sbjct: 365 HIAYTWFRQAIKYVRVSPD 383
>gi|386018390|ref|YP_005940995.1| hypothetical protein PAJ_p0030 [Pantoea ananatis AJ13355]
gi|386076504|ref|YP_005990587.1| BNR repeat-containing glycosyl hydrolase [Pantoea ananatis PA13]
gi|327396476|dbj|BAK13897.1| hypothetical protein PAJ_p0030 [Pantoea ananatis AJ13355]
gi|354685372|gb|AER34739.1| BNR repeat-containing glycosyl hydrolase [Pantoea ananatis PA13]
Length = 385
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G L +FGG+ EG D+ +W G W + ++P NPV
Sbjct: 33 HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P+N L L + + + + ++S + G TW + L PG +PI
Sbjct: 93 LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G+SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS + +S S DGG TWS + T L N NS I L G L
Sbjct: 206 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLHSGELA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G +W
Sbjct: 265 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 324
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + +E EFSYP++ Q + G++HI YTY R IK+ + P
Sbjct: 325 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 380
>gi|432464652|ref|ZP_19706760.1| hypothetical protein A15K_00595 [Escherichia coli KTE205]
gi|433071739|ref|ZP_20258435.1| hypothetical protein WIS_00713 [Escherichia coli KTE129]
gi|433182227|ref|ZP_20366524.1| hypothetical protein WGO_00682 [Escherichia coli KTE85]
gi|430997403|gb|ELD13670.1| hypothetical protein A15K_00595 [Escherichia coli KTE205]
gi|431592830|gb|ELI63399.1| hypothetical protein WIS_00713 [Escherichia coli KTE129]
gi|431711315|gb|ELJ75668.1| hypothetical protein WGO_00682 [Escherichia coli KTE85]
Length = 389
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPVNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|172065238|ref|YP_001815950.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
[Burkholderia ambifaria MC40-6]
gi|171997480|gb|ACB68397.1| glycosyl hydrolase BNR repeat-containing protein [Burkholderia
ambifaria MC40-6]
Length = 395
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 147/355 (41%), Gaps = 74/355 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ + + G L +FGG+ EG PDV I+L G W SP ++P NPV
Sbjct: 39 HAANLHCLANGDLLCVWFGGTQEGIPDVSIYLSRLAAGTNAWSSPQKLSDDPTRSEQNPV 98
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L Y + + ++R S + GVTW E L PG + I
Sbjct: 99 LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGVTWGPVETLFDRPGTF--VRQPIIA 156
Query: 142 LENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+NG LC + E W N S + V+ D G SW + +P +S+ +
Sbjct: 157 LDNGTWLCPVFLCRTAPGERWVGNDDVSAVMVSDDQGASWSLH---EVP-DSVGCVHMNV 212
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L L RS V S S DG TWS +PT + N NS I L++G L L
Sbjct: 213 HVLHDGSLLALYRS-RWADFVYASRSTDG-RTWSAPEPTDVPNNNSSIQFTVLRNGHLAL 270
Query: 254 AYNT----------------------------------------VSRGVLKVALSKDDGD 273
+N R + +A+S+D G
Sbjct: 271 VFNESDASHANARRASLYDDIEDSEDSGALRDQKASARGTAFWGAPRAPMSLAISEDGGR 330
Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+W LE + E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 331 TWRRRRNLEIGDGYCMTNNSADRRNREYSYPSIVQSQDGALHIAFTWFRQKIKYV 385
>gi|412985180|emb|CCO20205.1| putative alpha-rhamnosidase [Bathycoccus prasinos]
Length = 586
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 142/330 (43%), Gaps = 75/330 (22%)
Query: 66 RWQSPIIADE--EPNVPMWNPVLFKLP-SNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+W +P +A + N W P LF +P ++ +LLFYK G + W G +KRS + G TW
Sbjct: 254 QWHAPKMAVKGTSSNDKCWRPQLFVVPQTHEVLLFYKRGASRETWEGFLKRSLDGGKTWL 313
Query: 123 EREQLPPGILGPSKNKPIL-LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPIY 180
E E+LPPGILGPSKNK + E+ +L S + +S +W+E GR W K GPI
Sbjct: 314 EEEKLPPGILGPSKNKVLRNSEDESILSASERWTRSSACAWIERAEREGRDWTVKSGPIR 373
Query: 181 IPNESLSV---------------------IQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + L I+P F+ + TL VL R R S S
Sbjct: 374 MEKKGLQERYGGGRMGDKAAEEEARDYGGIEPAIFYKDDGTLVVLARPAKTTERFPKSLS 433
Query: 220 C---DGGLTWSYAK--PTQLL-----NPNS-------------------GIDGVKLKDGR 250
GG + ++ P L PN+ G D KL DGR
Sbjct: 434 GVVHAGGRSKQSSRYFPKMLKAELKDEPNAQWSAFKEIADLPLPTVFGRGFDVEKLTDGR 493
Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY------------------- 291
++L N + + V +S+DDG+++ +A +E+ + S
Sbjct: 494 VVLVMNDPGKKGVSVKISRDDGETFREAFVIEDGDDEDESSGGSSGSSSSSGSSSSYTRD 553
Query: 292 PAVIQASDGSVHITYTYNR-TQIKHVVLLP 320
P + A+D +H+ Y N +IK VL P
Sbjct: 554 PVIHVANDAMIHVVYVANNGKKIKRAVLDP 583
>gi|378769380|ref|YP_005197854.1| BNR/Asp-box repeat family protein [Pantoea ananatis LMG 5342]
gi|365188868|emb|CCF11817.1| BNR/Asp-box repeat family protein [Pantoea ananatis LMG 5342]
Length = 393
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G L +FGG+ EG D+ +W G W + ++P NPV
Sbjct: 41 HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 100
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P+N L L + + + + ++S + G TW + L PG +PI
Sbjct: 101 LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 157
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G+SWR + +P +SL +
Sbjct: 158 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 213
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS + +S S DGG TWS + T L N NS I L G L
Sbjct: 214 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLHSGELA 272
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G +W
Sbjct: 273 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 332
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + +E EFSYP++ Q + G++HI YTY R IK+ + P
Sbjct: 333 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 388
>gi|440287602|ref|YP_007340367.1| putative neuraminidase (sialidase) [Enterobacteriaceae bacterium
strain FGI 57]
gi|440047124|gb|AGB78182.1| putative neuraminidase (sialidase) [Enterobacteriaceae bacterium
strain FGI 57]
Length = 389
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 147/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGTQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + +RS + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRRSTDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + +L N NS I L G L
Sbjct: 206 ITMLRDSTLVALFRS-RWADSIYISHSADHGESWSVPQAIELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DGS+HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTWWRQAIKYVRISP 381
>gi|301025188|ref|ZP_07188761.1| BNR/Asp-box repeat protein [Escherichia coli MS 69-1]
gi|419917195|ref|ZP_14435468.1| Putative Glycosyl hydrolase [Escherichia coli KD2]
gi|300396188|gb|EFJ79726.1| BNR/Asp-box repeat protein [Escherichia coli MS 69-1]
gi|388394582|gb|EIL55845.1| Putative Glycosyl hydrolase [Escherichia coli KD2]
Length = 389
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VVVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDSTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|399021574|ref|ZP_10723670.1| putative neuraminidase (sialidase) [Herbaspirillum sp. CF444]
gi|398091127|gb|EJL81577.1| putative neuraminidase (sialidase) [Herbaspirillum sp. CF444]
Length = 405
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 148/361 (40%), Gaps = 82/361 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ + + G L A+FGG+ EG PD+ I+L + G W +P+ ++ NPV
Sbjct: 44 HAANLASLSNGDLLCAWFGGTQEGIPDISIYLSRLRVGSDTWSAPVRLSDDATRSEQNPV 103
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF+ P L L Y + + ++R S ++G TW E L PG +PI+
Sbjct: 104 LFETPEGALWLIYTAQLSGHQNTSIVRRRISQDQGQTWGPVETLFDKPGTF---VRQPIV 160
Query: 142 LE-NGLLLC---GSSVESWNSWGSWMEVTV-----DAGRSWRKY---GPIYIPNESLSVI 189
+ +G LC V + W +++V D G SW + G + + ++ +
Sbjct: 161 VTGDGAWLCPVFMCRVAAGERWSGNDDISVVMRSTDRGSSWSAHEVPGSVGCVHMNIQAL 220
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
+ F R+ R + I S D G+TWS P L N NS I + L +G
Sbjct: 221 KSGGFLALYRS-----RWADHI----YSSRSDDGVTWSEPHPLALPNNNSSIQFLALANG 271
Query: 250 RLLLAYNTVS----------------------------------------RGVLKVALSK 269
L L +N S R + +A+S
Sbjct: 272 HLGLVFNNSSAAQSTGRRASLYDDIEDSEDSGELASQSASHRGTAFWGAPRAPMTLAISC 331
Query: 270 DDGDSW--------HDALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G +W D + N A EFSYP+++Q+ DG +HI YTY R IK+V
Sbjct: 332 DGGHTWPAMRNLEVGDGYCMTNNSADKRNREFSYPSIVQSDDGKLHIAYTYFRQNIKYVC 391
Query: 318 L 318
+
Sbjct: 392 V 392
>gi|432726746|ref|ZP_19961628.1| hypothetical protein WE3_01183 [Escherichia coli KTE18]
gi|432740432|ref|ZP_19975154.1| hypothetical protein WEE_01107 [Escherichia coli KTE23]
gi|432989745|ref|ZP_20178412.1| hypothetical protein A179_01513 [Escherichia coli KTE217]
gi|433109967|ref|ZP_20295842.1| hypothetical protein WK9_00828 [Escherichia coli KTE150]
gi|431275455|gb|ELF66484.1| hypothetical protein WE3_01183 [Escherichia coli KTE18]
gi|431286561|gb|ELF77387.1| hypothetical protein WEE_01107 [Escherichia coli KTE23]
gi|431497982|gb|ELH77198.1| hypothetical protein A179_01513 [Escherichia coli KTE217]
gi|431630902|gb|ELI99226.1| hypothetical protein WK9_00828 [Escherichia coli KTE150]
Length = 389
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRSSLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|432396574|ref|ZP_19639360.1| hypothetical protein WEI_01488 [Escherichia coli KTE25]
gi|432722204|ref|ZP_19957128.1| hypothetical protein WE1_01227 [Escherichia coli KTE17]
gi|430917546|gb|ELC38590.1| hypothetical protein WEI_01488 [Escherichia coli KTE25]
gi|431268003|gb|ELF59517.1| hypothetical protein WE1_01227 [Escherichia coli KTE17]
Length = 389
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|291619532|ref|YP_003522274.1| hypothetical Protein PANA_3979 [Pantoea ananatis LMG 20103]
gi|291154562|gb|ADD79146.1| Hypothetical Protein PANA_3979 [Pantoea ananatis LMG 20103]
Length = 393
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G L +FGG+ EG D+ +W G W + ++P NPV
Sbjct: 41 HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 100
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P+N L L + + + + ++S + G TW + L PG +PI
Sbjct: 101 LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 157
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G+SWR + +P +SL +
Sbjct: 158 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 213
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS + +S S DGG TWS + T L N NS I L G L
Sbjct: 214 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLLSGELA 272
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
L +N +S R + VA+S D G +W
Sbjct: 273 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 332
Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
D + +E EFSYP++ Q + G++HI YTY R IK+ + P
Sbjct: 333 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 388
>gi|116621776|ref|YP_823932.1| neuraminidase (sialidase)-like protein [Candidatus Solibacter
usitatus Ellin6076]
gi|116224938|gb|ABJ83647.1| neuraminidase (sialidase)-like protein [Candidatus Solibacter
usitatus Ellin6076]
Length = 447
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 157/353 (44%), Gaps = 74/353 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCE-GAPDVKI----WLQTFKDGRWQSPIIADEEPNVPMW 82
H+S++VE+ G LV +F GS E A DV I W T K +W +P + P P
Sbjct: 94 HSSSVVELPNGDLLVCWFHGSGELTADDVLIQAARWNHTTK--QWSAPFTLADTPGFPET 151
Query: 83 NPVLFKLPSNGLLLF----------------YKIGQEVQKWSGCMKRSYNKGVTW----- 121
NPVLF L S+ L F Y+I + Q+ SG K + +
Sbjct: 152 NPVLF-LDSHQRLFFAWPLIVAHKWETALMKYRISTDYQQPSGPPKWEFQDNIVLIPKDI 210
Query: 122 ----SEREQLPPGILGP-----------------------SKNKPILLENGLLLCGSSVE 154
E GP ++ P+ L +G +L +
Sbjct: 211 AALTKEFAGAEAAGTGPRADRAKKQIEHAEDEYFSRMGWFTRTHPLELPSGRILMPMYSD 270
Query: 155 SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--G 212
+ S+G M ++ D G +W PI IQP + +L +R NG
Sbjct: 271 GY-SYGI-MAISDDGGYTWTASQPIV----GEGCIQPSVVRKKDGSLVAYLRD-NGPPPK 323
Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRG--VLKVALSKD 270
R +S S D G+TW+ A+ T++ NP + ++ ++L++G ++ YN + +G L A+S D
Sbjct: 324 RAHISYSKDDGVTWTRARDTEIPNPGTSLENIRLRNGNWIMVYNDLEQGRYSLVAAMSDD 383
Query: 271 DGDSWHDALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQ---IKHV 316
+G +W L+ + + ++ YP+VIQA DG++H+TY+Y + IKHV
Sbjct: 384 EGATWKWKRHLDGSPSGARTSQYHYPSVIQARDGAIHVTYSYFTPEGKAIKHV 436
>gi|406830180|ref|ZP_11089774.1| hypothetical protein SpalD1_01032 [Schlesneria paludicola DSM
18645]
Length = 344
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 139/313 (44%), Gaps = 27/313 (8%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
H ++I E+ G +AY+GG E D ++ + G+ W P + P+ N V
Sbjct: 31 HPASIGELANGDLYIAYYGGEGEYKGDTAVFGSRREKGKTSWSLPTTIADTPDRADGNGV 90
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKPILL 142
+++ P LFY + + + W+ + K S++ G TW++ E L +++PI L
Sbjct: 91 IWQEPGGATWLFY-VCRYGKTWADSVIKYKYSHDNGHTWTDSEMLTFERGMMVRSQPIQL 149
Query: 143 ENGLLLC------GSSVESWNSWGSWMEVTV-DAGRSWRKYGPIYIPNESLSVIQPVPFH 195
+G L G+ ++ + S + + W ++ + IQP
Sbjct: 150 FDGDFLVPIYQEKGTDAKAIGAQSSSLFARYHKKTKEWSFTNKVH---SRIGNIQPSVVQ 206
Query: 196 TANRTLRVLMRSFNGI------GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
L R GI G + +ES DGG TW+ + T NPNS +D +KLK+G
Sbjct: 207 LDQNKLIAFCRR-GGIYGPLLDGFIVRTESRDGGKTWALGEDTSFPNPNSAVDLIKLKNG 265
Query: 250 RLLLAYNT---VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
L++ YN R L + +S D G +W A + E YP +IQ +DG ++ Y
Sbjct: 266 HLVMIYNNSNQAKRNPLTMRVSTDQGQTWPAARDIVNTPDDEQGYPFIIQTADGRINGVY 325
Query: 307 T-YNRTQIKHVVL 318
T RT + H VL
Sbjct: 326 TSQKRTVVNHFVL 338
>gi|417627604|ref|ZP_12277851.1| BNR/Asp-box repeat family protein [Escherichia coli STEC_MHI813]
gi|345377908|gb|EGX09839.1| BNR/Asp-box repeat family protein [Escherichia coli STEC_MHI813]
Length = 389
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGTQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLAALFRS-RWADNIYISHSADHGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|312132035|ref|YP_003999375.1| neuraminidase (sialidase)-like protein [Leadbetterella byssophila
DSM 17132]
gi|311908581|gb|ADQ19022.1| neuraminidase (sialidase)-like protein [Leadbetterella byssophila
DSM 17132]
Length = 371
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 149/349 (42%), Gaps = 63/349 (18%)
Query: 13 EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGRWQSPI 71
E FP + HAS+IVE+ G LVA+F GS E A DVKI W +P
Sbjct: 19 EALIFPLQG---QHVHASSIVELPNGDLLVAWFQGSGERTADDVKIMGARRTGKSWSAPF 75
Query: 72 IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM-----KRSYNKGVTWS---- 122
+ + P +P NPVLF L L + I + +W + + + W
Sbjct: 76 VLADTPFLPDCNPVLFLNHKKQLHLVW-IAVQANRWEQSILKILKSKDHEPNWVWQDNIL 134
Query: 123 ---------EREQ----LPP------------------GILGPSKN--------KPILLE 143
E EQ LPP P K KP+LL
Sbjct: 135 LKPAEEFAVEVEQKLKSLPPLNHGWAEYAPKYDDLIITAAQDPVKRSTGWMTRIKPLLLP 194
Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
+ +L + +N S M ++ D G +W P+ IQP + ++
Sbjct: 195 DQKILLPLYSDGFNF--SMMALSEDDGETWLPSLPLV----GRGPIQPALVLKKDGSIAA 248
Query: 204 LMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SR 260
+R S + +V SES DGG TWS ++ ++ S ++ + L DGR ++ N SR
Sbjct: 249 FLRDSGDSPNQVHYSESKDGGYTWSASRKIKIPTAAS-VEALHLPDGRWIMVLNDQENSR 307
Query: 261 GVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
L + +S+D+G +W LEE FSYP++I +SDG +H+TY+Y+
Sbjct: 308 HRLSLYVSQDEGINWERKKILEETARGSFSYPSLIPSSDGHLHLTYSYH 356
>gi|117622881|ref|YP_851794.1| hypothetical protein APECO1_1391 [Escherichia coli APEC O1]
gi|218557602|ref|YP_002390515.1| Glycosyl hydrolase, BNR repeat [Escherichia coli S88]
gi|237707359|ref|ZP_04537840.1| hypothetical protein ESAG_00540 [Escherichia sp. 3_2_53FAA]
gi|386598409|ref|YP_006099915.1| hypothetical protein ECOK1_0683 [Escherichia coli IHE3034]
gi|417083139|ref|ZP_11951275.1| hypothetical protein i01_00877 [Escherichia coli cloneA_i1]
gi|419945350|ref|ZP_14461796.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli HM605]
gi|422359085|ref|ZP_16439734.1| BNR/Asp-box repeat protein [Escherichia coli MS 110-3]
gi|422748303|ref|ZP_16802216.1| hypothetical protein ERKG_00531 [Escherichia coli H252]
gi|422753293|ref|ZP_16807120.1| hypothetical protein ERLG_00415 [Escherichia coli H263]
gi|422839155|ref|ZP_16887127.1| hypothetical protein ESPG_01813 [Escherichia coli H397]
gi|432356976|ref|ZP_19600223.1| hypothetical protein WCC_00931 [Escherichia coli KTE4]
gi|432361448|ref|ZP_19604633.1| hypothetical protein WCE_00469 [Escherichia coli KTE5]
gi|432440010|ref|ZP_19682365.1| hypothetical protein A13O_00836 [Escherichia coli KTE189]
gi|432445123|ref|ZP_19687431.1| hypothetical protein A13S_01157 [Escherichia coli KTE191]
gi|432586930|ref|ZP_19823301.1| hypothetical protein A1SO_01285 [Escherichia coli KTE58]
gi|432596572|ref|ZP_19832855.1| hypothetical protein A1SW_01279 [Escherichia coli KTE62]
gi|432753429|ref|ZP_19987996.1| hypothetical protein WEA_00408 [Escherichia coli KTE22]
gi|432777568|ref|ZP_20011819.1| hypothetical protein A1SQ_01231 [Escherichia coli KTE59]
gi|432786357|ref|ZP_20020523.1| hypothetical protein A1U3_00490 [Escherichia coli KTE65]
gi|432819949|ref|ZP_20053663.1| hypothetical protein A1Y5_01555 [Escherichia coli KTE118]
gi|432826163|ref|ZP_20059819.1| hypothetical protein A1YA_02873 [Escherichia coli KTE123]
gi|433004157|ref|ZP_20192595.1| hypothetical protein A17S_01712 [Escherichia coli KTE227]
gi|433011366|ref|ZP_20199771.1| hypothetical protein A17W_04129 [Escherichia coli KTE229]
gi|433012863|ref|ZP_20201241.1| hypothetical protein WI5_00690 [Escherichia coli KTE104]
gi|433022488|ref|ZP_20210503.1| hypothetical protein WI9_00655 [Escherichia coli KTE106]
gi|433152781|ref|ZP_20337748.1| hypothetical protein WKS_00709 [Escherichia coli KTE176]
gi|433162482|ref|ZP_20347241.1| hypothetical protein WKW_00685 [Escherichia coli KTE179]
gi|433167491|ref|ZP_20352159.1| hypothetical protein WKY_00750 [Escherichia coli KTE180]
gi|433323367|ref|ZP_20400716.1| hypothetical protein B185_007808 [Escherichia coli J96]
gi|115512005|gb|ABJ00080.1| conserved hypothetical protein [Escherichia coli APEC O1]
gi|218364371|emb|CAR02048.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli S88]
gi|226898569|gb|EEH84828.1| hypothetical protein ESAG_00540 [Escherichia sp. 3_2_53FAA]
gi|294489432|gb|ADE88188.1| conserved hypothetical protein [Escherichia coli IHE3034]
gi|315287119|gb|EFU46533.1| BNR/Asp-box repeat protein [Escherichia coli MS 110-3]
gi|323952758|gb|EGB48626.1| hypothetical protein ERKG_00531 [Escherichia coli H252]
gi|323958425|gb|EGB54131.1| hypothetical protein ERLG_00415 [Escherichia coli H263]
gi|355353007|gb|EHG02180.1| hypothetical protein i01_00877 [Escherichia coli cloneA_i1]
gi|371611176|gb|EHN99702.1| hypothetical protein ESPG_01813 [Escherichia coli H397]
gi|388415905|gb|EIL75813.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli HM605]
gi|430879786|gb|ELC03117.1| hypothetical protein WCC_00931 [Escherichia coli KTE4]
gi|430889966|gb|ELC12613.1| hypothetical protein WCE_00469 [Escherichia coli KTE5]
gi|430969242|gb|ELC86392.1| hypothetical protein A13O_00836 [Escherichia coli KTE189]
gi|430975845|gb|ELC92729.1| hypothetical protein A13S_01157 [Escherichia coli KTE191]
gi|431123402|gb|ELE26140.1| hypothetical protein A1SO_01285 [Escherichia coli KTE58]
gi|431133475|gb|ELE35465.1| hypothetical protein A1SW_01279 [Escherichia coli KTE62]
gi|431305039|gb|ELF93555.1| hypothetical protein WEA_00408 [Escherichia coli KTE22]
gi|431330168|gb|ELG17450.1| hypothetical protein A1SQ_01231 [Escherichia coli KTE59]
gi|431341486|gb|ELG28493.1| hypothetical protein A1U3_00490 [Escherichia coli KTE65]
gi|431370951|gb|ELG56744.1| hypothetical protein A1Y5_01555 [Escherichia coli KTE118]
gi|431374359|gb|ELG59951.1| hypothetical protein A1YA_02873 [Escherichia coli KTE123]
gi|431517478|gb|ELH95000.1| hypothetical protein A17S_01712 [Escherichia coli KTE227]
gi|431519578|gb|ELH97030.1| hypothetical protein A17W_04129 [Escherichia coli KTE229]
gi|431535134|gb|ELI11517.1| hypothetical protein WI5_00690 [Escherichia coli KTE104]
gi|431540007|gb|ELI15639.1| hypothetical protein WI9_00655 [Escherichia coli KTE106]
gi|431678162|gb|ELJ44170.1| hypothetical protein WKS_00709 [Escherichia coli KTE176]
gi|431691152|gb|ELJ56612.1| hypothetical protein WKW_00685 [Escherichia coli KTE179]
gi|431693491|gb|ELJ58904.1| hypothetical protein WKY_00750 [Escherichia coli KTE180]
gi|432348070|gb|ELL42522.1| hypothetical protein B185_007808 [Escherichia coli J96]
Length = 389
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|432405505|ref|ZP_19648227.1| hypothetical protein WEO_00688 [Escherichia coli KTE28]
gi|430932422|gb|ELC52845.1| hypothetical protein WEO_00688 [Escherichia coli KTE28]
Length = 389
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|342883941|gb|EGU84341.1| hypothetical protein FOXB_05140 [Fusarium oxysporum Fo5176]
Length = 380
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 157/372 (42%), Gaps = 77/372 (20%)
Query: 16 TFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIA 73
T P SA +S HAS ++++ L +FGGS EG PD+ IWL + G W +P
Sbjct: 7 TIPVPSATVQS-HASNLLQLPDKTVLCTWFGGSQEGLPDISIWLSRLEPGSSSWTTPQKI 65
Query: 74 DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPG 130
+ N NPVLF+ P +G + Q+ G +KR S ++G TWSE +L P
Sbjct: 66 SFDENRSCQNPVLFRAPHSGEIWLLHTSQDAGNQDGAYILKRTSSDQGHTWSEASRLLPD 125
Query: 131 ILGPSKNKPILLENG-----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPI 179
+ G +PI++ NG L + E + W ++ + D G +W+ K P
Sbjct: 126 VTGIFIRQPIVI-NGDGTWILPVFYCRTEPGHRWIGSDDISGVLYSDDNGATWKEKQAPD 184
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+ + +++I PV + + + RS + + G+ W K T L NPNS
Sbjct: 185 SVGSVHMNIIPPV---SGSSSWVAFYRSR--WADNVYRSTSNNGIDWETPKATTLPNPNS 239
Query: 240 GIDGVKLKDGRLLLAYNTVS--------------------------------------RG 261
GI +L GR+ + +N + R
Sbjct: 240 GICAARLSSGRIAIVFNRSNASADTLKRQGLYDDITPEDDKRPNQVTKTGKDAIWGTPRK 299
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGS---VHITY 306
L + +S+D+G +W + + LE+ E SYP++ DG H+ Y
Sbjct: 300 TLTLGVSEDEGLTWRERV-LEDGDGFCGTNSSSGQENRELSYPSIYIEKDGDRDVAHVAY 358
Query: 307 TYNRTQIKHVVL 318
T++R IK+V +
Sbjct: 359 TWHRQHIKYVRI 370
>gi|271502008|ref|YP_003335034.1| hypothetical protein Dd586_3498 [Dickeya dadantii Ech586]
gi|270345563|gb|ACZ78328.1| conserved hypothetical protein [Dickeya dadantii Ech586]
Length = 394
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 150/357 (42%), Gaps = 74/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQSPIIADEEPNVPMWNPV 85
HAS ++ + G L +FGG+ EG D+ I+L + G+W + ++P NPV
Sbjct: 39 HASNLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVNGSGQWTPAVKLSDDPTRSEQNPV 98
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + + ++S ++G TW E L PG +PI
Sbjct: 99 LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSQDQGYTWGEIGVLLEQPGTF---IRQPIT 155
Query: 141 LLENG-----LLLCG-SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C E W N S ++++ D G++WR+ +PN + V +
Sbjct: 156 VLPNGDWLLPVFYCRIQPGEKWVGNDDISAVKISSDRGQTWRES---VVPNSTGCVHMNI 212
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + S S DGG TWS + T L N NS I L +G L
Sbjct: 213 TL-LKDGTLLALFRS-RWADFIYRSHSTDGGETWSEPEATDLPNNNSSIQVTTLDNGHLA 270
Query: 253 LAYNTVS---------------------------------------RGVLKVALSKDDGD 273
L +N ++ R + +A+S+D G
Sbjct: 271 LVFNAMNADGATERRLSLYDEIEDEEESDAKMPEIATGRSAFWGAPRAPMTLAISEDGGK 330
Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W LE + EFSYP++ Q DG +HI +TY R IK+V +
Sbjct: 331 TWPWQRNLEVGDGYCMTNNSTDKRNREFSYPSIKQGPDGKLHIAFTYFRQAIKYVCV 387
>gi|158423600|ref|YP_001524892.1| glycosyl hydrolase [Azorhizobium caulinodans ORS 571]
gi|158330489|dbj|BAF87974.1| putative glycosyl hydrolase [Azorhizobium caulinodans ORS 571]
Length = 403
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 147/371 (39%), Gaps = 79/371 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ I + G +FGG+ EG PD+ ++ G RW P ++
Sbjct: 36 SPCVQNHAANIAVLGNGDLATVWFGGTQEGIPDISVYFSRLAKGSDRWTVPQKLSDDGTR 95
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL---PPGILGP 134
NP+LF PS L L + + + +++ S + GVTW E L P G
Sbjct: 96 SEQNPILFPTPSGDLWLIWTAQISGNQDTSMVRKRVSRDHGVTWGPIETLFEAPEGFGLF 155
Query: 135 SKNKPILLENG-----LLLCGSSVESWNSWG----SWMEVTVDAGRSWRKYGPIYIPNES 185
+ P++L++G + C SV G S + V+ D GR+W ++ +P+ S
Sbjct: 156 VRQPPVVLDDGAWVIPVFHC-VSVPGAKWVGDRDISAVRVSADEGRTWTEH---VVPH-S 210
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
L + T L + + + +S S DG + WS + T L N NS I
Sbjct: 211 LGCVHMNVLKTKEGLLALYRSRWADF--IYLSRSADGRI-WSAPEATVLPNNNSSIQATV 267
Query: 246 LKDGRLLLAYNTVS-------------------------------------------RGV 262
L DGR+ L +N S R
Sbjct: 268 LADGRIALVFNNSSNKDATDRRTGLYDDIEDDSAPVASASPAVSPEPTGRTAFWGAPRAP 327
Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
L +ALS D G SW LE + L EFSYP++ Q DG + I +TY R
Sbjct: 328 LTLALSADGGRSWPVVRNLEVGDGYCMTNNSKDGLNREFSYPSITQTPDGMLQIAFTYYR 387
Query: 311 TQIKHVVLLPN 321
IKHV + P+
Sbjct: 388 RAIKHVRVAPD 398
>gi|449138248|ref|ZP_21773539.1| putative secreted protein [Rhodopirellula europaea 6C]
gi|448883184|gb|EMB13726.1| putative secreted protein [Rhodopirellula europaea 6C]
Length = 188
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 84/160 (52%), Gaps = 11/160 (6%)
Query: 5 CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
G LV+ + F + P K HASTIVE G + A+F G+ E PDV I + ++
Sbjct: 28 AGDGALVSSQQIFDLANKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRNEN 86
Query: 65 GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
G+W P+ + P WNPVLF+ L+LFYK+G ++W G + S + G
Sbjct: 87 GQWTEPVEVVNGVQSSTLRYPTWNPVLFQPTEGPLMLFYKVGPNPREWWGMLTTSEDGGK 146
Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE 154
TW+ +L +LGP KNKPI L +G +LC SS E
Sbjct: 147 TWTWPTKLGEAHTIGHLLGPVKNKPIQLADGTILCPSSTE 186
>gi|91209716|ref|YP_539702.1| hypothetical protein UTI89_C0678 [Escherichia coli UTI89]
gi|386605414|ref|YP_006111714.1| Putative Glycosyl hydrolase, BNR repeat protein [Escherichia coli
UM146]
gi|432572625|ref|ZP_19809116.1| hypothetical protein A1SI_01312 [Escherichia coli KTE55]
gi|91071290|gb|ABE06171.1| hypothetical protein UTI89_C0678 [Escherichia coli UTI89]
gi|307627898|gb|ADN72202.1| Putative Glycosyl hydrolase, BNR repeat protein [Escherichia coli
UM146]
gi|431110963|gb|ELE14880.1| hypothetical protein A1SI_01312 [Escherichia coli KTE55]
Length = 389
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPVTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|317139154|ref|XP_001817301.2| glycosyl hydrolase [Aspergillus oryzae RIB40]
Length = 387
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 149/353 (42%), Gaps = 69/353 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HAS ++ + G L A+FGGS EG PD+ I+L + G W I + NPV
Sbjct: 35 HASNLLLLPNGDVLCAWFGGSMEGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPV 94
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLE 143
LF+ P+ L L Y + S +KR +K G+TW + E L P + I+L+
Sbjct: 95 LFRTPTGDLWLLYTSQHAGNQDSAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLD 154
Query: 144 NG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
+G + + VE W S + V+ D G++W + IP ES +
Sbjct: 155 DGAWVIPVFKCRVEPGERWLGNNDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQR 210
Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
+ + L RS + ++ S D GL+WS +PT L NPN+GI L GR+++ Y
Sbjct: 211 LKDCSYLGLFRS-RWADHIYLATSPD-GLSWSPPQPTILPNPNAGICFDVLPSGRVVVVY 268
Query: 256 NTVS---------------------------------------RGVLKVALSKDDGDSWH 276
N S R L VA S D G +W
Sbjct: 269 NHSSKLDATGRRQGLYDDIADGVDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWE 328
Query: 277 DAL-----------TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+ E+ L E SYP+++ +G +HI YT+ R +IK+V L
Sbjct: 329 RRVLEDGDGYCMTNNSEKKLNRELSYPSMV-LDEGKIHIAYTFWRQRIKYVQL 380
>gi|445228817|ref|ZP_21404854.1| hypothetical protein SEE10_016225, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. SE10]
gi|444865989|gb|ELX90745.1| hypothetical protein SEE10_016225, partial [Salmonella enterica
subsp. enterica serovar Enteritidis str. SE10]
Length = 189
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/167 (34%), Positives = 83/167 (49%), Gaps = 4/167 (2%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
+ GP KNK +L NG + S+ES W ++++ + D G+ W
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHW 170
>gi|440229205|ref|YP_007342998.1| putative neuraminidase (sialidase) [Serratia marcescens FGI94]
gi|440050910|gb|AGB80813.1| putative neuraminidase (sialidase) [Serratia marcescens FGI94]
Length = 398
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 150/359 (41%), Gaps = 75/359 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +F G+ EG D+ I+ G +W + ++ + NPV
Sbjct: 36 HAANLLPLPNGDLLCVWFAGTQEGIADISIYSSRLPQGQQQWTPAVKLSDDESRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + + ++S + G +W E L PG + ++
Sbjct: 96 LFLDPDNVLWLLWTAQKSGNQDTAFVRYRQSRDLGASWGEIATLLEQPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+NG + C ++ E W N S ++++ D GRSW + +P+ + V +
Sbjct: 154 LDNGDWLLPVFYCRTAPGEKWVGNHDDSAVKISSDRGRSWHD---VAVPDSTGCVHMNIT 210
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L L RS + S S DGG +WS +PT L N NS I L++G L L
Sbjct: 211 L-LQDGSLLALYRS-RWADFIYQSRSVDGGRSWSAPQPTALPNNNSSIQLTTLRNGDLAL 268
Query: 254 AYNTVS------------------------------------------RGVLKVALSKDD 271
+N +S R + +A+S+D
Sbjct: 269 VFNAMSASGALERRTSLYDEIEDEDDDGGAEAAIPAPGDGRSAFWGAPRAPMTLAISRDG 328
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G SW L+ E L EFSYP + Q DG +HI YTY R IK+V +
Sbjct: 329 GRSWPYQRNLDEGDGYCMTNNSREKLNREFSYPTIKQGDDGRLHIAYTYFRQAIKYVCV 387
>gi|438121586|ref|ZP_20872173.1| hypothetical protein SEEP9120_01128, partial [Salmonella enterica
subsp. enterica serovar Pullorum str. ATCC 9120]
gi|434943228|gb|ELL49380.1| hypothetical protein SEEP9120_01128, partial [Salmonella enterica
subsp. enterica serovar Pullorum str. ATCC 9120]
Length = 174
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/167 (34%), Positives = 83/167 (49%), Gaps = 4/167 (2%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W +
Sbjct: 5 LVNRQVILPESGTKSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 65 PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123
Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
+ GP KNK +L NG + S+ES W ++++ + D G+ W
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHW 170
>gi|397658130|ref|YP_006498832.1| hypothetical protein A225_3121 [Klebsiella oxytoca E718]
gi|394346470|gb|AFN32591.1| hypothetical protein A225_3121 [Klebsiella oxytoca E718]
Length = 390
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + ++ + NPV
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSEAVKLSDDASRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQVSGNQDTAIVRYRQSTDLGQTWGAIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 150 VLNNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+N TL L RS + S+S D G +WS +PT+L N NS I L G L
Sbjct: 206 ITALSNGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLTSGELA 264
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N +S R + VA+S D G +
Sbjct: 265 LVFNHMSAAGALERRASLYDEIDDGDNSRKEPEVISGRSAFWGAPRAPMTVAISADGGKT 324
Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
W L+E EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382
>gi|331645830|ref|ZP_08346933.1| hypothetical protein ECIG_00213 [Escherichia coli M605]
gi|386618128|ref|YP_006137708.1| hypothetical protein ECNA114_0612 [Escherichia coli NA114]
gi|387828664|ref|YP_003348601.1| hypothetical protein ECSF_0611 [Escherichia coli SE15]
gi|417661213|ref|ZP_12310794.1| expressed protein [Escherichia coli AA86]
gi|432420758|ref|ZP_19663314.1| hypothetical protein A137_01174 [Escherichia coli KTE178]
gi|432498894|ref|ZP_19740671.1| hypothetical protein A177_00990 [Escherichia coli KTE216]
gi|432557664|ref|ZP_19794354.1| hypothetical protein A1S7_01313 [Escherichia coli KTE49]
gi|432693434|ref|ZP_19928646.1| hypothetical protein A31I_00900 [Escherichia coli KTE162]
gi|432709480|ref|ZP_19944547.1| hypothetical protein WCG_02799 [Escherichia coli KTE6]
gi|432917826|ref|ZP_20122309.1| hypothetical protein A133_01213 [Escherichia coli KTE173]
gi|432925132|ref|ZP_20127223.1| hypothetical protein A135_01261 [Escherichia coli KTE175]
gi|432980152|ref|ZP_20168931.1| hypothetical protein A15W_01271 [Escherichia coli KTE211]
gi|433095576|ref|ZP_20281788.1| hypothetical protein WK3_00778 [Escherichia coli KTE139]
gi|433104785|ref|ZP_20290805.1| hypothetical protein WK7_00661 [Escherichia coli KTE148]
gi|281177821|dbj|BAI54151.1| conserved hypothetical protein [Escherichia coli SE15]
gi|330910431|gb|EGH38941.1| expressed protein [Escherichia coli AA86]
gi|331044582|gb|EGI16709.1| hypothetical protein ECIG_00213 [Escherichia coli M605]
gi|333968629|gb|AEG35434.1| Hypothetical protein ECNA114_0612 [Escherichia coli NA114]
gi|430947230|gb|ELC66937.1| hypothetical protein A137_01174 [Escherichia coli KTE178]
gi|431031795|gb|ELD44528.1| hypothetical protein A177_00990 [Escherichia coli KTE216]
gi|431094099|gb|ELD99749.1| hypothetical protein A1S7_01313 [Escherichia coli KTE49]
gi|431236671|gb|ELF31876.1| hypothetical protein A31I_00900 [Escherichia coli KTE162]
gi|431251649|gb|ELF45656.1| hypothetical protein WCG_02799 [Escherichia coli KTE6]
gi|431446700|gb|ELH27444.1| hypothetical protein A133_01213 [Escherichia coli KTE173]
gi|431448604|gb|ELH29319.1| hypothetical protein A135_01261 [Escherichia coli KTE175]
gi|431494147|gb|ELH73737.1| hypothetical protein A15W_01271 [Escherichia coli KTE211]
gi|431619090|gb|ELI88015.1| hypothetical protein WK3_00778 [Escherichia coli KTE139]
gi|431634084|gb|ELJ02340.1| hypothetical protein WK7_00661 [Escherichia coli KTE148]
Length = 389
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 147/357 (41%), Gaps = 72/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAAKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
L YN +S R + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324
Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+ E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381
>gi|423114519|ref|ZP_17102210.1| hypothetical protein HMPREF9689_02267 [Klebsiella oxytoca 10-5245]
gi|376384368|gb|EHS97091.1| hypothetical protein HMPREF9689_02267 [Klebsiella oxytoca 10-5245]
Length = 390
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + ++ + NPV
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSDAVKLSDDASRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
TL L RS + S+S D G +WS +PT+L N NS I L G L
Sbjct: 206 ITALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N +S R + VA+S D G +
Sbjct: 265 LVFNHMSAAGARERRASLYDEIDDGDNSRKEPEVTSGHSAFWGAPRAPMTVAISADGGKT 324
Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
W L+E EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382
>gi|423108495|ref|ZP_17096190.1| hypothetical protein HMPREF9687_01741 [Klebsiella oxytoca 10-5243]
gi|376384900|gb|EHS97622.1| hypothetical protein HMPREF9687_01741 [Klebsiella oxytoca 10-5243]
Length = 390
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + ++ + NPV
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSDAVKLSDDASRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + + ++S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
TL L RS + S+S D G +WS +PT+L N NS I L G L
Sbjct: 206 ITALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N +S R + VA+S D G +
Sbjct: 265 LVFNHMSAAGARERRASLYDEIDDGDNSRKEPEVTSGRSAFWGAPRAPMTVAISADGGKT 324
Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
W L+E EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382
>gi|160943611|ref|ZP_02090843.1| hypothetical protein FAEPRAM212_01103 [Faecalibacterium prausnitzii
M21/2]
gi|158445066|gb|EDP22069.1| BNR/Asp-box repeat protein [Faecalibacterium prausnitzii M21/2]
gi|295104439|emb|CBL01983.1| Predicted neuraminidase (sialidase) [Faecalibacterium prausnitzii
SL3/3]
Length = 386
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 149/336 (44%), Gaps = 55/336 (16%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDG-RWQSPIIADEEPNVPMWN 83
+ HA +VE+ G L +F G+ EG+ DV I DG +W P+ +P N
Sbjct: 35 TAHAPAMVELPNGDLLCCWFAGTYEGSADVHIICSVLPHDGTKWLEPVNISGDPTRSEQN 94
Query: 84 PVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGPS 135
P LF P N + Y G++ +++ ++ +S + G TW + + P G
Sbjct: 95 PSLFYGPDNAVWAMYTAQLDRVEGKDNMQFTAVVRCQKSTDGGKTWGDYTTVFPE-EGTF 153
Query: 136 KNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---- 185
+PI +L NG LC S E + + ++ D G++WR + +P +
Sbjct: 154 CRQPIQVLSNGRWIFANWLCTDSAEGLSGDPTAFRISDDEGKTWRM---VMMPGSNGHVH 210
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+VI+ P H L MR+ R+ SES D G TW+ PT L N NS I VK
Sbjct: 211 ANVIELEPGH-----LVAFMRNREAY-RIHRSESFDWGETWTVPAPTPLPNNNSSISAVK 264
Query: 246 LKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEE 283
L+ GR+ +AYN R + VALS+D G ++ + +E
Sbjct: 265 LQSGRIAIAYNPTCTPNPVPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGYMGDE 324
Query: 284 NLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
N ++ YP ++Q DG +H+ Y R IK+V
Sbjct: 325 NKTNNKQYEYPYLMQGRDGMLHLAYAARTRQGIKYV 360
>gi|159186481|ref|NP_396007.2| hypothetical protein Atu5072 [Agrobacterium fabrum str. C58]
gi|159141541|gb|AAK90448.2| conserved hypothetical protein [Agrobacterium fabrum str. C58]
Length = 396
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 140/367 (38%), Gaps = 73/367 (19%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPN 78
++P HA + D G + A+FGG+ EG D+ I+ G +W P +P+
Sbjct: 27 ASPMIQNHAPFLHLADDGALICAWFGGTLEGKSDISIFTSVLAAGSNKWGEPQRLSFDPD 86
Query: 79 VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG-------I 131
NPVLF P N LLLF+ Q C R S+ +L G
Sbjct: 87 HSEQNPVLFAAPGNTLLLFH-TSQPSGNQDECRIRMAEVSRDASDPTRLTAGEGLYLDLP 145
Query: 132 LGPSKNKPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIP 182
G P+ + + C + WN V + D G+SWR +
Sbjct: 146 RGCFVRAPLTVRADGAWLLPIFRCIQRPGQKWNGSHDRAAVGISEDCGKSWR----LEDI 201
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
++S + P + L + R V +ES DGG TWS + T + N NS I
Sbjct: 202 DQSTGCVHMSPLVIGDEKLAAVFRRRQA-DFVYRTESADGGRTWSAPQATDVPNNNSSIA 260
Query: 243 GVKLKDGRLLLAYNT-------------------------------------VSRGVLKV 265
++L DGR+ + N V R + V
Sbjct: 261 AIRLNDGRIAMICNPTNAAMSSDRRASLYDELGEDDDRPDANPDGGCVPIWGVPRAPVTV 320
Query: 266 ALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQI 313
+S+D S+ +T+E+ E SYP +++ +DGS+HI YTY R I
Sbjct: 321 CISEDGAKSFPQRITIEDGPGTCLSNNSIDGHNKEMSYPWLLEGADGSLHIAYTYYRRAI 380
Query: 314 KHVVLLP 320
K+V L P
Sbjct: 381 KYVRLAP 387
>gi|336250192|ref|YP_004593902.1| glycosyl hydrolase family protein [Enterobacter aerogenes KCTC
2190]
gi|334736248|gb|AEG98623.1| glycosyl hydrolase, BNR repeat protein [Enterobacter aerogenes KCTC
2190]
Length = 388
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 146/357 (40%), Gaps = 71/357 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGGS EG D+ +W G W + ++P NPV
Sbjct: 33 HAANLLPLPDGTLLCVWFGGSQEGKADISVWGARLAPGATTWSEAVKLSDDPQRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P L L + + + + ++S + G W L PG +PI
Sbjct: 93 LFQAPDGVLWLLWTAQHAGNQDTAIVRYRQSRDGGKHWGAIATLLDEPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ NG + C + E W N S ++++ D G+SWR + +P +SL +
Sbjct: 150 VMPNGNWLLPVFYCRTRPGEKWVGNDDVSAVKISADGGKSWRD---VAVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
A+ +L RS + S S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLADGSLVAFFRS-RWADHIWYSRSTDFGESWSAPIPTTLPNNNSSIQVTTLSNGDLA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSWH 276
L +N +S R + VA+S D G SW
Sbjct: 265 LVFNNMSAAGASERRASLYDEIEDDDGRKEPERTGKTAFWGAPRAPMTVAISADGGKSWP 324
Query: 277 DALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
L E+ L EFSYP++ Q DG++HI YT+ R IK++ + P
Sbjct: 325 WMRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGEDGNLHIAYTWYRQAIKYIRVSPQ 381
>gi|358373759|dbj|GAA90355.1| glycosyl hydrolase BNR repeat-containing protein [Aspergillus
kawachii IFO 4308]
Length = 406
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 106/359 (29%), Positives = 147/359 (40%), Gaps = 69/359 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HAS ++ + G L A+FGGS EG PD+ I+L + G W +
Sbjct: 27 APTVQSHASNLLRMPNGDLLCAWFGGSLEGKPDISIYLSRLRCGESEWTEASKMTHDHTR 86
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NPVLF P+ L L Y + + +K S + G TW + L +
Sbjct: 87 SEQNPVLFHTPTGNLWLLYTSQHAGDQDTAIVKYRVSPDSGTTWGGEKVLFHDTGTFIRQ 146
Query: 138 KPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
++L++G + C E W N S + V+ D GRSW + IP ES +
Sbjct: 147 PVVVLDDGAWVVPVFKCRVQPGERWLGNDDISCIRVSRDQGRSWTES---EIP-ESAGCV 202
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
+ T + RS V ++ S D GL WS +PT L NPN+GI L G
Sbjct: 203 HMEIQRLKDGTYLGMFRS-RWADNVYLATSPD-GLVWSTPQPTSLPNPNAGICFDVLPSG 260
Query: 250 RLLLAYNTVS---------------------------------------RGVLKVALSKD 270
R++L YN S R L VA S D
Sbjct: 261 RVVLVYNHSSKRDALGRRQGLYDDIADGVDERKNQAAAKDGRESFWGSPRAPLCVAWSDD 320
Query: 271 DGDSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G++W D L E L E SYP++I A DG++HI YT+ R +IK+V L
Sbjct: 321 KGNTWKHRILEEGDGYCLTNNSERKLNRELSYPSMICA-DGTIHIAYTFWRQRIKYVQL 378
>gi|421724415|ref|ZP_16163637.1| hypothetical protein KOXM_01978 [Klebsiella oxytoca M5al]
gi|410374785|gb|EKP29444.1| hypothetical protein KOXM_01978 [Klebsiella oxytoca M5al]
Length = 390
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 148/357 (41%), Gaps = 71/357 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W G +W + + + NPV
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSEAVKLSNDASRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + ++S + G TW + L PG + +
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGDIATLLDKPGTF--IRQPVTV 150
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L+NG + C + E W N S ++++ D G++WR + +P +SL +
Sbjct: 151 LDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADKGKTWRD---VEVP-QSLGCVHMNI 206
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
TL L RS + S+S D G +WS +PT+L N NS I L G L L
Sbjct: 207 TALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELAL 265
Query: 254 AYNTVS--------------------------------------RGVLKVALSKDDGDSW 275
+N +S R + VA+S D G +W
Sbjct: 266 VFNHMSAAGALERRASLYDEIDDGDNSRKEPEVTSGRSAFWGAPRAPMTVAISADGGKTW 325
Query: 276 HDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
L+E EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 326 PWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHITP 382
>gi|107023083|ref|YP_621410.1| glycosyl hydrolase [Burkholderia cenocepacia AU 1054]
gi|116686675|ref|YP_839922.1| BNR repeat-containing glycosyl hydrolase [Burkholderia cenocepacia
HI2424]
gi|105893272|gb|ABF76437.1| glycosyl hydrolase, BNR repeat protein [Burkholderia cenocepacia AU
1054]
gi|116652390|gb|ABK13029.1| glycosyl hydrolase, BNR repeat-containing protein [Burkholderia
cenocepacia HI2424]
Length = 395
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 151/356 (42%), Gaps = 76/356 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ + +D G L +FGG+ EG PDV I+L G W +P ++P NPV
Sbjct: 39 HAANLHCLDNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTDEWSAPRKLSDDPTRSEQNPV 98
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L Y + + ++R S + G+TW + L PG +PI+
Sbjct: 99 LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGLTWGPVDTLFDRPGTF---VRQPIV 155
Query: 142 -LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
L++G + LC + E W N S + V+ D GR+W P +P +S+ +
Sbjct: 156 ALDDGTWICPIFLCRTEPGERWVGNDDVSAVMVSDDQGRTW---TPHEVP-DSVGCVHMN 211
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS V S S D G TWS + T L N NS I L++G L
Sbjct: 212 VHVLHDGSLLALYRS-RWADFVHASRSTD-GRTWSAPERTDLPNNNSSIQFTVLQNGHLA 269
Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
L +N R + +A+S+D G
Sbjct: 270 LVFNESDASHGKERRASLYDDIDDSEDSGELRDQQASVRGTAFWGAPRAPMSLAISEDGG 329
Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+W LE + E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 330 RTWPRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQSRDGALHIAFTWFRQKIKYV 385
>gi|423123646|ref|ZP_17111325.1| hypothetical protein HMPREF9694_00337 [Klebsiella oxytoca 10-5250]
gi|376401727|gb|EHT14333.1| hypothetical protein HMPREF9694_00337 [Klebsiella oxytoca 10-5250]
Length = 390
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 150/360 (41%), Gaps = 77/360 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ ++ + G + +FGG+ EG D+ +W GR W + ++P+ NP+
Sbjct: 33 HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGRRQWCDAVKLSDDPSRSEQNPL 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P N L L + + + + ++S + G TW + L PG + +
Sbjct: 93 LFLDPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGDIATLLDKPGTF--IRQPMTV 150
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQ 190
L+NG + C + E W N S ++++ D G++WR + +P+ V I
Sbjct: 151 LDNGNWLLPVFYCRTRPGEKWVGNDDISAVKISADKGKTWRD---VEVPHSLGCVHMNIT 207
Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
P+P L L RS + S+S D G +WS +PT L N NS I L G
Sbjct: 208 PLP----GGVLVALFRS-RWADNIYYSQSVDNGESWSAPEPTVLPNNNSSIQVTTLASGE 262
Query: 251 LLLAYNTVS--------------------------------------RGVLKVALSKDDG 272
L L +N +S R + VALS D G
Sbjct: 263 LALVFNPMSAAGALERRASLYDEIDDGDSSRKEPEANGGRSAFWGAPRAPMTVALSADGG 322
Query: 273 DSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
+W L+E EFSYP++ Q +DG++HI YT+ R IK+V + P
Sbjct: 323 KTWPWRRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHITP 382
>gi|251788284|ref|YP_003003005.1| hypothetical protein Dd1591_0644 [Dickeya zeae Ech1591]
gi|247536905|gb|ACT05526.1| conserved hypothetical protein [Dickeya zeae Ech1591]
Length = 391
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 149/357 (41%), Gaps = 74/357 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I+L G +W + ++P NPV
Sbjct: 36 HAANLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVKGSEQWTPAVKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + + ++S ++G TW E L PG +PI
Sbjct: 96 LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSLDQGYTWGEIGVLLEQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCG-SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C E W N S ++++ D G++WR+ +PN + V +
Sbjct: 153 VLPNGDWLLPVFYCRIQPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + S S DGG +WS + T L N NS I L +G L
Sbjct: 210 TL-LKDGTLLALFRS-RWADFIYRSHSTDGGESWSEPEATTLPNNNSSIQVTTLDNGHLA 267
Query: 253 LAYNTVS---------------------------------------RGVLKVALSKDDGD 273
L +N ++ R + +A+S+D G
Sbjct: 268 LVFNAMNADGATERRLSLYDEIEDEEETDAKMPDISSGRSAFWGAPRAPMTLAISEDGGK 327
Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SW LE + EFSYP++ Q DG +H+ +TY R IK+V +
Sbjct: 328 SWPWQRNLEVGDGYCMTNNSTDKRNREFSYPSIKQGPDGKLHVAFTYFRQAIKYVCV 384
>gi|398377645|ref|ZP_10535819.1| putative neuraminidase (sialidase) [Rhizobium sp. AP16]
gi|397726508|gb|EJK86942.1| putative neuraminidase (sialidase) [Rhizobium sp. AP16]
Length = 395
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 150/370 (40%), Gaps = 84/370 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW +P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLSPGATRWSTPEKMSDDPAK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
NP++F P + L Y Q G + + S + G T+ E L + G
Sbjct: 95 SEQNPLIFNAPGGDVWLLYT-SQTSGNQDGSVVKCRISADGGKTFGEVRVL-CDLPGTFV 152
Query: 137 NKPILLENG------LLLC-GSSVESW--NSWGSWMEVTVDAGRSW------RKYGPIYI 181
+PI++ + + C G E W ++ + + ++ D G+SW G +++
Sbjct: 153 RQPIIVNSRGAWLLPVFRCVGLPGERWTGDADTACVLISRDQGKSWVMTAVPDSLGAVHM 212
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+L V F+ NR + V S S DGG +WS +P L N NS I
Sbjct: 213 NIVALEGDDLVAFYR-NRFAQ----------SVLASRSTDGGESWSAPQPMDLPNNNSSI 261
Query: 242 DGVKLKDGRLLLAYN---------------------------------------TVSRGV 262
VKLK G + + YN V R
Sbjct: 262 QAVKLKSGVIAIVYNHSNALMSDARRQSLYDEIEGGEAIDAAPLQTLERRKAVWGVPRAP 321
Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
L +A S D G S+ + L+ ++L EFSYP+++Q +DG VHI YTY R
Sbjct: 322 LSLAFSSDGGTSFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVQGADGVVHIAYTYYR 381
Query: 311 TQIKHVVLLP 320
IK+V L P
Sbjct: 382 RAIKYVRLSP 391
>gi|373852619|ref|ZP_09595419.1| hypothetical protein Opit5DRAFT_3473 [Opitutaceae bacterium TAV5]
gi|372474848|gb|EHP34858.1| hypothetical protein Opit5DRAFT_3473 [Opitutaceae bacterium TAV5]
Length = 535
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 148/332 (44%), Gaps = 46/332 (13%)
Query: 18 PANSAPFKSCHA---------STIVEVDKGHFLVAYFGGSCEGAP-DVKIWLQTFKDG-- 65
P P ++ HA S++ G V ++ G G + + L T DG
Sbjct: 31 PGYVGPPQAAHAVTNRAFQGISSMAVSPGGRLWVIWYAGVTPGEDRNNYVVLATSGDGGK 90
Query: 66 RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
W+ ++ D + P+ ++P ++ P L L+ W+ + TW+
Sbjct: 91 TWKEVLVVDPDGPGPVRAFDPEVWMGPDGRLRLY---------WAQAAGHDGSISGTWT- 140
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SW--------MEVTVDAGR 171
E P P+ + P + +G+++C S V S W +W M V+ D GR
Sbjct: 141 LETGEPESEAPAWSAPARISDGIMMCKSVVLSTGEWALPVSTWRKTDHSAKMLVSTDQGR 200
Query: 172 SWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
+W G ++P + S + + + +LR+L+R+ GIG S S D G+TW P
Sbjct: 201 TWSVRGACHVPEKDRSFDEHMLVERGDGSLRMLVRTRYGIGE---SISTDRGVTWPELVP 257
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSWHDALTLEENLA 286
+ + +P++ +L G LLL + R L +S DDG +W L L+E
Sbjct: 258 SAIPHPSARFFITRLHSGNLLLVKHGPMDQQTLRSHLTAFISTDDGRTWGGGLLLDERTG 317
Query: 287 MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+ SYP QA+DG+V+ITY ++RT +H++
Sbjct: 318 V--SYPDGQQAADGTVYITYDFSRTGTRHILF 347
>gi|408388656|gb|EKJ68335.1| hypothetical protein FPSE_11343 [Fusarium pseudograminearum CS3096]
Length = 380
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 149/359 (41%), Gaps = 74/359 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++++ L +FGGS EG PD+ IWL + G W +P + N NPV
Sbjct: 18 HASNLLQLPDKTVLCTWFGGSQEGLPDICIWLSRLESGSSSWSTPQKVSSDENRSCQNPV 77
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPGILGPSKNKPILL 142
LF+ P NG L Q+ G +KR S ++G TWSE +L P G +PI++
Sbjct: 78 LFRAPHNGELWLLHTSQDAGNQDGAYILKRISSDEGKTWSEASRLLPDATGIFIRQPIVV 137
Query: 143 ENG----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPIYIPNESLSVIQPV 192
+ L + E + W ++ + D G +W+ K P + ++++ PV
Sbjct: 138 NSEGAWILPVFYCRTEPGHRWIGSDDISGVFYSQDHGTTWKEKQAPDSVGAVHMNIVPPV 197
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ T + RS V S S + G+ W K T L NPNSGI +L G +
Sbjct: 198 ---SGQSTWVAVYRS-RWADNVYRSTSKN-GIDWEEPKATSLPNPNSGICAARLSSGHIA 252
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N S R L + +S+D+G +
Sbjct: 253 LVFNRSSASADTLKRKGLYDDITPEDDKRPNQVTKTGKDAIWGTPRKTLTLGVSQDEGLT 312
Query: 275 WHDALTLEENLAM------------EFSYPAVIQASDGSV---HITYTYNRTQIKHVVL 318
W + + LE+ E SYP++ G + H+ YT++R IK+V +
Sbjct: 313 WSERV-LEDGDGFCGTNSSSGQENRELSYPSIYIEKKGEMDVAHVAYTWHRQHIKYVQI 370
>gi|83765156|dbj|BAE55299.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391864605|gb|EIT73900.1| putative neuraminidase [Aspergillus oryzae 3.042]
Length = 464
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 146/350 (41%), Gaps = 69/350 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HAS ++ + G L A+FGGS EG PD+ I+L + G W I + NPV
Sbjct: 120 HASNLLLLPNGDVLCAWFGGSMEGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPV 179
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLE 143
LF+ P+ L L Y + S +KR +K G+TW + E L P + I+L+
Sbjct: 180 LFRTPTGDLWLLYTSQHAGNQDSAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLD 239
Query: 144 NG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
+G + + VE W S + V+ D G++W + IP ES +
Sbjct: 240 DGAWVIPVFKCRVEPGERWLGNNDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQR 295
Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
+ + L RS + ++ S D GL+WS +PT L NPN+GI L GR+++ Y
Sbjct: 296 LKDCSYLGLFRS-RWADHIYLATSPD-GLSWSPPQPTILPNPNAGICFDVLPSGRVVVVY 353
Query: 256 NTVS---------------------------------------RGVLKVALSKDDGDSWH 276
N S R L VA S D G +W
Sbjct: 354 NHSSKLDATGRRQGLYDDIADGVDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWE 413
Query: 277 DAL-----------TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
+ E+ L E SYP+++ +G +HI YT+ R +IK
Sbjct: 414 RRVLEDGDGYCMTNNSEKKLNRELSYPSMV-LDEGKIHIAYTFWRQRIKQ 462
>gi|391229608|ref|ZP_10265814.1| BNR/Asp-box repeat protein [Opitutaceae bacterium TAV1]
gi|391219269|gb|EIP97689.1| BNR/Asp-box repeat protein [Opitutaceae bacterium TAV1]
Length = 535
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 128/271 (47%), Gaps = 34/271 (12%)
Query: 67 WQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
W+ ++ D + P+ ++P ++ P L L+ W+ + TW+
Sbjct: 92 WKEVLVVDPDGPGPVRAFDPEVWMGPDGRLRLY---------WAQAAGHDGSISGTWT-L 141
Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SW--------MEVTVDAGRS 172
E P P+ + P + +G+++C V S W +W M V+ D GR+
Sbjct: 142 ETGEPESEAPAWSAPARISDGIMMCKPVVLSTGEWALPVSTWRKTDHSAKMLVSADGGRT 201
Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
W G ++P + S + + + +LR+L+R+ GIG S S D G+TW P+
Sbjct: 202 WSVRGACHVPEKDRSFDEHMLVERGDGSLRMLVRTRYGIGE---SVSTDRGVTWPELVPS 258
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNT-----VSRGVLKVALSKDDGDSWHDALTLEENLAM 287
+ +P++ +L+ G LLL ++ R L +S DDG +W L L+E +
Sbjct: 259 AIPHPSARFFITRLQSGNLLLVKHSPMDQQTLRSHLTAFISTDDGRTWGGGLLLDERTGV 318
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SYP QA+DG+V+ITY ++RT +H++
Sbjct: 319 --SYPDGQQAADGTVYITYDFSRTGTRHILF 347
>gi|94501941|ref|ZP_01308450.1| BNR/Asp-box repeat protein [Bermanella marisrubri]
gi|94425935|gb|EAT10934.1| BNR/Asp-box repeat protein [Oceanobacter sp. RED65]
Length = 388
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 139/319 (43%), Gaps = 34/319 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVP--- 80
S H++T +E+D G+ L ++GG+ EG DV ++ F + W + ++
Sbjct: 63 SVHSATAIELDNGNLLAMWYGGTREGHTDVSLYKAVFDQSKQAWIGHEKVLDRYDISQQL 122
Query: 81 ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP-- 129
+ NPVL K P+ ++LFY + + W+ M SY+ G +W ++L
Sbjct: 123 NRYIKKVGNPVLVKHPAGPVVLFY-VTVSLGGWATSQVNMAVSYDDGQSWHSSKRLVTSP 181
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
I KN ++ ++G + E + + + V + G KY I + +
Sbjct: 182 FINISTLVKNDAVIYQDGTIGITGYHELFGEFSEMIRVNLQ-GEVINKYR---ITDGDYT 237
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ V + + ++ S +V S++ DGG TWS + + NPNS + G K
Sbjct: 238 IQPTVIVQDPDHAVALIRDSSRHTEKVHYSKTSDGGKTWSDYEKLAVDNPNSAVYGFKDN 297
Query: 248 DGRLLLAYNTVSRGV------LKVALSKDDGDSWHDALTLEE-----NLAMEFSYPAVIQ 296
RL + +N +R V + +A+S D+G +W E NL + YP V
Sbjct: 298 KDRLWMLFNEATRQVEFPRNTMALAVSTDNGKTWTTKHYFENPGKDLNLNATYGYPWVTV 357
Query: 297 ASDGSVHITYTYNRTQIKH 315
+ G H+ YT +R I H
Sbjct: 358 TTSGDYHVFYTRDRESIVH 376
>gi|456062926|ref|YP_007501896.1| BNR/Asp-box repeat-containing protein [beta proteobacterium CB]
gi|455440223|gb|AGG33161.1| BNR/Asp-box repeat-containing protein [beta proteobacterium CB]
Length = 415
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 143/337 (42%), Gaps = 44/337 (13%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIAD 74
P AP S HA++++ + G +F GS EGAPDV I F + RW SP +
Sbjct: 76 LPDTGAP--SVHAASLIALKDGAVRAFWFAGSREGAPDVVINTAVFDPRASRWGSPTVVM 133
Query: 75 EEPNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN---KGVTWS 122
+ + + NPV ++ + LF+ + + W+G S +G+TW
Sbjct: 134 DRVSAEKGLSRYIAKLGNPVPARMADGRMQLFF-VTVSIGGWAGSSISSMISDDEGLTWD 192
Query: 123 EREQLPPGILGPSKNKPILLE-------NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
++L I P N L++ +G L + E +G ++ + R+
Sbjct: 193 RPQRL---ISSPLINLSTLVKSPAVSFTDGRLGLPAYHEWIGRFGEFLRIEASQVIDKRR 249
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMR---SFNGIGRVCMSESCDGGLTWSYAKPT 232
+ IQPV F + R S + ++ +SE+ D G +W K
Sbjct: 250 M------SSGRGAIQPVLFADGPQEASAFFRQTRSSSQAKQIPVSETKDAGQSWVVTKDL 303
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDS--WH------DALTLEEN 284
++ NPNS + + L G L+ N + G ++ + + +S W D +L N
Sbjct: 304 EIPNPNSAVAALTLASGARLMVLNNIEAGRYRLVMVMRESNSSQWQVVQLIEDDESLVNN 363
Query: 285 LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
EFSYP +I A+ H+ YT+NR +IKHV N
Sbjct: 364 QHREFSYPYLISANGEDAHLVYTWNRQKIKHVYFPAN 400
>gi|46109856|ref|XP_381986.1| hypothetical protein FG01810.1 [Gibberella zeae PH-1]
Length = 380
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 74/359 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++++ L +FGGS EG PD+ IWL + G W +P + N NPV
Sbjct: 18 HASNLLQLPDKTVLCTWFGGSQEGLPDICIWLSRLEPGSTSWSTPQKVSSDENRSCQNPV 77
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPGILGPSKNKPILL 142
LF+ P NG L Q+ G +KR S ++G TWSE +L P G +PI++
Sbjct: 78 LFRAPHNGDLWLLHTSQDAGNQDGAYILKRTSSDEGKTWSEASRLLPNATGIFIRQPIVV 137
Query: 143 ENG----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPIYIPNESLSVIQPV 192
+ L + E + W ++ + D G +W+ K P + ++++ P
Sbjct: 138 NSEGAWILPVFYCRTEPGHRWIGSDDISGVFYSQDNGATWKEKQAPDSVGAVHMNIVPPA 197
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+++T V + V S S + G+ W K T L NPNSGI +L G +
Sbjct: 198 ----SDQTTWVAVYRSRWADNVYRSTSKN-GIDWEEPKATSLPNPNSGICAARLSSGHIA 252
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N S R L + +S+D+G +
Sbjct: 253 LVFNRSSASADTLKRKGLYDDITPEDDKRPNQVTKTGKDAIWGTPRKTLTLGVSQDEGLT 312
Query: 275 WHDALTLEENLAM------------EFSYPAVIQASDGSV---HITYTYNRTQIKHVVL 318
W + + LE+ E SYP++ G H+ YT++R IK+V +
Sbjct: 313 WSERV-LEDGDGFCGTNSSSGQENRELSYPSIYIERKGETDVAHVAYTWHRQHIKYVQI 370
>gi|266622136|ref|ZP_06115071.1| BNR/Asp-box repeat protein [Clostridium hathewayi DSM 13479]
gi|288866158|gb|EFC98456.1| BNR/Asp-box repeat protein [Clostridium hathewayi DSM 13479]
Length = 389
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 148/335 (44%), Gaps = 47/335 (14%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMW 82
K+ HA ++E+ G L +F G+ EG+ DV I G W+ P ++P
Sbjct: 33 KTAHAPALLELSDGSMLCVWFTGTYEGSADVSIVCSKLPAGAKSWKRPEPVSKDPERSEQ 92
Query: 83 NPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGP 134
NP LF P + Y G++ +++ ++ +S + G TW E E L G
Sbjct: 93 NPSLFTGPEGDVWAVYTAQLDRMPGKDNMQYTSVIRCQKSVDGGETWGEAETLF-SRQGS 151
Query: 135 SKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
+PI +LENG LC S+ + +++ D G++WR + +P+ + V
Sbjct: 152 FCRQPIQILENGRWIFANWLCSDSLTGLAGDPTVFQLSDDQGKTWRM---VRMPDSNGRV 208
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
V R L MRS + SES D G W+ T L N NS I ++LK
Sbjct: 209 HANVVELGGGR-LAAFMRS-RSADFIYRSESQDWGENWTVPMATSLPNNNSSISAIRLKS 266
Query: 249 GRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE--------ENLA 286
GR+ +AYN R + VALS+D+G +W +E EN
Sbjct: 267 GRIGIAYNPTHAANPIAGTAAWPGLRCPVAVALSEDEGLTWPVIRYMERGEGYAGPENRT 326
Query: 287 --MEFSYPAVIQASDGSVHITYT-YNRTQIKHVVL 318
++ YP ++Q +DG +H+ + +R+ IK++
Sbjct: 327 NNRQYEYPYLMQGTDGRLHLAFAGKDRSCIKYMCF 361
>gi|444351569|ref|YP_007387713.1| expressed protein [Enterobacter aerogenes EA1509E]
gi|443902399|emb|CCG30173.1| expressed protein [Enterobacter aerogenes EA1509E]
Length = 388
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 144/357 (40%), Gaps = 71/357 (19%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGGS EG D+ +W G W ++P NPV
Sbjct: 33 HAANLLPLPDGTLLCVWFGGSQEGKADISVWGARLAPGAASWSEATKLSDDPQRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF+ P L L + + + + ++S + G W L PG +PI
Sbjct: 93 LFQAPDGVLWLLWTAQHAGNQDTAIVRYRQSRDGGKHWGAIATLLDEPGTF---IRQPIS 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
++ NG + C + E W N S ++++ D G+SWR + +P +SL +
Sbjct: 150 VMPNGNWLLPVFYCRTRPGEKWVGNDDVSAVKISADGGKSWRD---VAVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
A+ +L RS + S S D G +WS PT L N NS I L +G L
Sbjct: 206 ITPLADGSLAAFFRS-RWADHIWYSRSTDFGESWSAPIPTTLPNNNSSIQVTTLSNGDLA 264
Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSWH 276
L +N +S R + VA+S D G SW
Sbjct: 265 LVFNNMSAAGASERRASLYDEIEDDDGRKEPERTGKTAFWGAPRAPMTVAISADGGKSWP 324
Query: 277 DALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
L E+ L EFSYP++ Q DG++ I YT+ R IK+V + P
Sbjct: 325 WMRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGEDGNLQIAYTWYRQAIKYVRVSPQ 381
>gi|320103153|ref|YP_004178744.1| neuraminidase (sialidase)-like protein [Isosphaera pallida ATCC
43644]
gi|319750435|gb|ADV62195.1| neuraminidase (sialidase)-like protein [Isosphaera pallida ATCC
43644]
Length = 401
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 150/364 (41%), Gaps = 74/364 (20%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADE 75
AN A + + HAS +VE LVA++ GS E A DV+I K G+ W +
Sbjct: 40 ANDAVYHN-HASCVVECPNSDLLVAWYRGSGERDADDVEILAARLKKGQTTWGPVFPLID 98
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
P P NP L + +G ++ +W G + + ++ + P G P
Sbjct: 99 TPGYPDCNPALL-VGDDGSFWVFRPTILDHRWEGAL-------LKFARADACPEGDAAPV 150
Query: 136 KNKPILL-----------ENGLLLCGSSVESWNSWGSWMEVTVDAGRS---WRKYG---- 177
N+ +L + L + + G +++ VDA R +++ G
Sbjct: 151 WNREGVLHLTPVGLSEAIQASLDRLPEEIRAHPRLGGYLD-EVDARRDDELYQRLGWMPR 209
Query: 178 -------------PIYIPNESLSV----------------------IQPVPFHTANRTLR 202
P+Y S+S+ IQP ++ +L
Sbjct: 210 CRPVILSTGRWILPLYTDTYSVSLMAYSDDRGATWNTGRMIPGIFAIQPAVVERSDGSLV 269
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--VSR 260
R+ R+ + S DGG+TWS A T L NP +G+D ++L G+ +L YN + R
Sbjct: 270 AYCRNAGAGRRILSATSNDGGVTWSEAIETTLPNPGAGVDAIRLASGKFVLVYNDTEIGR 329
Query: 261 GVLKVALSKDDGDSWHDALTLEENLAME--FSYPAVIQASDGSVHITYTYNRTQ----IK 314
L VALS D+G +W+ LE + F YP+++Q DG +H+ YT + I+
Sbjct: 330 HSLAVALSNDEGRTWNPPRFLERQPPDQGAFHYPSILQTRDGQIHVLYTAGKAHRGETIR 389
Query: 315 HVVL 318
H V
Sbjct: 390 HAVF 393
>gi|452985679|gb|EME85435.1| hypothetical protein MYCFIDRAFT_131178 [Pseudocercospora fijiensis
CIRAD86]
Length = 393
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 150/387 (38%), Gaps = 90/387 (23%)
Query: 13 EEFTFPANSAPFK-----------SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-- 59
+ FT S PF CHAS ++ +D G L A+FGG+ EG PD+ I+L
Sbjct: 12 KNFTLTTKSHPFPHSECFIPSTTPQCHASNLLILDNGDMLCAWFGGTQEGKPDISIYLSR 71
Query: 60 QTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNK 117
+T W + NPVLF+ PS L L Y + S +K+ S++
Sbjct: 72 KTPASTSWSQASRVTFDTTRSEQNPVLFQTPSGELWLLYTSQNLGDQDSALVKKQSSHDN 131
Query: 118 GVTWSEREQLPPGILGPS-----KNKPILLENGL------LLCGSSVES-W--NSWGSWM 163
G++WS P IL P + I+L++GL C S + W N S +
Sbjct: 132 GISWST-----PTILFPEPGTFIRQPVIILDDGLTWVIPTFKCRSEPGTRWVGNDDISAL 186
Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
++ D G++W + +P+ S + +++ RS + G
Sbjct: 187 RISQDEGKTWMEK---EVPH-SFGAVHMAIRQLPDKSYLAFYRS--RWADFIHKSTSRNG 240
Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------------ 259
+ WS PT L NPN+GI L GR+L+ YN S
Sbjct: 241 IDWSEPIPTPLPNPNAGICFDVLPSGRILVVYNHSSRVNARGRREGLYDDIAAADDLRRN 300
Query: 260 ---------------RGVLKVALSKDDGDSWH-------DALTLEENLAM----EFSYPA 293
R L V S D G W D L N M E SYP+
Sbjct: 301 QGSKHAGKEAFWGAPRAPLCVGWSDDGGMEWKHRVLEEGDGFCLTNNSEMKLNRELSYPS 360
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
++ DG V++ +TY R IK+V P
Sbjct: 361 IVVGKDGGVNVAFTYWRQGIKYVRFDP 387
>gi|241258892|ref|YP_002978776.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240863362|gb|ACS61025.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 395
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ GRW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSGRWSVPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQL--PPGILGP 134
NP++FK P + L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFKAPDGNVWLLYT-SQTSGNQDGSVVKFRVSGDGGQTFGPVQILCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V + D G SW+ IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGASWQMRD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + + F + S S DGG TWS +P +L N NS I
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYN---------------------------------------TVSRGVLKV 265
L DG + + YN V R L +
Sbjct: 265 ILNDGAIAMVYNHSNAGMSDARRQSLYDEIEGGDAGETTVVADTSARKAVWGVPRAPLSL 324
Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
A+S+D G S+ + L+ E+L EFSYP+++Q DG++HI YTY R I
Sbjct: 325 AISRDGGKSFPHRIDLDTGDGFCLSNNSKESLNREFSYPSIVQGGDGTLHIAYTYYRRAI 384
Query: 314 KHVVLLP 320
K+V L P
Sbjct: 385 KYVRLAP 391
>gi|326799313|ref|YP_004317132.1| neuraminidase (sialidase)-like protein [Sphingobacterium sp. 21]
gi|326550077|gb|ADZ78462.1| neuraminidase (sialidase)-like protein [Sphingobacterium sp. 21]
Length = 391
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 161/375 (42%), Gaps = 74/375 (19%)
Query: 11 VAEE-FTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIW--LQTFKDGR 66
VAE FP + K H S++ + G L A+F GS E GA DV I + +
Sbjct: 24 VAEATLIFPLQA---KHVHGSSLAVLPNGDVLAAWFYGSGERGADDVCIMGSRKRAESKT 80
Query: 67 WQSPIIADEEPNVPMWNPVLFKLPSNGL---------------LLFYKIGQEVQK----- 106
W P + + +P NPVLF L +L Y++ E +
Sbjct: 81 WSKPFLMADTKGIPDCNPVLFLNHKKQLYLAWIAVQANRWEQSILRYRVADEYEGSAAPN 140
Query: 107 --WSGCMKRSYNKGVTWSEREQL--------------PP-------GILGPSKN------ 137
W + + +E+L PP P+K
Sbjct: 141 WGWQDNILLKPDSSFAQELKEKLSMLPDQQHGWAAYAPPYDEMIVTNAQDPAKRSFGWMT 200
Query: 138 --KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
KP+LL + +L + +N S + V+ D G+SWR PI IQP
Sbjct: 201 RIKPLLLADNKILLPLYSDGFNC--SLIAVSEDDGKSWRPSKPI----AGRGNIQPALIR 254
Query: 196 TANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA 254
+ MR S + R+ S S DGG TW+ A+ T++ N S ++ ++L DGR LL
Sbjct: 255 RKQGDIIAYMRDSGDAPARIQQSISMDGGETWTVARKTEIPNTAS-VEVIRLLDGRWLLV 313
Query: 255 YNTVSRG--VLKVALSKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTY-- 308
N V G L + LS+D+G SW + LE + + SYP+++Q++DG +H++Y+Y
Sbjct: 314 GNDVDDGRYQLSLYLSEDEGTSWKLSTLLEADDKKKLSCSYPSIVQSTDGFIHLSYSYHV 373
Query: 309 --NRTQIKHVVLLPN 321
++ IK++ + P+
Sbjct: 374 TSDKKSIKYLKISPD 388
>gi|346972642|gb|EGY16094.1| glycosyl hydrolase [Verticillium dahliae VdLs.17]
Length = 381
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 147/358 (41%), Gaps = 72/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVPMWNPV 85
HAS ++++ L A+FGGS EG PD+ IWL QT W P + NPV
Sbjct: 19 HASNLLQLPDKTLLCAWFGGSQEGLPDIGIWLSRQTPGSNEWSPPQKISSDTGRSCQNPV 78
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
LF+ P G + + Q+ G + + S + GVTWS+ + G +P ++
Sbjct: 79 LFRAPRTGHIWLFHTSQDAGNQDGALVMVRTSSDHGVTWSDPRYPFKDLKGVFVRQPLVV 138
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPV 192
L++G + LC S+ + W N S + + D G +W K P I ++++ PV
Sbjct: 139 LKDGTWVLPVFLCRSAAGQRWIGNDDISAVLFSRDDGVTWEEKRVPGGIGAVHMNIVPPV 198
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
RS +V S S D G+ W+ +P L NPNSGI +L G +
Sbjct: 199 ---RGQDQYVAFFRS-RWADKVYRSTSTD-GIDWTSPEPNSLPNPNSGICAARLASGSIA 253
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
+ +N + R L V +S D G +
Sbjct: 254 IVFNDSAAVTNMARREGLYDDITPEDDKRPNQVSVNGKSAIWGTPRKALTVGISSDQGIT 313
Query: 275 WH-------DALTL----EENLAMEFSYPAVIQASDGS---VHITYTYNRTQIKHVVL 318
W D + E+ + E SYP++ D S VHI +T++R IK+V +
Sbjct: 314 WTTRVLEDGDGFCMTNNSEKKMNRELSYPSIWAEQDVSHAPVHIAFTFHRQHIKYVRI 371
>gi|343083102|ref|YP_004772397.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351636|gb|AEL24166.1| hypothetical protein Cycma_0387 [Cyclobacterium marinum DSM 745]
Length = 389
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 143/303 (47%), Gaps = 33/303 (10%)
Query: 29 ASTIVEVDKGHFLVAYFGGSCEGAPD------VKIWLQTFKDGRWQSPIIADEEPN--VP 80
+S ++ DKG F ++ G +PD V + T K W+ ++ D + V
Sbjct: 61 SSLAIQEDKGEFWAVWYAGI---SPDEDENNYVVLSSSTDKGETWKETLVIDPDGGGMVR 117
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQK-------WSGCMKRSYNKGVTWSEREQLPPGILG 133
++P ++ PS + +F+ Q V K W+ + S + G +WS +L G++
Sbjct: 118 AFDPEVWVDPSGKVWVFW--AQAVDKDAKIGGVWAIIGETSVDGGTSWSAPRRLTDGVM- 174
Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
KP++L NG + +S + M V+ D G++W G +++P E S +
Sbjct: 175 --MCKPLILNNGDWVLPASTWRETDHSAKMVVSKDQGKTWEVVGGVHVPKEFRSFDEHHF 232
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ L L+R+ GIG S S D G TWS + + + +P++ +L+ G LLL
Sbjct: 233 VEKEDGKLWALLRTKYGIGE---SYSSDSGKTWSALEESFIAHPSARFFIRRLQSGNLLL 289
Query: 254 AYN-----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
+ R + +SKDDG +W L L++ + + SYP Q +DG+++ITY
Sbjct: 290 VKHGPISVKTGRSHMMAFISKDDGATWSKGLLLDQRINV--SYPDGQQTADGTIYITYDR 347
Query: 309 NRT 311
+RT
Sbjct: 348 SRT 350
>gi|134101090|ref|YP_001106751.1| BNR repeat-containing glycosyl hydrolase [Saccharopolyspora
erythraea NRRL 2338]
gi|291004090|ref|ZP_06562063.1| BNR repeat-containing glycosyl hydrolase [Saccharopolyspora
erythraea NRRL 2338]
gi|133913713|emb|CAM03826.1| glycosyl hydrolase, BNR repeat [Saccharopolyspora erythraea NRRL
2338]
Length = 394
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 140/365 (38%), Gaps = 70/365 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ ++ + G +F G+ EG D+ +W G W P+ ++
Sbjct: 27 APAVQSHAANLMPLPDGDLGCVWFSGTQEGVADIGVWFSRLAAGGDTWSEPVRLSDDQTR 86
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGILGPS-- 135
NPVLF P L L + + + ++ S + G TW L
Sbjct: 87 SEQNPVLFPAPGGELWLLHTAQHAGDQDTAEVRVRTSGDSGRTWGPVRTLVAATEAGGVF 146
Query: 136 -KNKPILLENGLLLCGSSVESWNSWGSW--------MEVTVDAGRSWRKYGPIYIPNESL 186
+ ++L +G L + +G W + V+ D G +W +P+ +
Sbjct: 147 VRQPVVVLRSGRWLLPVFHCARPEFGKWVGDDDTSALLVSDDRGATWHHE---EVPDSTG 203
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
V V + +L L RS + S S DGG +WS PT L N NS I +L
Sbjct: 204 CVHMNV-VELDDGSLLALFRS-RWADAIHESRSHDGGRSWSAPVPTALPNNNSSIQCTRL 261
Query: 247 KDGRLLLAYNTVS--------------------------------------RGVLKVALS 268
+DGRL L +N S R L +ALS
Sbjct: 262 RDGRLALVFNASSAADATERRTSLYDEIDDNGITGSPSTQGGPSRAFWGAPRAPLTLALS 321
Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
D G +W +E + L E SYP+V Q +DG +HI +TY+R +KHV
Sbjct: 322 SDGGRTWPVRRDVETGDGYCMTNNSRDGLNRELSYPSVAQTADGVLHIAFTYHRRAVKHV 381
Query: 317 VLLPN 321
+ P+
Sbjct: 382 RIDPD 386
>gi|325003240|ref|ZP_08124352.1| BNR repeat-containing glycosyl hydrolase [Pseudonocardia sp. P1]
Length = 402
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 144/375 (38%), Gaps = 88/375 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HA+ ++E+ G +FGG+ EG PD+ + + G RW P+ ++P
Sbjct: 35 APDVQNHAANLMELPGGALGCVWFGGTQEGVPDIGVRYSRLEPGSDRWTGPVALSDDPER 94
Query: 80 PMWNPVLFKLPSNGLLLFY--KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS-- 135
NP+LF PS + L + ++ ++ S + G TW R L P GP
Sbjct: 95 SEQNPLLFVAPSGTVWLLWTAQVAGNQDTAEVRVRTSGDGGRTWDARRTLFPA--GPGGG 152
Query: 136 ---KNKPILLENGLLL-----CGSSVESWNSWGSWME--------VTVDAGRSWRKYGPI 179
+ ++ +G L C + E G W+ V+ D G +WR+
Sbjct: 153 VFVRQPVVVTRSGRWLLPVWHCVTPPE-----GRWVGDLDTSAVMVSDDDGATWRE---T 204
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+P+ + V V + +L L RS V S S D G TWS +P +L N NS
Sbjct: 205 AVPDSTGQVHMNV-VARPDGSLVALYRSRRADA-VHRSVSTDDGGTWSAPEPVELPNNNS 262
Query: 240 GIDGVKLKDGRLLLAYNT------------------------------------------ 257
+ V L DGRL L N
Sbjct: 263 SVQVVGLADGRLALVGNPSSRADATARRASLYDEISDSGDVGTAGGAAAADALREGAFWG 322
Query: 258 VSRGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHIT 305
SR + +A+S D G +W LE EFSYP + A+DG + I
Sbjct: 323 ASRAPMTLAISDDGGRTWPVRRDLETGDGFCLVNDSRGKRNREFSYPTIRAAADGGLEIA 382
Query: 306 YTYNRTQIKHVVLLP 320
YT R I+HV L P
Sbjct: 383 YTVYRQAIRHVHLTP 397
>gi|302405557|ref|XP_003000615.1| glycosyl hydrolase [Verticillium albo-atrum VaMs.102]
gi|261360572|gb|EEY23000.1| glycosyl hydrolase [Verticillium albo-atrum VaMs.102]
Length = 381
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 148/358 (41%), Gaps = 72/358 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVPMWNPV 85
HAS ++++ L A+FGGS EG PD+ IWL QT W P + NPV
Sbjct: 19 HASNLLQLPNKTLLCAWFGGSQEGLPDISIWLSRQTPGSNEWSPPQKISSDTGRSCQNPV 78
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
LF+ P G + + Q+ G + + S + GVTWSE + G +P ++
Sbjct: 79 LFRAPRTGHIWLFHTSQDAGNQDGALVMVRTSSDHGVTWSEPSYPFKDLKGVFVRQPLVV 138
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPV 192
L++G + LC S+ + W N S + + D G +W K P I ++++
Sbjct: 139 LKDGTWVLPVFLCRSAAGQRWIGNDDISAVLFSRDDGVTWEEKRVPGGIGAVHMNIVP-- 196
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
PF ++ + + +V S S D G+ W+ +P L NPNSGI +L G +
Sbjct: 197 PFRGQDQYVAFFRSRW--ADKVYRSTSTD-GIDWASPEPISLPNPNSGICAARLPSGSIA 253
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
+ +N + R L V +S D G +
Sbjct: 254 IVFNKSAAVPNMTRREGLYDDITPEGDKRPNQVSVNGRSAIWGTPRKALTVGISSDQGIT 313
Query: 275 WH-------DALTL----EENLAMEFSYPAVIQASDGS---VHITYTYNRTQIKHVVL 318
W D + E+ E SYP++ D S VHI +T++R IK+V +
Sbjct: 314 WKTRVLEDGDGFCMTNNSEKKANRELSYPSIWTEQDVSHAPVHIAFTFHRQHIKYVRI 371
>gi|170735438|ref|YP_001774552.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
[Burkholderia cenocepacia MC0-3]
gi|169821476|gb|ACA96057.1| glycosyl hydrolase BNR repeat-containing protein [Burkholderia
cenocepacia MC0-3]
Length = 395
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 76/356 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ + +D G L +FGG+ EG PDV I+L G W +P ++P NPV
Sbjct: 39 HAANLHCLDNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTDEWSAPRKLSDDPTRSEQNPV 98
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L Y + + ++R S + G+TW + L PG +PI+
Sbjct: 99 LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGLTWGPVDTLFDRPGTF---VRQPIV 155
Query: 142 -LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
L++G + LC + E W N S + V+ D GR+W + +P S+ +
Sbjct: 156 ALDDGTWICPIFLCRTEPGERWVGNDDVSAVMVSDDQGRTWTLH---EVP-ASVGCVHMN 211
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ +L L RS V S S D G TWS + T L N NS I L++G L
Sbjct: 212 VHVLHDGSLLALYRS-RWADFVHASRSTD-GRTWSAPERTDLPNNNSSIQFTVLQNGHLA 269
Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
L +N R + +A+S+D G
Sbjct: 270 LVFNESDASHGKERRASLYDDIDDSEDSGELRDQQASVRGTAFWGAPRAPMSLAISEDGG 329
Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+W LE + E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 330 RTWPRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQSRDGALHIAFTWFRQKIKYV 385
>gi|338209583|ref|YP_004653630.1| neuraminidase (sialidase)-like protein [Runella slithyformis DSM
19594]
gi|336303396|gb|AEI46498.1| neuraminidase (sialidase)-like protein [Runella slithyformis DSM
19594]
Length = 393
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 156/375 (41%), Gaps = 75/375 (20%)
Query: 10 LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR-- 66
+V FP K H S++V + G FL +F GS E A DVKI K G
Sbjct: 24 IVRSTLIFPGQE---KHVHGSSLVNLPNGDFLSVWFYGSGERTADDVKIMGARLKKGETV 80
Query: 67 WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKGV-- 119
W P + P++P NPVLF L G L I + +W + R Y K
Sbjct: 81 WSEPFEMADTPHLPDCNPVLF-LNQQGKLFLVWIAVQANQWEQSILRVKTSTDYTKSGPP 139
Query: 120 --TWSEREQLPPG-------------------------------ILGPSKN--------- 137
W + L P I+ +K+
Sbjct: 140 VWNWQDNILLKPDERFANEVAEKFKKLPENTAGWAGYARKYDDMIIEAAKDAPKRSIGWM 199
Query: 138 ---KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPF 194
KP+L+ENG ++ + +N S M ++ D G +W+ P+ IQP
Sbjct: 200 TRIKPMLMENGRIVLPLYSDGYNF--SLMAISDDFGTTWKPSLPLV----GRGPIQPALA 253
Query: 195 HTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
N L +R S + RV SES D G +W+ + T + N + ++ + L+DG
Sbjct: 254 KRKNGNLVAYLRDSGDEPTRVHYSESSDKGESWTASVKTDIPN-TASVELLTLQDGTWAF 312
Query: 254 AYNTVSRGVLKVAL--SKDDGDSWHDALTLEENLAME--FSYPAVIQASDGSVHITYTYN 309
N + G +V+L S D+G +W+ LE++ + +SYP +IQ +DG +HITY+ +
Sbjct: 313 LGNDIDDGRYRVSLYLSDDEGKTWNRKTRLEDHPRDKGGYSYPCLIQTADGLLHITYSAH 372
Query: 310 ----RTQIKHVVLLP 320
IK+VV+ P
Sbjct: 373 ADNKNKSIKYVVVDP 387
>gi|398793175|ref|ZP_10553649.1| putative neuraminidase (sialidase) [Pantoea sp. YR343]
gi|398211242|gb|EJM97863.1| putative neuraminidase (sialidase) [Pantoea sp. YR343]
Length = 395
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 146/358 (40%), Gaps = 76/358 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++ + G L +FGG+ EG D+ +W + G +W ++P NPV
Sbjct: 36 HASNLLPLPNGDLLCVWFGGTQEGIADISVWCSRLESGSTQWSEAQKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L + + + + ++ +S + G W L PG + ++
Sbjct: 96 LFLDPEQVLWLLWTAQKSGNQDTAIVRYRQSRDFGRNWGAINTLLDQPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L NG + C + W N S ++++ D G+SWR + +P ESL +
Sbjct: 154 LPNGNWLLPVFYCRTQPGVKWVGNDDISAVKISSDKGQSWRD---VAVP-ESLGCVHMNI 209
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L L RS + S S DGG +WS T+L N NS I L++G L L
Sbjct: 210 TLLQDGSLLALYRS-RWADFIYQSRSTDGGESWSAPLATELPNNNSSIQVTTLRNGHLAL 268
Query: 254 AYNTVS-------------------------------------------RGVLKVALSKD 270
+N +S R + +A+S D
Sbjct: 269 VFNAMSAKDASERRLSLYDEIEDEEDDGDVAVAAEPVVHSGRTAFWGAPRAPMTLAISAD 328
Query: 271 DGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G SW L+E L EFSYP++ Q+ DG++H+ YTY R IK+V
Sbjct: 329 GGKSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSDDGALHVAYTYFRQAIKYV 386
>gi|242238139|ref|YP_002986320.1| hypothetical protein Dd703_0687 [Dickeya dadantii Ech703]
gi|242130196|gb|ACS84498.1| conserved hypothetical protein [Dickeya dadantii Ech703]
Length = 393
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 149/359 (41%), Gaps = 76/359 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ I++ G +W E+P NPV
Sbjct: 36 HAANLLHLPNGDLLCVWFGGTQEGVADISIYMSRLPQGSQQWTPAEKLSEDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P L L Y + + + ++ +S + G TW + L PG +PI
Sbjct: 96 LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSTDLGHTWGDIGVLLEQPGTF---IRQPIT 152
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L NG + C + E W N S ++++ D G++WR+ +PN + V +
Sbjct: 153 VLPNGNWLLPVFYCRTRPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ TL L RS + +S S DGG +WS T L N NS I L++G L
Sbjct: 210 TL-LQDGTLLALYRS-RWADAIYLSRSTDGGESWSDPVATDLPNNNSSIQVTTLRNGHLA 267
Query: 253 LAYNTVS-----------------------------------------RGVLKVALSKDD 271
L +N ++ R + VA+S+D
Sbjct: 268 LVFNHMNADGATERRVSLYDEIEDEEDEGQNAVMPEITSERSAFWGAPRAPMTVAISEDG 327
Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G SW +E E L EFSYP++ Q D +H+ +TY R IK+V +
Sbjct: 328 GRSWPWQRNVEIGDGYCMTNNSTEKLNREFSYPSIKQGPDDKLHMAFTYFRQAIKYVCV 386
>gi|432494493|ref|ZP_19736311.1| hypothetical protein A173_01658 [Escherichia coli KTE214]
gi|431028009|gb|ELD41054.1| hypothetical protein A173_01658 [Escherichia coli KTE214]
Length = 344
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 141/344 (40%), Gaps = 72/344 (20%)
Query: 41 LVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY 98
+ +F G+ EG D+ +W G +W + + NPVLF P N L L +
Sbjct: 1 MCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPVLFLAPDNVLWLLW 60
Query: 99 KIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI-LLENG-----LLL 148
+ + + ++S + G TW E L PG +PI +L+NG +
Sbjct: 61 TAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPITVLDNGNWLLPVFY 117
Query: 149 CGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
C + E W N S ++++ D G SWR + +P +SL + N TL L
Sbjct: 118 CRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMNITMLHNGTLVALF 173
Query: 206 RSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------ 259
RS + +S S D G +WS + T+L N NS I L G L L YN +S
Sbjct: 174 RS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELALVYNAMSAAGAVE 232
Query: 260 -------------------------------RGVLKVALSKDDGDSWHDALTLE------ 282
R + VA+S D G SW L+
Sbjct: 233 RRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYC 292
Query: 283 ------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
E L EFSYP++ Q+ DG++HI YT+ R IK+V + P
Sbjct: 293 MTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 336
>gi|150376129|ref|YP_001312725.1| hypothetical protein Smed_3984 [Sinorhizobium medicae WSM419]
gi|150030676|gb|ABR62792.1| conserved hypothetical protein [Sinorhizobium medicae WSM419]
Length = 398
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 87/373 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + ++ G + A+FGG+ EG D+ I+ G RW P + +
Sbjct: 28 SPTVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAERWGEPQRLSFDADH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + ER+ P L ++ +
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPAGNQDECRIR-----MAAIERDPDDPTKLDAAEGRY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + ++ D G WR
Sbjct: 142 LDLPRGCFIRAPLVVRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGISTDGGERWR---- 197
Query: 179 IYIPNESLSVIQ--PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
+ +S+ + PVP R + + RV ES DGG TWS PT + N
Sbjct: 198 LADLEDSIGCVHMSPVPVGGDGYAAFFRRRQADQVYRV---ESADGGRTWSLPAPTDVPN 254
Query: 237 PNSGIDGVKLKDGRLLLAYNTVS------------------------------------- 259
NS I ++L DGRL + N V+
Sbjct: 255 NNSSIAVIRLSDGRLAMICNPVNAALSPDRRASLYDELGEDDGRPDADPSGGCVPIWGVP 314
Query: 260 RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYT 307
R + V LS D G ++ + + +E + E SYP +++ DGS+HI YT
Sbjct: 315 RAPVSVCLSDDGGRTFPERIVIENGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHIAYT 374
Query: 308 YNRTQIKHVVLLP 320
++R IK+V L P
Sbjct: 375 FHRRAIKYVRLPP 387
>gi|116249553|ref|YP_765391.1| hypothetical protein pRL90099 [Rhizobium leguminosarum bv. viciae
3841]
gi|115254201|emb|CAK03814.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 395
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLTPGSDRWSVPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++FK P + L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFKAPDGKVWLLY-TSQTSGNQDGSVVKCRISGDGGQTFGPVQILCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V V D G SW+ IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLVSRDGGASWQMRD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + + F + S S DGG TWS +P +L N NS I
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
L DG + + YN V R L +
Sbjct: 265 ILNDGAIAMVYNHSNAAMSDARRQSLYDEIEGGEAEETAVIADSSARKAVWGVPRAPLSL 324
Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
A+S+D G S+ + L+ ++L EFSYP+++Q DG++HI YTY R I
Sbjct: 325 AISRDGGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIVQGGDGTLHIAYTYYRRAI 384
Query: 314 KHVVLLP 320
K+V L P
Sbjct: 385 KYVRLAP 391
>gi|317046612|ref|YP_004114260.1| hypothetical protein Pat9b_0378 [Pantoea sp. At-9b]
gi|316948229|gb|ADU67704.1| conserved hypothetical protein [Pantoea sp. At-9b]
Length = 394
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 147/357 (41%), Gaps = 75/357 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ +W G +W ++P NPV
Sbjct: 36 HAANLLPLPNGDVLCVWFGGTQEGIADISVWSSRLAKGSNQWSEAEKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L + + + + ++ +S + G +WS + L PG + ++
Sbjct: 96 LFLDPQQVLWLLWTAQKSGNQDTAIVRYRQSRDFGRSWSAIDTLLDQPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L NG + C + W N S ++++ D G+SWR + +P +SL +
Sbjct: 154 LPNGNWLLPVFYCRTQPGVKWVGNDDVSAVKISSDQGKSWRD---VAVP-DSLGCVHMNI 209
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L L RS + S S DGG +WS + T L N NS I L +G L L
Sbjct: 210 TLLQDGSLLALYRS-RWADFIYQSRSRDGGESWSAPQATALPNNNSSIQVTTLHNGHLAL 268
Query: 254 AYNTVS------------------------------------------RGVLKVALSKDD 271
+N +S R + +A+S D
Sbjct: 269 VFNAMSAKDASERRLSLYDEIEDEEEGDVAVAAEPVVHSGRTAFWGAPRAPMTLAISADG 328
Query: 272 GDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
G SW L+E L EFSYP++ Q+ DG++HI YTY R IK+V
Sbjct: 329 GQSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSEDGALHIAYTYFRQAIKYV 385
>gi|418298483|ref|ZP_12910321.1| hypothetical protein ATCR1_13188 [Agrobacterium tumefaciens
CCNWGS0286]
gi|355536396|gb|EHH05669.1| hypothetical protein ATCR1_13188 [Agrobacterium tumefaciens
CCNWGS0286]
Length = 398
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 139/371 (37%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HAS + +D G L A+FGG+ EG D+ I+ G W P +P
Sbjct: 28 SPMIQNHASFLHLLDDGTLLCAWFGGTLEGKSDISIFASVLTPGSTHWGPPQRLSHDPAH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P +G+L + Q C R + R+ P L + +
Sbjct: 88 SEQNPVLFTAP-DGVLWLFHTAQPSGNQDECRIR-----MARVTRDPTAPEKLASEEGRF 141
Query: 140 ILLENGLLL-------------------CGSSVESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + + WN + + ++ D G +W+
Sbjct: 142 LDLPQGCFIRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDNGATWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ +S + P N R V +ES DGG +WS +PT + N N
Sbjct: 198 LQELQKSTGCVHMSPVAVTNGHYAAFFRRRQA-DFVYRTESADGGRSWSKPQPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I ++L DGRL + N ++ R
Sbjct: 257 SSIAVIRLGDGRLAMICNPINAAQSSDRRASLYDELGEEDDRPDADPSGGCVPVWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYN 309
+ + LS DDG S+ + +E+ E SYP +++A DG++H +YTY+
Sbjct: 317 PVSICLSNDDGRSFATRILIEDGPGTCLSNDSTDGRNQEMSYPWLLEAPDGTLHASYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLAP 387
>gi|424877428|ref|ZP_18301074.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392521575|gb|EIW46302.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 395
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/367 (24%), Positives = 146/367 (39%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGAERWSVPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P + L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFNAPDGKIWLLYT-SQTSGNQDGSVVKCRISDDGGQTFGPVQILCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V + D G +W+ + IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGANWQMHD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + + F + S S DGG TWS +PT+L N NS I
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGATWSAPEPTKLPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
L DG + + YN V R L +
Sbjct: 265 ILNDGAIAMVYNHSNAGTSDARRQSLYDEIEAGEAGETAVVADNSTRKAVWGVPRAPLSL 324
Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
A+S+D G S+ + L+ ++L EFSYP+++Q DG++HI YTY R I
Sbjct: 325 AISRDGGKSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIVQGGDGALHIAYTYYRRAI 384
Query: 314 KHVVLLP 320
K++ L P
Sbjct: 385 KYISLAP 391
>gi|407918096|gb|EKG11384.1| hypothetical protein MPH_11545 [Macrophomina phaseolina MS6]
Length = 388
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 151/353 (42%), Gaps = 72/353 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++ +D G L ++FGGS EG D+ I+L K G +W++P+ ++P+ NPV
Sbjct: 35 HASNLLLLDDGELLCSWFGGSQEGCSDISIYLSRLKPGSQKWEAPLQVSDDPSRSEQNPV 94
Query: 86 LFKLP-SNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL 142
LF+ P SN L + Y + + ++R S ++G TWS QL P I + ++L
Sbjct: 95 LFQAPDSNELWVIYTAQEAGHQNKAIIRRRVSTDRGATWSPATQLFPEIGAFVRQPIVVL 154
Query: 143 ENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPF 194
+NG + C + W N S + + D G +W++ +PN + V +
Sbjct: 155 DNGTWVLPIWYCRAPDGFQWTGNDDISAVRYSNDRGATWKEQA---VPNSTGCVHMNI-R 210
Query: 195 HTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ T RS + + R S+ D WS +P L NPNSGI L G+L
Sbjct: 211 RGPDGTYLAFFRSRWADNVYRSTSSDCID----WSPPQPIGLPNPNSGICFDVLPSGKLA 266
Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
L +N S R L +S+D G +
Sbjct: 267 LVFNNSSATPDMRRREGLYDDITPEAEKLKNQPAVDGKEAIWGTPRSALSFGISEDFGLT 326
Query: 275 WH-------DALTLEEN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
W D + N + E SYP++ DG +HI YTY R +IK+V
Sbjct: 327 WRFKTIEDTDGYCMTNNSRDKINREASYPSIATGEDGWIHIAYTYFRQRIKYV 379
>gi|218461765|ref|ZP_03501856.1| putative glycosyl hydrolase protein [Rhizobium etli Kim 5]
Length = 395
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 139/368 (37%), Gaps = 80/368 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSDRWSDPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK-RSYNKGVTWSER---EQLPPGILGPS 135
NP++F P + L Y + + R+ N R + P + P+
Sbjct: 95 SEQNPLIFNAPDGKIWLLYTSQTSATRMARLSNFRTSNDAANIPARFNPLRQPRYLRPPA 154
Query: 136 KNKPILLENGLL----LCGSSVESWNSWGSWMEVTVDAGRS--------WRKYGPIYIPN 183
+ + LL G W+ V + R W G +++
Sbjct: 155 DCRQRPAADWLLPIFRCVGLEGRRWSGDADTAAVLISRDRGANWQMVDIWDSLGAVHMNI 214
Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
L + V F+ NR ++ S S DGG +WS +PT+L N NS I
Sbjct: 215 LPLGGSEMVAFYR-NRFAETILSS----------RSADGGESWSPPEPTELPNNNSSIQA 263
Query: 244 VKLKDGRLLLAYNT---------------------------------------VSRGVLK 264
L DG + + YN V R L
Sbjct: 264 TVLDDGGIAMVYNHSNATMSDARRQSLYDEIEGDVAGESAAIVADAGRKAVWGVPRAPLS 323
Query: 265 VALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
+A+S+D G ++ + L+ ++L EFSYP+VIQ SDG++H+ YTY R
Sbjct: 324 LAISRDGGRTFPQRMDLDTGDGFCLSNNSKDSLNREFSYPSVIQGSDGTLHVAYTYYRRA 383
Query: 313 IKHVVLLP 320
IK+V L P
Sbjct: 384 IKYVRLAP 391
>gi|398798882|ref|ZP_10558179.1| putative neuraminidase (sialidase) [Pantoea sp. GM01]
gi|398099705|gb|EJL89957.1| putative neuraminidase (sialidase) [Pantoea sp. GM01]
Length = 395
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 149/362 (41%), Gaps = 84/362 (23%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ +W + G +W ++P NP+
Sbjct: 36 HAANLLPLPNGDVLCVWFGGTQEGIADISVWCSRLESGSTQWSEAQKLSDDPTRSEQNPL 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL---------PPGILGP 134
LF P L L + + + + ++ +S++ G +W + L P ++ P
Sbjct: 96 LFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDLGHSWGAIDTLLDQPGTFIRQPMVVLP 155
Query: 135 SKN--KPILL---ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
+ N P+ + G+ G N S ++++ D G+SWR + +P +SL +
Sbjct: 156 NDNWLLPVFYCRTQPGVKWVG------NDDISAVKISSDRGKSWRD---VVVP-DSLGCV 205
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
+ +L L RS + S S DGG +WS + T+L N NS I L +G
Sbjct: 206 HMNITLLQDGSLLALYRS-RWADFIYQSRSTDGGESWSAPRATELPNNNSSIQVTTLHNG 264
Query: 250 RLLLAYNTVS-------------------------------------------RGVLKVA 266
L L +N +S R + +A
Sbjct: 265 HLALVFNAMSAKDARERRVSLYDEIEDEEDDGDAAVAAEPQVHSGRSAFWGAPRAPMTLA 324
Query: 267 LSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+S D G SW L+E L EFSYP++ Q+ DG++H+ YTY R IK
Sbjct: 325 ISADGGKSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSEDGALHVAYTYFRQAIK 384
Query: 315 HV 316
+V
Sbjct: 385 YV 386
>gi|408377280|ref|ZP_11174883.1| hypothetical protein QWE_06783 [Agrobacterium albertimagni AOL15]
gi|407749239|gb|EKF60752.1| hypothetical protein QWE_06783 [Agrobacterium albertimagni AOL15]
Length = 396
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 141/371 (38%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
+P HAS + G + A+FGGS EG D+ I+ G+ W +P +P+
Sbjct: 28 SPMVQNHASFLHRQADGSLICAWFGGSLEGKSDISIYASVLVAGQNAWGAPQRLSSDPDH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF+ P LLLF+ Q C R V + P L ++ +
Sbjct: 88 SEQNPVLFQAPDGRLLLFH-TSQPSGNQDECRIRMSEISVDAED-----PTRLRATEGRY 141
Query: 140 ILLENG-LLLCGSSVESWNSW--------------------GSWMEVTVDAGRSWRKYGP 178
+ L G + ++ + +W + + ++ D G +WR
Sbjct: 142 LDLPRGSFVRAPVAIRADGTWLLPIFRCIPRPGQKWNGSHDTAAIGISEDGGATWR---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ ++S+ + P + T+ R V +ES D G TWS PT L N N
Sbjct: 198 LEELSQSIGCVHMSPVALGDTTMAAFFRRRQA-DFVYRTESHDAGRTWSAPAPTDLPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I + L DGR+ + N V+ R
Sbjct: 257 SSIAAIALDDGRVAIICNPVNAAASLDRRTSLYDELGEEDDRPEADPTGGCAPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
+ V +S D G ++ + +E+ E SYP ++ DGS+HI YTY+
Sbjct: 317 PVTVCISSDGGLTFPQRIQIEDGPGTCLSNDSTDGRNKEMSYPWLLADQDGSLHIAYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLAP 387
>gi|86359911|ref|YP_471801.1| glycosyl hydrolase [Rhizobium etli CFN 42]
gi|86284013|gb|ABC93074.1| putative glycosyl hydrolase protein [Rhizobium etli CFN 42]
Length = 395
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 148/371 (39%), Gaps = 86/371 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ GRW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMRDISIYMSRLPPGSGRWSEPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF-- 151
Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
+ + ++ + G + C G + W+ V + D G SW+ G ++
Sbjct: 152 VRQQIVVNDRGDWLLPIFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 211
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ L + + F+ NR ++ S S DGG TWS +PT+L N NS
Sbjct: 212 MNILPLGGCEMIAFYR-NRFAETILSS----------RSADGGETWSPPEPTELPNNNSS 260
Query: 241 IDGVKLKDGRLLLAYNT---------------------------------------VSRG 261
I L +G + + YN V R
Sbjct: 261 IQATVLDNGGIAMVYNHSNATMSDARRQSLYDEIEGDECGENAAVVADAGRKAVWGVPRA 320
Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
L +A+S+D G ++ + L+ ++L EFSYP+VI+ SDG++H+ YTY
Sbjct: 321 PLSLAISRDGGRTFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSVIEGSDGTLHVAYTYY 380
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 381 RRAIKYVRLAP 391
>gi|171463784|ref|YP_001797897.1| BNR/Asp-box repeat-containing protein [Polynucleobacter necessarius
subsp. necessarius STIR1]
gi|171193322|gb|ACB44283.1| BNR/Asp-box repeat protein [Polynucleobacter necessarius subsp.
necessarius STIR1]
Length = 413
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 143/320 (44%), Gaps = 38/320 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVP--- 80
S HA++++ + G V +F GS EGA DV I+ + W +P + + +
Sbjct: 73 SVHAASMIALKDGAVRVFWFAGSREGAADVAIYNSVYDPHSTNWSAPTVVMDRVSAEKGL 132
Query: 81 ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGI 131
+ NPV +L L LF+ + + W+G S ++G+TWS ++L I
Sbjct: 133 LRYIAKLGNPVPTRLVDGRLQLFF-VTVSIGGWAGSSISAITSDDEGLTWSSPQRL---I 188
Query: 132 LGPSKNKPILLEN-GLLLCGS--SVESWNSW-GSWME-VTVDAGRSWRKYGPIYIPNESL 186
P N L+++ G++ + +++ W G + E + VDAGR K +
Sbjct: 189 SSPLLNLSTLVKSPGVMFVDGLMGMPAYHEWVGRFGEFLRVDAGRVIDKRRM----SSGR 244
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
IQP+ F + R G ++ +S + + G W ++ + NPNS + G
Sbjct: 245 GAIQPLVFVNDAQDASAFFRQTRSAGLPKQIPVSYTQNAGQNWHQSEDLAIANPNSAVAG 304
Query: 244 VKLKDGRLLLAYNTVSRGVLKVAL--SKDDGDSWHDALTLEENLAM------EFSYPAVI 295
V LK G +L N + G ++ L S W LE++ A+ EFSYP +I
Sbjct: 305 VILKSGTRILVLNDIEYGRHRLVLMMSSPKNGQWQTVEVLEDDEALPDIQRKEFSYPYLI 364
Query: 296 QASDGSVHITYTYNRTQIKH 315
H+ YT++R +I+H
Sbjct: 365 TVDGEDAHLVYTWDRKKIRH 384
>gi|121605966|ref|YP_983295.1| BNR/Asp-box repeat-containing protein [Polaromonas
naphthalenivorans CJ2]
gi|120594935|gb|ABM38374.1| BNR/Asp-box repeat protein [Polaromonas naphthalenivorans CJ2]
Length = 461
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 155/370 (41%), Gaps = 74/370 (20%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RW--QSPIIAD 74
A+S+P ++ HA++IVE+ G +F GS EGA DV I + RW ++ ++
Sbjct: 83 ASSSPGQAVHAASIVELRDGRLRAVWFSGSREGAGDVVIKTSVMEPESLRWSEETTLLDR 142
Query: 75 EEPNVPMW-------NPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSER 124
+ +W NPV+ ++P + L+L + + V W+G RS + G++WS
Sbjct: 143 QHLQQGLWRYVKKIGNPVIARMPDDSLVL-WMVNVSVGGWAGSSITWLRSTDDGMSWSLP 201
Query: 125 EQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
+L I +K PI ++G + E + + + ++ + +
Sbjct: 202 RRLVTSPFLNISTLAKGAPIFYQDGQIGLPVYHEFVSKFAEILRISPEG----QVLDKTR 257
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
IP S +QPV ++ R ++ MRS + V SE+ D G TWS T NP+S
Sbjct: 258 IPRSQTS-LQPVVLVSSARQAQIYMRSGHSTA-VMASETVDAGKTWSATHATAWPNPDSA 315
Query: 241 IDGV-------------KLKDGRLLLAYNTVSRGVLKVALS------------------- 268
+ GV K K+ +L T + G + A+
Sbjct: 316 LAGVVTATSQQWLALNPKPKNREILALLQTSAAGSFEGAIPWVVESSPTPEIRLPVSDYE 375
Query: 269 -----------------KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
+ D S L EFSYP ++Q+ DG +H+ YT++R+
Sbjct: 376 RLLGDELQKQGASQAQIQADVASAKRQLCGSATCLQEFSYPYLLQSRDGYLHLVYTWHRS 435
Query: 312 QIKHVVLLPN 321
+IKHV L P
Sbjct: 436 RIKHVRLDPR 445
>gi|222082211|ref|YP_002541576.1| glycosyl hydrolase [Agrobacterium radiobacter K84]
gi|221726890|gb|ACM29979.1| glycosyl hydrolase protein [Agrobacterium radiobacter K84]
Length = 395
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 146/371 (39%), Gaps = 86/371 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW +P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLFPGATRWSAPEKMSDDPVK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
NP++F P + L Y Q G + + S + G T+ E L + G
Sbjct: 95 SEQNPLIFNAPGGDVWLLYT-SQTSGNQDGSVVKCRISADGGKTFGEVSVL-CDLPGTFV 152
Query: 137 NKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSW------RKYGPIY 180
+PI++ NG + C G E W V + D G+SW G ++
Sbjct: 153 RQPIIV-NGRGAWLLPVFRCIGLLGERWTGDADTAGVLISRDQGKSWVMTAVPDSLGAVH 211
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ +L V F+ NR + V S S DGG +WS +P L N NS
Sbjct: 212 MNIVALEGDDLVAFYR-NRFAQ----------SVLASRSTDGGESWSAPQPMDLPNNNSS 260
Query: 241 IDGVKLKDGRLLLAYN---------------------------------------TVSRG 261
I VKLK + + YN V R
Sbjct: 261 IQAVKLKSCVIAIVYNHSNALMSDARRQSLYDEIEGGEAIDAAPLQTLERRKAVWGVPRA 320
Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
L +A S D G S+ + L+ ++L EFSYP+++Q +DG VHI YTY
Sbjct: 321 PLSLAFSSDGGTSFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVQGADGVVHIAYTYY 380
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 381 RRAIKYVRLSP 391
>gi|149177682|ref|ZP_01856283.1| hypothetical protein PM8797T_27502 [Planctomyces maris DSM 8797]
gi|148843500|gb|EDL57862.1| hypothetical protein PM8797T_27502 [Planctomyces maris DSM 8797]
Length = 393
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 140/301 (46%), Gaps = 38/301 (12%)
Query: 44 YFGGSCEGAPDVKIWLQTFKDGR-WQSPIIADEEPN-VPMWNPVLFKLPSNGLLLFYKIG 101
Y GG+ EG + + + + DG+ W P + + P V ++P ++ PS + +F+
Sbjct: 73 YAGGTGEGELNYVVLVTSGDDGKTWSGPKLVIDPPGPVRAYDPAIWHDPSGRMWVFW--A 130
Query: 102 QEVQKWSG-------CMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLL------ 148
Q + W G S + W+E +L GI+ NKP +L NG L
Sbjct: 131 QSYRWWDGRSGVWAVTTDDSDQENPKWTEPRRLCNGIM---MNKPTVLSNGDWLLPVAIW 187
Query: 149 ---CGSSVESW----NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL 201
S+E G + ++ D G++++ G +P + + + + +L
Sbjct: 188 KQPAKESIEHRFDLPEERGGNIFISKDQGKTFQLLGQTDVPERTFD--EHMIVERKDNSL 245
Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY----NT 257
L+R+ GIG S S DGG TWS K + + + N+ +LK G LL+ +
Sbjct: 246 WTLVRTRYGIGE---SISADGGKTWSKGKASDIPHVNARFFIRRLKSGNLLMVRHNPADK 302
Query: 258 VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
SR L LS+DDG +W L L+E + SYP +Q+ DG+++I Y + RT+ K ++
Sbjct: 303 KSRRDLTAFLSQDDGKTWQGGLLLDERSGV--SYPDGVQSDDGTIYIIYDFARTRDKKIL 360
Query: 318 L 318
+
Sbjct: 361 M 361
>gi|213586273|ref|ZP_03368099.1| hypothetical protein SentesTyph_35378 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-0664]
Length = 210
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 105/211 (49%), Gaps = 37/211 (17%)
Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS------------- 187
+ NG + S+ES W ++++ + D G+ W P+ ++S
Sbjct: 1 MASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKG 60
Query: 188 ---------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
VIQP + ++ + +L+RS G + S+S D G TWS A+ T
Sbjct: 61 MLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARAT 118
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAM 287
L N NSGID V ++DG L+LA N V+ R L + S+D+G+SW L LE +
Sbjct: 119 FLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG- 177
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
E+SYPA+I + G VHITYT+NR I + L
Sbjct: 178 EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 207
>gi|389626781|ref|XP_003711044.1| glycosyl hydrolase [Magnaporthe oryzae 70-15]
gi|351650573|gb|EHA58432.1| glycosyl hydrolase [Magnaporthe oryzae 70-15]
gi|440466568|gb|ELQ35828.1| glycosyl hydrolase [Magnaporthe oryzae Y34]
gi|440487602|gb|ELQ67382.1| glycosyl hydrolase [Magnaporthe oryzae P131]
Length = 396
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 152/368 (41%), Gaps = 92/368 (25%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HAS ++ + G L A+FGGS EG D+ IWL G W SP+ ++P NPV
Sbjct: 33 HASNLLLLPSGELLCAWFGGSMEGISDIYIWLSRLAPGAKEWTSPLQVSKDPGRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL----------PPGIL 132
LF +G L QE + C+ R S + G TW E E+L P +L
Sbjct: 93 LFLDERSGNLWCIYTAQEHGAQNKCLVRRRISSDLGKTWGEAEELFSDRGIFVRQPIVVL 152
Query: 133 ----GPSKNKPIL------LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
GP K+ +L E G+ G N S + + D+G++W++ +P
Sbjct: 153 KDKNGPGKDVWVLPVFHCRAEEGVEWVG------NDDVSAVRYSEDSGKTWKES---MVP 203
Query: 183 NESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+V I+P P + R + I + S+ C + W+ +PT+L NPNS
Sbjct: 204 ESRGAVHMNIRPDPSNPGKFVGVYRSRWADNI-YLSRSDDC---INWTKPEPTELPNPNS 259
Query: 240 GID-GVKLKDGRLLLAYNT--------------------------------------VSR 260
GI GV K G+L + YN R
Sbjct: 260 GICLGVLPKSGKLAMVYNDSRATPDMVQRTGLYDDIETDGPKLKKQAVIGDKAAIWGTPR 319
Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
+ +ALS D G +W + LE E L E SYP++ A D ++HI +T+
Sbjct: 320 SRMTLALSSDGGRTWPEKRLLEEGDGFCLTNNSKEKLNRELSYPSIFVADDETMHIAFTF 379
Query: 309 NRTQIKHV 316
R +IK+V
Sbjct: 380 WRQRIKYV 387
>gi|424920580|ref|ZP_18343943.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM597]
gi|392849595|gb|EJB02117.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM597]
Length = 394
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 143/366 (39%), Gaps = 77/366 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLRDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSEPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G ++ L PG
Sbjct: 95 SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSVVKCRISDDGGKSFGPVRILCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V + D G SW+ IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMRD---IPDT 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + + F + S S DGG TWS +PT+L N NS I
Sbjct: 207 IGAVHMNILPLDGDDMIAFYRNRF--AESILSSRSSDGGETWSAPEPTELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
L D + + YN V R L +A
Sbjct: 265 TLNDRAIAMVYNHSNASTSDARRQSLYDEIEGGETGEAPVVADIGRKAVWGVPRAPLSLA 324
Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+S+D G S+ + L+ ++L EFSYP++IQ SDG++HI YTY R IK
Sbjct: 325 ISRDSGKSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIIQGSDGALHIAYTYYRRAIK 384
Query: 315 HVVLLP 320
+V L P
Sbjct: 385 YVRLAP 390
>gi|445227594|ref|ZP_21404370.1| hypothetical protein SEE10_003303 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
gi|444866898|gb|ELX91610.1| hypothetical protein SEE10_003303 [Salmonella enterica subsp.
enterica serovar Enteritidis str. SE10]
Length = 158
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++
Sbjct: 24 VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 81
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
DG L+LA N V+ R L + S+D+G+SW L LE + E+SYPA+I + G V
Sbjct: 82 DGTLILALNLVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 139
Query: 303 HITYTYNRTQIKHVVL 318
HITYT+NR I + L
Sbjct: 140 HITYTWNRKNIVYCRL 155
>gi|338811147|ref|ZP_08623378.1| expressed protein [Acetonema longum DSM 6540]
gi|337276830|gb|EGO65236.1| expressed protein [Acetonema longum DSM 6540]
Length = 386
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 144/334 (43%), Gaps = 47/334 (14%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
+ + HA ++E+ G L +F G+ EG+ D+ I W + +
Sbjct: 33 YPTAHAPAMLELPNGDLLCCWFAGTYEGSADIHIICSILPKNSPAWLPTVDISGDSTRSE 92
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
NP LF P + + Y + Q+ M+ +S + G TW E + P G
Sbjct: 93 QNPSLFYGPDHAVWAMYTAQLDRQEGKDNMQYTAMVRCQKSTDGGKTWGPYETVFPE-EG 151
Query: 134 PSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+PI +L NG LC S + + S ++ D G++WR + +P +S
Sbjct: 152 TFCRQPIQILSNGRWIFGNWLCTDSKDGLSDDPSVFRISDDEGKTWRM---VMMP-KSNG 207
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P N L MR+ R+ SES D G TWS PT L N NS I ++L+
Sbjct: 208 HVHPNVVELENGHLVAFMRNREA-HRIHRSESFDWGETWSKPAPTPLPNNNSSISALRLQ 266
Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
GR+ +AYN R + VALS+D G ++ + +EN
Sbjct: 267 SGRIAIAYNPTCTPNPQSGKAAWPGLRCPVAVALSEDGGLTFPIVRWMERGEGFIGDENK 326
Query: 286 A--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
++ YP ++Q+ DG++H+ Y ++ R +K+V
Sbjct: 327 TNNRQYEYPYLMQSVDGALHLAYASHTRAGVKYV 360
>gi|438144458|ref|ZP_20875457.1| hypothetical protein SEEP9120_19452, partial [Salmonella enterica
subsp. enterica serovar Pullorum str. ATCC 9120]
gi|434939238|gb|ELL46096.1| hypothetical protein SEEP9120_19452, partial [Salmonella enterica
subsp. enterica serovar Pullorum str. ATCC 9120]
Length = 151
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++
Sbjct: 17 VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 74
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
DG L+LA N V+ R L + S+D+G+SW L LE + E+SYPA+I + G V
Sbjct: 75 DGTLILALNLVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 132
Query: 303 HITYTYNRTQIKHVVL 318
HITYT+NR I + L
Sbjct: 133 HITYTWNRKNIVYCRL 148
>gi|417383909|ref|ZP_12149461.1| hypothetical protein LTSEJOH_3060 [Salmonella enterica subsp.
enterica serovar Johannesburg str. S5-703]
gi|417531553|ref|ZP_12186235.1| hypothetical protein LTSEURB_2815 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
gi|353610544|gb|EHC63469.1| hypothetical protein LTSEJOH_3060 [Salmonella enterica subsp.
enterica serovar Johannesburg str. S5-703]
gi|353664663|gb|EHD03025.1| hypothetical protein LTSEURB_2815 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
Length = 188
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++
Sbjct: 54 VIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 111
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
DG L+LA N V+ R L + S+D+G+SW L LE + E+SYPA+I + G V
Sbjct: 112 DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 169
Query: 303 HITYTYNRTQIKHVVL 318
HITYT+NR I + L
Sbjct: 170 HITYTWNRKNIVYCRL 185
>gi|209552288|ref|YP_002284203.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209539400|gb|ACI59332.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 394
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 142/366 (38%), Gaps = 77/366 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSEPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G ++ L PG
Sbjct: 95 SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSVVKCRISDDGGKSFGPVRILCESPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V + D G SW+ IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMRD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V V + + F + S S DGG TWS +P +L N NS I
Sbjct: 207 IGAVHMNVLPLGGDDMIAFYRNRF--AESILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
L DG + + YN V R L +A
Sbjct: 265 VLNDGAIAMVYNHSNASTSDARRQSLYDEIESGDAGEAPVVADIGRKAVWGVPRAPLSLA 324
Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+S+D G S+ + L+ ++L EFSYP++IQ DG++HI YTY R IK
Sbjct: 325 ISRDSGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIIQGGDGALHIAYTYYRRAIK 384
Query: 315 HVVLLP 320
+V L P
Sbjct: 385 YVRLAP 390
>gi|424889567|ref|ZP_18313166.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393171785|gb|EJC71830.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 394
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 144/366 (39%), Gaps = 77/366 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPDAERWSEPEKMSDDPQK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G ++ L PG
Sbjct: 95 SEQNPLIFNAPDGKTWLLYT-SQTSGNQDGSLVKCRISDDGGKSFGPVSVLCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+PI++ NG + C G + + W+ V + + G SW+ + IP+
Sbjct: 152 -VRQPIVV-NGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRNGGESWQMHD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + + F + S S DGG TWS PT+L N NS I
Sbjct: 207 IGAVHMNILPLGGDEMVAFYRNRF--AESILSSRSSDGGETWSAPAPTELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
L DG + + YN V R L +A
Sbjct: 265 VLNDGAIAMVYNHSNATMSDARRQSLYDEIEGDEAGEATVVAHIGRNAVWGVPRAPLSLA 324
Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+S+D G S+ + L+ ++L EFSYP++I+ DG++HI YTY R IK
Sbjct: 325 ISRDGGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIIEGGDGTLHIAYTYYRRAIK 384
Query: 315 HVVLLP 320
+V L P
Sbjct: 385 YVRLAP 390
>gi|449138247|ref|ZP_21773538.1| hypothetical protein RE6C_05593 [Rhodopirellula europaea 6C]
gi|448883183|gb|EMB13725.1| hypothetical protein RE6C_05593 [Rhodopirellula europaea 6C]
Length = 152
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 76/146 (52%), Gaps = 28/146 (19%)
Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS- 259
L++L RS R+ S S DGG TWS T L NPNSG D V L+DGR +L YN
Sbjct: 8 LQILCRSQEK--RIVESWSKDGGKTWSTLTATDLPNPNSGTDAVTLQDGRQVLIYNHSEG 65
Query: 260 ------------RGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVI 295
R +L +A+S DG +W LTLE E+SYPA+I
Sbjct: 66 MVRRKDAPDLKPRRILNLAISS-DGKTWKPVLTLENETGPHPKDADRRRHFGEYSYPAII 124
Query: 296 QASDGSVHITYTYNRTQIKHVVLLPN 321
Q SDG +++ YTYNR +KH ++ P+
Sbjct: 125 QTSDGMLNMVYTYNREGVKHAIVDPS 150
>gi|398351148|ref|YP_006396612.1| hypothetical protein USDA257_c12640 [Sinorhizobium fredii USDA 257]
gi|390126474|gb|AFL49855.1| hypothetical protein USDA257_c12640 [Sinorhizobium fredii USDA 257]
Length = 397
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 148/364 (40%), Gaps = 71/364 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ +++ + G RW +P ++P
Sbjct: 35 SPCVQNHAANLSFLPDGTLTCVWFGGTMEGMGDISVYMSRLEPGARRWTNPEKMSDDPQK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NP++F P + L + + +KR S + G ++ E L I G
Sbjct: 95 SEQNPLIFTAPDGRVWLLFTSQTSGHQDGAVVKRRISEDGGKSFGPTEVLC-DIPGTFVR 153
Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
+PI++ + C G + W V + D G++W +P+ + +V
Sbjct: 154 QPIIVNGAGDWLLPIFRCMGEAGRRWTGAIDRAAVLISRDQGKAWTM---AEVPDSTGAV 210
Query: 189 -IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ VP N L R+ V S+S D G W+ +P L N NS I +++K
Sbjct: 211 HMSIVPVE--NGALVAFYRNRFAT-HVLRSQSSDAGTHWTAPEPVDLPNNNSSIQAIRMK 267
Query: 248 DGRLLLAYNT--------------------------------------VSRGVLKVALSK 269
+G + + YN V R L +A S
Sbjct: 268 NGAIAMVYNHSNASMSDARRHSLYDEIEGEGDSGSAEAVASARPAVWGVPRAPLSLAFST 327
Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
D G ++ + L+ ++L EFSYP++++ ++G +H+ YTY R IK+V
Sbjct: 328 DGGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYVR 387
Query: 318 LLPN 321
L P+
Sbjct: 388 LDPD 391
>gi|417100343|ref|ZP_11960093.1| putative glycosyl hydrolase protein [Rhizobium etli CNPAF512]
gi|327192232|gb|EGE59197.1| putative glycosyl hydrolase protein [Rhizobium etli CNPAF512]
Length = 411
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 146/371 (39%), Gaps = 86/371 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 51 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEK 110
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G T+ + L PG
Sbjct: 111 SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF-- 167
Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
+ + ++ + G + C G + W+ V + D G SW+ G ++
Sbjct: 168 VRQQIVVNDRGDWLLPVFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 227
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ L V F+ NR ++ S S DGG WS +PT+L N NS
Sbjct: 228 MNILPLGGGDMVAFYR-NRFAETILSS----------RSADGGENWSPPEPTELPNNNSS 276
Query: 241 IDGVKLKDGRLLLAYNT---------------------------------------VSRG 261
I L +G + + YN V R
Sbjct: 277 IQATVLDNGGIAMVYNHSNADMSDARRQSLYDEIEGDEAGEHAAVVADAGRKAVWGVPRA 336
Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
L +A+S+D G ++ L L+ ++L EFSYP+VIQ SDG++H+ YTY
Sbjct: 337 PLSLAISRDGGRTFPHHLDLDTGDGFCLSNNSKDSLNREFSYPSVIQGSDGTLHVAYTYY 396
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 397 RRAIKYVRLAP 407
>gi|417349594|ref|ZP_12128222.1| Putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
gi|353572464|gb|EHC36104.1| Putative cytoplasmic protein [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
Length = 188
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 81/136 (59%), Gaps = 9/136 (6%)
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V +
Sbjct: 54 VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMP 111
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
DG L+LA N V+ R L + S+D+G+SW L LE + E+SYPA+I + G V
Sbjct: 112 DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 169
Query: 303 HITYTYNRTQIKHVVL 318
HITYT+NR I + L
Sbjct: 170 HITYTWNRKNIVYCRL 185
>gi|36958681|gb|AAQ87149.1| Hypothetical protein RNGR00124 [Sinorhizobium fredii NGR234]
Length = 399
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/365 (23%), Positives = 143/365 (39%), Gaps = 71/365 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ +++ G RW P ++P
Sbjct: 35 SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLAPGGRRWTDPEKMSDDPAK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NP++F P + L + + +KR S + G T+ E L I G
Sbjct: 95 SEQNPLIFAAPDGRIWLLFTSQTSGNQDGAVVKRRISEDGGKTFGATEVLC-DIPGTFVR 153
Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
+PI++ + C G + W V + D GR+W +P+ +V
Sbjct: 154 QPIIVNAAGDWLLPVFRCIGEAGRRWTGDVDRAAVLISRDEGRTWTMQ---EVPDSIGAV 210
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ + + F V S+S D G TW+ + L N NS I +++K+
Sbjct: 211 HMNLVAADGDAMIAFYRNRF--ATHVLRSQSSDAGHTWTAPEAVDLPNNNSSIQAIRMKN 268
Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
G + + YN V R L +A S
Sbjct: 269 GAIAMVYNHSNASMSEARRHSLYDEIEGDGEELGGAAEAVASGRPAVWGVPRAPLSLAFS 328
Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
DDG ++ + L+ ++L EFSYP++++ ++G +H+ YTY R IK+V
Sbjct: 329 TDDGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 388
Query: 317 VLLPN 321
L P+
Sbjct: 389 RLDPS 393
>gi|213024134|ref|ZP_03338581.1| hypothetical protein Salmonelentericaenterica_17126 [Salmonella
enterica subsp. enterica serovar Typhi str. 404ty]
Length = 139
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
VIQP + ++ + +L+RS G + S+S D G TWS A+ T L N NSGID V ++
Sbjct: 5 VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATFLPNNNSGIDLVSMQ 62
Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
DG L+LA N V+ R L + S+D+G+SW L LE + E+SYPA+I + G V
Sbjct: 63 DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SKGGVV 120
Query: 303 HITYTYNRTQIKHVVL 318
HITYT+NR I + L
Sbjct: 121 HITYTWNRKNIVYCRL 136
>gi|119897038|ref|YP_932251.1| hypothetical protein azo0747 [Azoarcus sp. BH72]
gi|119669451|emb|CAL93364.1| conserved hypothetical secreted protein [Azoarcus sp. BH72]
Length = 426
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 147/359 (40%), Gaps = 75/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--W--QSPIIADEEPNV--- 79
HA+++ + G +FGG EG +VK++ T G W Q I E+ +
Sbjct: 63 VHAASVAVLPDGRLFATWFGGEREGGTEVKVYAATRNPGEAGWGRQHAIATPEQTSADVG 122
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP--- 129
M NPV F P L + Y + + W+ + RS + G TW +L
Sbjct: 123 RLVRKMGNPVAFVTPRGELWVVY-VSVTLGGWATSHLNLLRSPDLGTTWLPARRLAATPF 181
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
+ K P+ +NG + E + + V D G K + + + S
Sbjct: 182 FNLSTLVKGFPVFFDNGDVGLPVYHEMAGKFAELL-VLSDEGEVRHK---VRMDHGRRS- 236
Query: 189 IQPVPFHTANRTLRVLMR--SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
+QPV + LMR +G R SE+ DGG +WS +PT L NPNS + ++L
Sbjct: 237 LQPVVLVEDEQRAVALMRYGGEHGPFRAWRSETADGGRSWSAVEPTTLANPNSALAALRL 296
Query: 247 KDGRLLLAYNTVSRGVLKVAL--SKDDGDSW--------------HDA------LTLEEN 284
DGRLL N L+++L S+D G +W H+A LE +
Sbjct: 297 DDGRLLAVANDTEDERLRLSLLVSEDGGRNWRSIHRFEDKQDFAGHEAGGEVFRARLEAD 356
Query: 285 LA---------------------------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+A ++ YP +++A+DG H+ YT+NR+ ++H+
Sbjct: 357 VADLGPGPAAADLVRVAERNLCRNQGACGWQYDYPYLVRAADGDFHLVYTWNRSFVRHI 415
>gi|227819951|ref|YP_002823922.1| glycosyl hydrolase [Sinorhizobium fredii NGR234]
gi|227338950|gb|ACP23169.1| putative glycosyl hydrolase protein [Sinorhizobium fredii NGR234]
Length = 385
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/365 (23%), Positives = 143/365 (39%), Gaps = 71/365 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ +++ G RW P ++P
Sbjct: 21 SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLAPGGRRWTDPEKMSDDPAK 80
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NP++F P + L + + +KR S + G T+ E L I G
Sbjct: 81 SEQNPLIFAAPDGRIWLLFTSQTSGNQDGAVVKRRISEDGGKTFGATEVLCD-IPGTFVR 139
Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
+PI++ + C G + W V + D GR+W +P+ +V
Sbjct: 140 QPIIVNAAGDWLLPVFRCIGEAGRRWTGDVDRAAVLISRDEGRTWTMQ---EVPDSIGAV 196
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ + + F V S+S D G TW+ + L N NS I +++K+
Sbjct: 197 HMNLVAADGDAMIAFYRNRF--ATHVLRSQSSDAGHTWTAPEAVDLPNNNSSIQAIRMKN 254
Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
G + + YN V R L +A S
Sbjct: 255 GAIAMVYNHSNASMSEARRHSLYDEIEGDGEELGGAAEAVASGRPAVWGVPRAPLSLAFS 314
Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
DDG ++ + L+ ++L EFSYP++++ ++G +H+ YTY R IK+V
Sbjct: 315 TDDGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 374
Query: 317 VLLPN 321
L P+
Sbjct: 375 RLDPS 379
>gi|440714059|ref|ZP_20894645.1| hypothetical protein, secreted [Rhodopirellula baltica SWK14]
gi|436441048|gb|ELP34323.1| hypothetical protein, secreted [Rhodopirellula baltica SWK14]
Length = 376
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 152/343 (44%), Gaps = 49/343 (14%)
Query: 10 LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
L A++ TF P P ++ HA T + V +G L A Y G + +
Sbjct: 19 LQAQDSTFLAPPKYIGPPQTIHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 78
Query: 57 IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
+ L T DG W+ ++ D + + P+ ++P L+ P L L + Q V
Sbjct: 79 VVLSTSGDGGETWEEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 134
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
GV + E E P + P + + +G+++C V S W +W E
Sbjct: 135 ----AGVWFLEIED--PDVEQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 188
Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
V+VD G++W + G +P + + + + + ++ +L R+ GIG S S
Sbjct: 189 ARMIVSVDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDGSIWLLARTKYGIGE---SVST 245
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
D G TW P+++ +P++ +L G LLL + R L +S DDG +W
Sbjct: 246 DRGKTWPELTPSKIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDDGKNW 305
Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L L+E + SYP Q +DG +HI Y YNRT+ +H++L
Sbjct: 306 KGGLMLDERAGV--SYPDGQQDADGVIHIIYDYNRTKDRHILL 346
>gi|385792411|ref|YP_005825387.1| hypothetical protein [Francisella cf. novicida Fx1]
gi|328676557|gb|AEB27427.1| BNR/Asp-box repeat protein [Francisella cf. novicida Fx1]
Length = 372
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 133/304 (43%), Gaps = 27/304 (8%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F V ++ E AP+ KI + + G+WQ P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-VTWYSSDQETAPNTKIVVAVAEKVAGKWQFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
+E + NP+++ L+F S + S + G TWS+ E
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPE 171
Query: 126 Q-LPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
N+ ++ +QP + L R + I ++ +E+ D GL+WS KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKKIYTNETSDSGLSWSKVKPTQLDNPDSG 286
Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
I +K+++G +LLAYN T SR L +AL D+ W + T + E SYPA
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLALKADNSQQWRNIYTFPNKIKGELSYPAFTLYQ 345
Query: 299 DGSV 302
D +
Sbjct: 346 DNII 349
>gi|424875893|ref|ZP_18299552.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393163496|gb|EJC63549.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 395
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 143/367 (38%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSVPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++FK P + L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFKAPDGKVWLLY-TSQTSGNQDGSVVKCRISGDGGQTFGPVQILCNNPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + + W+ V + D G SW+ IP+
Sbjct: 152 --VRQQIVVNGKGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGVSWQMRD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + F + S S D G TWS +P +L N NS I
Sbjct: 207 IGAVHMNILPLGGGEMIAFYRNRF--AENILSSRSSDSGETWSAPEPAELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYN---------------------------------------TVSRGVLKV 265
L DG + + YN V R L +
Sbjct: 265 ILNDGAIAMVYNHSNAAMSDARRQSLYDEIEGGEAGETAVFANSSVRKAVWGVPRAPLSL 324
Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
A+S+D G S+ + L+ ++L EFSYP+++Q DG++HI YTY R I
Sbjct: 325 AISRDGGKSFPHRIDLDSGDGFCLSNNSKDSLNREFSYPSIVQGGDGTLHIAYTYYRRTI 384
Query: 314 KHVVLLP 320
K+V L P
Sbjct: 385 KYVRLAP 391
>gi|145250895|ref|XP_001396961.1| glycosyl hydrolase [Aspergillus niger CBS 513.88]
gi|134082486|emb|CAK97293.1| unnamed protein product [Aspergillus niger]
gi|350636350|gb|EHA24710.1| hypothetical protein ASPNIDRAFT_40608 [Aspergillus niger ATCC 1015]
Length = 384
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 148/358 (41%), Gaps = 71/358 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
AP HAS ++ + G A+FGGS EG PD+ I+L + G W +
Sbjct: 27 APTVQSHASNLLRMPNGDLFCAWFGGSLEGKPDISIYLSRLRCGESEWTEASKMTHDDTR 86
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NPVLF P+ L L Y + + +K S + G TW + E++ G
Sbjct: 87 SEQNPVLFHSPTGELWLLYTSQHSGDQDTAIVKYRVSNDSGATWGD-EKVLFHDTGTFIR 145
Query: 138 KPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
+P+ +LE+G + C E W N S + V+ D GR+W + IP ES
Sbjct: 146 QPVAVLEDGAWVVPVFKCRVQPGERWLGNDDISCIRVSRDQGRTWTES---EIP-ESTGC 201
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ + + + RS V ++ S D G+ WS +PT L NPN+GI L
Sbjct: 202 VHMEIQRLKDGSYLGMFRS-RWADNVYLATSPD-GVVWSAPQPTSLPNPNAGICFDVLPS 259
Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
GR++L YN S R L VA S
Sbjct: 260 GRVVLVYNHSSKLDALGRRQGLYDDIADGVDERKNQASAKDGRESFWGSPRAPLCVAWSD 319
Query: 270 DDGDSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
D G +W D L E+ L E SYP+++ A +G +H+ YT+ R +IK+V
Sbjct: 320 DKGKTWKHRILEDGDGYCLTNNSEKKLNRELSYPSMVCA-EGIIHVAYTFWRQRIKYV 376
>gi|118497095|ref|YP_898145.1| BNR/Asp-box repeat-containing protein [Francisella novicida U112]
gi|194323392|ref|ZP_03057169.1| BNR/Asp-box repeat protein [Francisella novicida FTE]
gi|208778888|ref|ZP_03246234.1| BNR/Asp-box repeat protein [Francisella novicida FTG]
gi|254372460|ref|ZP_04987949.1| hypothetical protein FTCG_00020 [Francisella tularensis subsp.
novicida GA99-3549]
gi|118423001|gb|ABK89391.1| BNR/Asp-box repeat protein [Francisella novicida U112]
gi|151570187|gb|EDN35841.1| hypothetical protein FTCG_00020 [Francisella novicida GA99-3549]
gi|194322247|gb|EDX19728.1| BNR/Asp-box repeat protein [Francisella tularensis subsp. novicida
FTE]
gi|208744688|gb|EDZ90986.1| BNR/Asp-box repeat protein [Francisella novicida FTG]
gi|313661724|gb|ADR71924.1| KdoH1 [Francisella novicida U112]
Length = 372
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 135/305 (44%), Gaps = 29/305 (9%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAIAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
N+ ++ +QP VP + L + + + I R+ +E+ D GL+WS KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285
Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GI +K+++G +LLAYN T SR L +A D+ W + T + EFSYPA
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGEFSYPAFTLY 344
Query: 298 SDGSV 302
D +
Sbjct: 345 QDNII 349
>gi|427404018|ref|ZP_18894900.1| hypothetical protein HMPREF9710_04496 [Massilia timonae CCUG 45783]
gi|425717257|gb|EKU80222.1| hypothetical protein HMPREF9710_04496 [Massilia timonae CCUG 45783]
Length = 405
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/317 (24%), Positives = 137/317 (43%), Gaps = 40/317 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP-IIADEEP------- 77
S HAS++ + L ++ GS E PDVK++ T+ +G W +P ++A E
Sbjct: 91 SAHASSMTPLPGNRMLAFWWAGSRESGPDVKVYASTWANGAWSAPWMVASRESLGAALGF 150
Query: 78 -NVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILG 133
+ NPV + +G + Y + + W+ S ++G ++ R LP
Sbjct: 151 GVRRIGNPVAWT-DRDGKIHLYVVATGLGGWAASRIAHIVSNDQGASFRVRRVLP----- 204
Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW-------RKYGPIYIPN--E 184
P+ + L+ S W W+ V + G + + P + +
Sbjct: 205 ---TSPLFNTSALVRTSPLALSDGGW--WLPVYFEIGNKYPMLMAFDAQGNPTSLARIGQ 259
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
S S +QP + +R L+R + R+ + S DGG +W L N N+ + +
Sbjct: 260 STSTLQPAIVAVSKTEVRALLRDDSDARRMQHAASLDGGASWQDLPALDLPNHNTSMAAI 319
Query: 245 KLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQAS 298
+L G+ L+ +N +R L++++S DD +W + + A EFSYP + Q
Sbjct: 320 RLSSGQFLMLHNHAADDASARSRLRLSIS-DDMRTWRTVADIAKGKAGDEFSYPTLQQVG 378
Query: 299 DGSVHITYTYNRTQIKH 315
D +H+TYT+ R I H
Sbjct: 379 D-ELHVTYTWQRQAIAH 394
>gi|378763488|ref|YP_005192104.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
gi|365183116|emb|CCE99965.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
Length = 399
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 144/364 (39%), Gaps = 71/364 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ +++ + RW P ++P
Sbjct: 35 SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLEPRARRWTEPEKMSDDPAK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
NP++F P + L + + +KR S + G ++ E L I G
Sbjct: 95 SEQNPLIFTAPDGRIWLLFTSQTSGNQDGAVVKRRISDDGGKSFGATEVLC-DIPGTFVR 153
Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
+PI++ + + C G + W V + D GR+W +P +S+
Sbjct: 154 QPIIVNSAGDWLLPVFRCIGEADRRWTGAVDRAAVLISRDQGRTWTMQ---EVP-DSIGA 209
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ T+ R+ V S+S D GL W+ + +L N NS I +++K+
Sbjct: 210 VHMNVVRADGATMVAFYRNRFATN-VLRSQSSDAGLNWTAPEAVELPNNNSSIQAIRMKN 268
Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
G + + YN V R L +A S
Sbjct: 269 GAIAMVYNHSNASMSDARRHSLYDEIESDGGEDNGSTEMVASARPAVWGVPRAPLSLAFS 328
Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
D G ++ + L+ ++L EFSYP++++ ++G +H+ YTY R IK+V
Sbjct: 329 TDGGRTFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 388
Query: 317 VLLP 320
L P
Sbjct: 389 RLDP 392
>gi|409437210|ref|ZP_11264340.1| Glycosyl hydrolase protein [Rhizobium mesoamericanum STM3625]
gi|408751113|emb|CCM75496.1| Glycosyl hydrolase protein [Rhizobium mesoamericanum STM3625]
Length = 395
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 141/363 (38%), Gaps = 74/363 (20%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G W P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLATGAACWSQPEKMSDDPTK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
NP++F P + L Y Q G + + S + G T+ + L I G
Sbjct: 95 SEQNPLIFNAPDGKVWLLYT-SQTSGNQDGSIVKCRISEDGGKTFGDVRVLC-DIPGTFV 152
Query: 137 NKPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLS 187
+PI++ + C G E W V + DAG+SW +P+ +
Sbjct: 153 RQPIIVNKAGIWLLPVFRCVGLPGERWTGDVDTAGVLISHDAGKSWVM---TEVPDSIGA 209
Query: 188 V-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
V + VP N R I S S DGG WS +P L N NS I +L
Sbjct: 210 VHMNIVPIDGDNLVAFYRNRFAESI---LSSRSSDGGAHWSAPRPIDLPNNNSSIQATRL 266
Query: 247 KDGRLLLAYNT---------------------------------------VSRGVLKVAL 267
++G + + YN V R L +AL
Sbjct: 267 ENGMIAMVYNHSNASTSDARRQSLYDEIEGDEETKVAPVQTSERRKAVWGVPRAPLSLAL 326
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S D G S+ + L+ +++ EFSYP++IQ +DG VH+ YTY R IK+
Sbjct: 327 SGDGGVSFPRRVDLDTGDGYCLSNNSKDSVNREFSYPSIIQGADGVVHVAYTYYRRAIKY 386
Query: 316 VVL 318
V L
Sbjct: 387 VRL 389
>gi|213024184|ref|ZP_03338631.1| hypothetical protein Salmonelentericaenterica_17386 [Salmonella
enterica subsp. enterica serovar Typhi str. 404ty]
Length = 130
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 64/123 (52%), Gaps = 4/123 (3%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPV 85
+CHAST+V + G + A+F G CEG+ D IWL ++ W +P WNPV
Sbjct: 3 TCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPV 62
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILL 142
LF PS+ L LFYK+G V W S ++G TWS L + GP KNK +L
Sbjct: 63 LF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLA 121
Query: 143 ENG 145
NG
Sbjct: 122 SNG 124
>gi|254373922|ref|ZP_04989404.1| BNR/Asp-box repeat protein [Francisella novicida GA99-3548]
gi|151571642|gb|EDN37296.1| BNR/Asp-box repeat protein [Francisella novicida GA99-3548]
Length = 372
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 135/305 (44%), Gaps = 29/305 (9%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
N+ ++ +QP VP + L + + + I R+ +E+ D GL+WS KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285
Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GI +K+++G +LLAYN T SR L +AL D+ W + T + E SYPA
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLALKADNSQQWRNIYTFPNKIKGELSYPAFTLY 344
Query: 298 SDGSV 302
D +
Sbjct: 345 QDNII 349
>gi|373853235|ref|ZP_09596034.1| hypothetical protein Opit5DRAFT_4089 [Opitutaceae bacterium TAV5]
gi|372472762|gb|EHP32773.1| hypothetical protein Opit5DRAFT_4089 [Opitutaceae bacterium TAV5]
Length = 374
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 44 YFGGSCEGAPDVKIWLQTFKDGR-WQSPIIADEEPN--VPMWNPVLFKLPSNGLLLFY-- 98
Y GG E + + +++ GR W P + + P V ++PVL+ PS L +F+
Sbjct: 51 YTGGHTECEANHVVVVRSRDGGRTWSEPYLVIDPPARYVRAFDPVLWLDPSGRLWIFWAQ 110
Query: 99 ---KIGQEVQKWSGCMKRSYNKGVTWSEREQLP----PGILGPSK-------NKPILLEN 144
++G+ G + + GV W R P P P + NKP + N
Sbjct: 111 TFVRVGEGAPH--GGVVHNGRSGV-WFIRTDTPDAGQPAWTSPRRIANGVMMNKPTVRSN 167
Query: 145 GLLLCGSSVESWNSWGSWM-----------EVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
G L +S+ W G + V+ D G ++ G +P + + +
Sbjct: 168 GEWLLPTSI--WTEHGPLLPELAHERYPNITVSTDGGETFSLRGGADVPCRCID--EHML 223
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L +L+R+ GIG+ S S DGG TW PT + +P + +L GRLLL
Sbjct: 224 VERRDGSLWMLVRTLYGIGQ---SVSVDGGATWRPGWPTDIASPCTRFHIRRLASGRLLL 280
Query: 254 --AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
YN R L +LS DDG +WH L L+E E SYP DG +H+ Y R
Sbjct: 281 LHHYNFTGRSHLTASLSDDDGRTWHGRLLLDER--AEVSYPDAGITPDGRIHVIYDRERF 338
Query: 312 QIKHVVL 318
+ + ++L
Sbjct: 339 KAREILL 345
>gi|412991244|emb|CCO16089.1| BNR repeat-containing glycosyl hydrolase [Bathycoccus prasinos]
Length = 574
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 134/322 (41%), Gaps = 43/322 (13%)
Query: 38 GHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLF-------K 88
G L+A+F G EG +V I + K G +W + + NPVL K
Sbjct: 185 GDVLLAWFSGKSEGGDNVGIAVSKLKKGETKWSQAKFVSKAQHRSAQNPVLMRNNNPNDK 244
Query: 89 LPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGVTWSEREQLPPGILGPSKNKPILLENG 145
S + + + + RS + G TWS+ + KN I NG
Sbjct: 245 EDSKHVRIVHSSQAAYEGQGTSDMRSLFSTDGGETWSKPVVVDHKNGAFPKNSAIRALNG 304
Query: 146 LLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRT 200
L+ S ++ S ++ + D G SW + +P +++QP N
Sbjct: 305 DLILPMYYTPSGFFDASTQYSELQTSKDGGESWHPL--VEMPGTRGNLVQPTIVRLNNND 362
Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR---LLLAYNT 257
L RS + S+S + G TW+ AKPT+ N N+GI +KL+D L+L ++
Sbjct: 363 LLSFFRSRTA-DFIYASKSTNDGKTWTEAKPTKFPNNNAGIQMIKLRDSSKDVLVLVFDN 421
Query: 258 VSRGV--LKVALSKDDGDSWHDALTLEENLAM----------------EFSYPAVIQAS- 298
+ G L +ALS+D G SW LE ++ E+SYP++ +
Sbjct: 422 CNEGRFPLSIALSEDGGKSWDHIRDLEPDVDFDHEQPNRDKHTNDGDGEYSYPSIQEDGI 481
Query: 299 -DGSVHITYTYNRTQIKHVVLL 319
DG +H+ YT+ R IK+ +
Sbjct: 482 HDGLIHVAYTFRRETIKYSAFM 503
>gi|421587605|ref|ZP_16032986.1| putative glycosyl hydrolase protein [Rhizobium sp. Pop5]
gi|403707873|gb|EJZ22747.1| putative glycosyl hydrolase protein [Rhizobium sp. Pop5]
Length = 395
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 146/372 (39%), Gaps = 88/372 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSKRWSEPEKMTDDPQK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK-RSYNKGVTWSEREQLPPGILGPSKN- 137
NP++F P L Y + +K R + G R P IL S
Sbjct: 95 SEQNPLIFNAPDGKTWLLYTSQTSGNQDGSVVKCRISDDG----GRSFGPVRILCDSPGT 150
Query: 138 --KPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKY------GPI 179
+ ++ NG + C G + + W+ V + D G SW+ + G +
Sbjct: 151 FVRQQIVVNGRGEWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMHDIADSIGAV 210
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
++ L + + F+ NR ++ S S DGG TWS +PT+L N NS
Sbjct: 211 HMNILPLGGDEMIAFYR-NRFAESILSS----------RSSDGGRTWSAPEPTELPNNNS 259
Query: 240 GIDGVKLKDGRLLLAYNT---------------------------------------VSR 260
I L DG + + YN V R
Sbjct: 260 SIQATVLNDGGIAMVYNHSNAAMSDARRLSLYDEIEGDDAGETAAAVADIRRKAIWGVPR 319
Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
L +A+S+D G S+ + L+ ++L EFSYP+VIQ+ D ++H+ YTY
Sbjct: 320 APLSLAISRDGGRSFPYRIDLDTGDGHCLSNNSKDSLNREFSYPSVIQSGDRTLHVAYTY 379
Query: 309 NRTQIKHVVLLP 320
R IK++ L P
Sbjct: 380 YRRAIKYMRLDP 391
>gi|424892701|ref|ZP_18316281.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|424893072|ref|ZP_18316652.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393183982|gb|EJC84019.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2297]
gi|393184353|gb|EJC84390.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
trifolii WSM2297]
Length = 395
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLAPGAERWSEPDKMSDDPQK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE--REQLPPGILGPSKN 137
NP++F P + L Y + SG S K T + + P +L S
Sbjct: 95 SEQNPLIFNAPDGKIWLLY-----TSQTSGNQDGSVVKCRTSDDGGKSFGPVSVLCDSPG 149
Query: 138 KPI---LLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
+ L+ NG + C G + + W+ V + D G SWR + IP+
Sbjct: 150 TFVRQPLVVNGKGDWLLPVFRCVGRNGQRWSGDADTAAVLISRDGGASWRMHD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + ++ + F + S S D G TWS +PT+L N NS I
Sbjct: 207 IGAVHMNILPLGGDKMVAFYRNRF--AESILSSRSSDSGETWSAPEPTELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
L DG + + YN V R L +
Sbjct: 265 ILNDGAIAMVYNHSNASTSDARRQSLYDEIEGGETKEFTVVAADIGRKAVWGVPRAPLSL 324
Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
A+S+D G S+ + L+ ++L EFSYP++I+ DG++HI YTY R I
Sbjct: 325 AISRDGGRSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIIEGDDGTLHIAYTYYRRAI 384
Query: 314 KHVVLLP 320
K+V L P
Sbjct: 385 KYVRLAP 391
>gi|213580823|ref|ZP_03362649.1| hypothetical protein SentesTyph_06247 [Salmonella enterica subsp.
enterica serovar Typhi str. E98-0664]
Length = 138
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 4/136 (2%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G CEG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 129 PGIL---GPSKNKPIL 141
+ GP KNK +L
Sbjct: 123 NDDILPRGPVKNKLLL 138
>gi|418406167|ref|ZP_12979487.1| hypothetical protein AT5A_03055 [Agrobacterium tumefaciens 5A]
gi|358008080|gb|EHK00403.1| hypothetical protein AT5A_03055 [Agrobacterium tumefaciens 5A]
Length = 398
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 138/371 (37%), Gaps = 83/371 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HAS + + G L A+FGG+ EG D+ I+ G RW +P +P
Sbjct: 28 SPTIQNHASFLHLLQDGTLLCAWFGGTLEGKSDISIFASALTPGASRWGAPQRLSNDPAH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPV+F P L LF+ Q C R + R+ P L +
Sbjct: 88 SEQNPVIFTAPDGTLWLFH-TAQPSGNQDECRIR-----MARVVRDPATPEKLSAEDGRF 141
Query: 140 ILLENGLLL-------------------CGSSVESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + + WN + + ++ D G +W+
Sbjct: 142 LDLPKGCFVRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISKDNGLTWQ---- 197
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ S + P AN R V +ES DGG +W+ PT + N N
Sbjct: 198 LQELAGSTGCVHMSPVAGANGRYSAFFRRRQA-DFVYRTESTDGGRSWTEPLPTDVPNNN 256
Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
S I ++L GRL + N V+ R
Sbjct: 257 SSIAAIRLAHGRLAMICNPVNAAQSSDRRASLYDELGEEDDRPDADPSGGCVPIWGVPRA 316
Query: 262 VLKVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYN 309
+ + LS DDG S+ + +E+ +E SYP +++A+DG++H +YTY+
Sbjct: 317 PVSICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLEAADGTLHASYTYH 376
Query: 310 RTQIKHVVLLP 320
R IK+V L P
Sbjct: 377 RRAIKYVRLAP 387
>gi|291612790|ref|YP_003522947.1| BNR/Asp-box repeat protein [Sideroxydans lithotrophicus ES-1]
gi|291582902|gb|ADE10560.1| BNR/Asp-box repeat protein [Sideroxydans lithotrophicus ES-1]
Length = 466
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 139/378 (36%), Gaps = 87/378 (23%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEE 76
A+S HAS+++E+ G +F GS EGA DV I F R W + I
Sbjct: 80 ASSRQHVQVHASSLIELKDGRIRAFWFSGSREGASDVTINTAVFDPARDEWGAEQIVASR 139
Query: 77 PNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSER 124
+ + NPV + L LFY + + W+G S + G TWS
Sbjct: 140 SSTQRALHRYVSKLGNPVAGRAADGTLRLFY-VTVSLGGWAGSSITEMTSADDGETWSAP 198
Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
+L I P K P L +G + ES++ + + + D ++
Sbjct: 199 RRL---ITSPFINISTLVKGTPFLYADGSMGLPVYHESFSKFAEILHLDADGDVMDKQ-- 253
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES---CDGGLTWSYAKPTQL 234
+ +QPV ++ VL R + G ++ S DGG W + +
Sbjct: 254 --RLARAGQGTLQPVVLVKSSNDALVLTR-YAGRDNPHLARSLTTADGGRDWGAVEKSPF 310
Query: 235 LNPNSGIDGVKLKDGRLLLAYNTVSRG--VLKVALSKDDGDSWHDALTLEENLAM----- 287
NP++ + L DGRLL N +G L + LS D G SW + LEE +
Sbjct: 311 PNPDAALSATALPDGRLLAVLNHQEQGRDSLSLMLSADGGHSWKELHRLEEMRVLRDKKL 370
Query: 288 -----------------------------------------------EFSYPAVIQASDG 300
EFSYP +IQA +G
Sbjct: 371 DETQCLHIVRGLLVNSETRLAKAPAATLDEYVDSAKARVRADGGCNFEFSYPYLIQARNG 430
Query: 301 SVHITYTYNRTQIKHVVL 318
H++YT+NR IKHV
Sbjct: 431 DFHLSYTWNRVFIKHVTF 448
>gi|332716574|ref|YP_004444040.1| hypothetical protein AGROH133_12331 [Agrobacterium sp. H13-3]
gi|325063259|gb|ADY66949.1| hypothetical protein AGROH133_12331 [Agrobacterium sp. H13-3]
Length = 398
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 142/369 (38%), Gaps = 79/369 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HAS + + G L A+FGG+ EG D+ I+ G RW +P +P
Sbjct: 28 SPMIQNHASFLHLLQDGTLLCAWFGGTLEGKSDISIFASALTPGASRWGAPQRLSNDPAH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL----------PP 129
NPV+F P L LF+ Q C R + + E+L P
Sbjct: 88 SEQNPVIFTAPDGTLWLFH-TAQPSGNQDECRIRMARVVLDPAIPEKLSTEDGRFLDLPK 146
Query: 130 GILGPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
G P+ + + + C + WN + + ++ D G +W+ +
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCLQRPGQKWNGSHDTAALGISKDNGLTWQ----LQ 199
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
S + P AN R V +ES DGG +W+ + T + N NS
Sbjct: 200 ELAGSTGCVHMSPVAGANGRYSAFFRRRQA-DFVYRTESTDGGRSWAEPQLTDVPNNNSS 258
Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
I ++L DGRL + N ++ R +
Sbjct: 259 IAAIRLADGRLAMICNPINAAQSSDRRASLYDELGEEDDRPDADPSGGCVPIWGVPRAPV 318
Query: 264 KVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYNRT 311
+ LS DDG S+ + +E+ +E SYP +++A+DG++H +YTY+R
Sbjct: 319 SICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLEAADGTLHASYTYHRR 378
Query: 312 QIKHVVLLP 320
IK+V L P
Sbjct: 379 AIKYVRLAP 387
>gi|365133288|ref|ZP_09342672.1| hypothetical protein HMPREF1032_00468 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363616098|gb|EHL67552.1| hypothetical protein HMPREF1032_00468 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 347
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 144/326 (44%), Gaps = 39/326 (11%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCE--GAP---DVKIWLQTFKDG--RWQSPIIADEEPNVP 80
H ++ D G+ L + GG + G P D K W+ + G W +P +
Sbjct: 22 HMPSLTLADNGNLLAVWNGGFLQWNGDPMGRDAKDWVSVLEPGAKEWSNPDAVGCDIRYC 81
Query: 81 MWNPVLFKLPSNGLLLFYK--IGQEVQKWSGC-------MKRSYNKGVTWSEREQLPPGI 131
+PV K ++L + + EV + C +++ + G TW + P GI
Sbjct: 82 CHDPVFLKNKKGEIILLFAKFLDTEVNFTTWCNGRDELWTRKTKDGGRTWEPAQ--PAGI 139
Query: 132 L-GPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLS 187
G + N +LL++G ++ +S E + + + + + D G +W K GPI ++
Sbjct: 140 QSGHASNDSVLLDDGTIVFASTSSELPDKYFGAVRIYLSHDDGETWEK-GPILSADDGNL 198
Query: 188 VIQPVPFHTANRTLRVLMRSF---NGIGRVCMS------ESCDGGLTWSYAKPTQLLNPN 238
+ +P + T+R+ R+ G G S ES DGG TW+ PT +LN
Sbjct: 199 IREPALCLRPDGTIRMFTRTCPGSTGWGAGVKSLVSYTAESRDGGKTWTQPVPTTILNNE 258
Query: 239 SGIDGVKLKDGRLLLAYNTV------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
S ID + +L+AYN R L +A SKD+G +W + + L + P
Sbjct: 259 SKIDVISWDKDTILMAYNDTPVADWHERSPLTLAYSKDEGKTWKNLIELAAAPGNKCQ-P 317
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
A+ + DG +++ Y T I+H+VL
Sbjct: 318 AMCRDRDGRLNVVYMNRHTAIEHLVL 343
>gi|424908423|ref|ZP_18331800.1| 4-hydroxythreonine-4-phosphate dehydrogenase [Rhizobium
leguminosarum bv. viciae USDA 2370]
gi|392844454|gb|EJA96976.1| 4-hydroxythreonine-4-phosphate dehydrogenase [Rhizobium
leguminosarum bv. viciae USDA 2370]
Length = 716
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 140/369 (37%), Gaps = 79/369 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
+P HAS + +D G L A+FGG+ EG D+ I+ GRW +P +
Sbjct: 28 SPMIQNHASFLHLLDDGVLLCAWFGGTLEGKSDISIFASVLTPGAGRWGAPQRLSHDTAR 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----------LPP 129
NPVLF P L LF+ Q C R SE E+ LP
Sbjct: 88 SEQNPVLFTAPDGALWLFH-TAQPSGNQDECRIRMAKIHRNRSEPEKLISEDGRFLDLPK 146
Query: 130 GILGPSKNKPILLENG---LLLCGSSV----ESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
G P+ + + LL V + WN + + ++ D G +W
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDDGETWTLQ---E 200
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+P +S+ + P + R V +ES DGG +WS PT + N NS
Sbjct: 201 LP-DSIGCVHMSPVAGTDGQHSAFFRRRQA-DFVYRTESKDGGRSWSRPHPTDVPNNNSS 258
Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
I +L GRL + N V+ R +
Sbjct: 259 IAVTRLGSGRLAMICNPVNAAQSSDRRASLYDELGEDDGRPDADPSGGCVPVWGVPRAPV 318
Query: 264 KVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRT 311
+ LS DDG S+ + +E + +E SYP +++ DG++H +YTY+R
Sbjct: 319 TICLSDDDGRSFPTRILIENGPGTCLSNDSTDGRNLEMSYPWLLETPDGALHASYTYHRR 378
Query: 312 QIKHVVLLP 320
IK+V L P
Sbjct: 379 AIKYVRLEP 387
>gi|308185695|ref|YP_003929826.1| sialidase [Pantoea vagans C9-1]
gi|308056205|gb|ADO08377.1| Sialidase precursor (Neuraminidase) [Pantoea vagans C9-1]
Length = 398
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)
Query: 12 AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
A++ +S P HA+ ++ + G L +FGG+ EG D+ +W G RW
Sbjct: 22 AQQIAMLPSSCPQN--HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
++P NPVLF P L L + + + + + ++S++ G +W E + L
Sbjct: 80 AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139
Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
PG + ++L NG + C + E W GS+ ++++ D G+SWR
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
+ +P+ + V V + +L L RS + S S D G +WS +P L
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
N NS I L++G L L +N +S
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDDEEDGDVAVAAEPVVHSGH 310
Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
R + +A+S D G SW L+E L EFSYP++ Q +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370
Query: 301 SVHITYTYNRTQIKHV 316
+HI YT+ R IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386
>gi|304396598|ref|ZP_07378479.1| conserved hypothetical protein [Pantoea sp. aB]
gi|304356107|gb|EFM20473.1| conserved hypothetical protein [Pantoea sp. aB]
Length = 398
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)
Query: 12 AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
A++ +S P HA+ ++ + G L +FGG+ EG D+ +W G RW
Sbjct: 22 AQQIAMLPSSCPQN--HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
++P NPVLF P L L + + + + + ++S++ G +W E + L
Sbjct: 80 AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139
Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
PG + ++L NG + C + E W GS+ ++++ D G+SWR
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
+ +P+ + V V + +L L RS + S S D G +WS +P L
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
N NS I L++G L L +N +S
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDDEEDGDVAVAAEPVVHSGR 310
Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
R + +A+S D G SW L+E L EFSYP++ Q +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370
Query: 301 SVHITYTYNRTQIKHV 316
+HI YT+ R IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386
>gi|408788538|ref|ZP_11200256.1| hypothetical protein C241_21817 [Rhizobium lupini HPC(L)]
gi|408485633|gb|EKJ93969.1| hypothetical protein C241_21817 [Rhizobium lupini HPC(L)]
Length = 398
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 138/369 (37%), Gaps = 79/369 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
+P HAS + +D G L A+FGG+ EG D+ I+ GRW +P +
Sbjct: 28 SPMIQNHASFLHLLDDGALLCAWFGGTLEGKSDISIFASVLTPGAGRWGAPQRLSHDTAH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----------LPP 129
NPV F P L LF+ Q C R SE E+ LP
Sbjct: 88 SEQNPVFFTAPDGALWLFH-TAQPSGNQDECRIRMAKIHRNRSEPEKLISEDGRFLDLPK 146
Query: 130 GILGPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
G P+ + + + C + WN + + ++ D G +W
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDDGETWTLQ---E 200
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+P +S+ + P + R V +ES DGG +WS PT + N NS
Sbjct: 201 LP-DSIGCVHMSPLAGTDGQHSAFFRRRQA-DFVYRAESKDGGRSWSRPHPTDVPNNNSS 258
Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
I +L GRL + N V+ R +
Sbjct: 259 IAVTRLGSGRLAMICNPVNAAQSSDRRASLYDELGEDDGRPDADPSGGCVPVWGVPRAPV 318
Query: 264 KVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYNRT 311
+ LS DDG S+ + +E+ +E SYP +++ DG++H +YTY+R
Sbjct: 319 TICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLETPDGALHASYTYHRR 378
Query: 312 QIKHVVLLP 320
IK+V L P
Sbjct: 379 AIKYVRLEP 387
>gi|152979613|ref|YP_001345242.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
[Actinobacillus succinogenes 130Z]
gi|150841336|gb|ABR75307.1| glycosyl hydrolase BNR repeat-containing protein [Actinobacillus
succinogenes 130Z]
Length = 395
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 81/362 (22%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
HA+ + + G L +FGG+ EG D+ + K G W + ++P NPV
Sbjct: 36 HAANLFPLPNGDLLCTWFGGTQEGISDISAYFSRLKKGSDTWTPAVKLSDDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGPSK----NKP 139
F P N L + Y + + ++ +S + G TW E L + P+K +P
Sbjct: 96 FFLDPDNVLWILYTAQISGNQDTAIVRYRKSTDFGETWGPIEVL---LEDPNKGVFIRQP 152
Query: 140 IL-LENG-----LLLCGSS-----VESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I+ L+NG + C + V S+++ S + ++ D G++WR + +PN + V
Sbjct: 153 IVVLDNGNWLLPVFYCIARPGEKWVGSYDT--SAVMISSDKGKTWRS---VDVPNSTGCV 207
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ + +L L RS + S S D G TWS K L N N+ I L +
Sbjct: 208 HMNI-LKLKDGSLYALYRS-RWADYIYESRSTDNGETWSEPKTLPLPNNNASIQADVLDN 265
Query: 249 GRLLLAYNTVS----------------------------------------RGVLKVALS 268
G + L +N S R + +A+S
Sbjct: 266 GDIALVFNNSSAKDAKERRLSLYDEIEDESKEQKKEAELVEGQRNAFWGAPRAPMSLAIS 325
Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
D+G +W L+ E L E SYP++ Q DG +HI YT+ R IK+V
Sbjct: 326 TDNGATWPYIRNLDEGDGYCMSNNSREQLNRELSYPSIKQGLDGKLHIAYTFYRMAIKYV 385
Query: 317 VL 318
+
Sbjct: 386 CV 387
>gi|187931351|ref|YP_001891335.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. mediasiatica FSC147]
gi|187712260|gb|ACD30557.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
mediasiatica FSC147]
Length = 372
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
N+ ++ +QP VP + L + + + I R+ +E+ D GL+WS KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMLSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285
Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GI +K+++G +LLAYN T SR L +A D+ W + T + E SYP
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 344
Query: 298 SDGSV 302
D +
Sbjct: 345 QDNII 349
>gi|440760708|ref|ZP_20939811.1| hypothetical protein F385_3741 [Pantoea agglomerans 299R]
gi|436425461|gb|ELP23195.1| hypothetical protein F385_3741 [Pantoea agglomerans 299R]
Length = 398
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)
Query: 12 AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
A++ +S P HA+ ++ + G L +FGG+ EG D+ +W G RW
Sbjct: 22 AQQIAMLPSSCP--QNHAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79
Query: 70 PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
++P NPVLF P L L + + + + + ++S++ G +W E + L
Sbjct: 80 AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139
Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
PG + ++L NG + C + E W GS+ ++++ D G+SWR
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
+ +P+ + V V + +L L RS + S S D G +WS +P L
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
N NS I L++G L L +N +S
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDEEENGDVAIAAEPMVHSGR 310
Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
R + +A+S D G SW L+E L EFSYP++ Q +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370
Query: 301 SVHITYTYNRTQIKHV 316
+HI YT+ R IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386
>gi|420243404|ref|ZP_14747334.1| putative neuraminidase (sialidase) [Rhizobium sp. CF080]
gi|398061196|gb|EJL52995.1| putative neuraminidase (sialidase) [Rhizobium sp. CF080]
Length = 397
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 139/372 (37%), Gaps = 85/372 (22%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
+P HAS + + G + A+FGGS EG D+ I+ G W P +
Sbjct: 28 SPMVQNHASFLHLLGDGALICAWFGGSLEGKSDISIFASVLVPGSDAWGPPQRLSFDDAH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
NPVLF P L LF+ Q C R + R++ P L
Sbjct: 88 SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAEIRRDRSDPARLTSEAGAY 141
Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
+ L G + V + WN + + ++ DAG++WR
Sbjct: 142 LDLPQGCFVRAPLVVRDDGAWLLPIFRCIQRPGQKWNGSHDTAAIGISTDAGKTWRLED- 200
Query: 179 IYIPNESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+P + V + P R + + R +ES DGG +WS + T + N
Sbjct: 201 --LPQSTGCVHMSPAALQAGGHAALFRRRQADFVFR---TESTDGGRSWSVPEATDVPNN 255
Query: 238 NSGIDGVKLKDGRLLLAYNTVS-------------------------------------R 260
NS I ++LKDGRL + N VS R
Sbjct: 256 NSSIAVIRLKDGRLAMICNPVSAAQSGDRRASLYDELGENDDRPDADPTGGCVPVWGVPR 315
Query: 261 GVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTY 308
+ V +S D G S+ + +E+ E SYP +++A DG++HI YTY
Sbjct: 316 APVCVCISDDGGLSFPVRILIEDGPGTCLSNDSTDGRNKEMSYPWLLEAPDGTLHIAYTY 375
Query: 309 NRTQIKHVVLLP 320
+R IK+V L P
Sbjct: 376 HRRAIKYVSLAP 387
>gi|62260730|gb|AAX77930.1| unknown protein [synthetic construct]
Length = 407
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 79 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 137
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 138 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 197
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 198 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 253
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
N+ ++ +QP VP + L + + + I R+ +E+ D GL+WS KPTQL NP+S
Sbjct: 254 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 311
Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GI +K+++G +LLAYN T SR L +A D+ W + T + E SYP
Sbjct: 312 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 370
Query: 298 SDGSV 302
D +
Sbjct: 371 QDNII 375
>gi|56707548|ref|YP_169444.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. tularensis SCHU S4]
gi|110670019|ref|YP_666576.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. tularensis FSC198]
gi|134302518|ref|YP_001122488.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. tularensis WY96-3418]
gi|254370071|ref|ZP_04986077.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis FSC033]
gi|254874366|ref|ZP_05247076.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis MA00-2987]
gi|379716810|ref|YP_005305146.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
tularensis TIGB03]
gi|379725414|ref|YP_005317600.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
tularensis TI0902]
gi|385794167|ref|YP_005830573.1| hypothetical protein NE061598_02225 [Francisella tularensis subsp.
tularensis NE061598]
gi|421752393|ref|ZP_16189421.1| hypothetical protein B345_08411 [Francisella tularensis subsp.
tularensis AS_713]
gi|421754259|ref|ZP_16191237.1| hypothetical protein B344_08353 [Francisella tularensis subsp.
tularensis 831]
gi|421755032|ref|ZP_16191986.1| hypothetical protein B343_02241 [Francisella tularensis subsp.
tularensis 80700075]
gi|421757985|ref|ZP_16194850.1| hypothetical protein B342_08454 [Francisella tularensis subsp.
tularensis 80700103]
gi|421759820|ref|ZP_16196647.1| hypothetical protein B341_08391 [Francisella tularensis subsp.
tularensis 70102010]
gi|424675141|ref|ZP_18112053.1| hypothetical protein B229_08357 [Francisella tularensis subsp.
tularensis 70001275]
gi|56604040|emb|CAG45032.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis SCHU S4]
gi|110320352|emb|CAL08415.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis FSC198]
gi|134050295|gb|ABO47366.1| hypothetical protein FTW_1675 [Francisella tularensis subsp.
tularensis WY96-3418]
gi|151568315|gb|EDN33969.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis FSC033]
gi|254840365|gb|EET18801.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
tularensis MA00-2987]
gi|282158702|gb|ADA78093.1| hypothetical protein NE061598_02225 [Francisella tularensis subsp.
tularensis NE061598]
gi|377826863|gb|AFB80111.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
tularensis TI0902]
gi|377828487|gb|AFB78566.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
tularensis TIGB03]
gi|409085109|gb|EKM85261.1| hypothetical protein B344_08353 [Francisella tularensis subsp.
tularensis 831]
gi|409085376|gb|EKM85520.1| hypothetical protein B345_08411 [Francisella tularensis subsp.
tularensis AS_713]
gi|409089120|gb|EKM89173.1| hypothetical protein B343_02241 [Francisella tularensis subsp.
tularensis 80700075]
gi|409090199|gb|EKM90222.1| hypothetical protein B341_08391 [Francisella tularensis subsp.
tularensis 70102010]
gi|409091370|gb|EKM91370.1| hypothetical protein B342_08454 [Francisella tularensis subsp.
tularensis 80700103]
gi|417434396|gb|EKT89355.1| hypothetical protein B229_08357 [Francisella tularensis subsp.
tularensis 70001275]
Length = 372
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
N+ ++ +QP VP + L + + + I R+ +E+ D GL+WS KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285
Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
GI +K+++G +LLAYN T SR L +A D+ W + T + E SYP
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 344
Query: 298 SDGSV 302
D +
Sbjct: 345 QDNII 349
>gi|32473537|ref|NP_866531.1| hypothetical protein RB5143 [Rhodopirellula baltica SH 1]
gi|32398217|emb|CAD78312.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 479
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 151/343 (44%), Gaps = 49/343 (14%)
Query: 10 LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
L A++ TF P P ++ HA T + V +G L A Y G + +
Sbjct: 122 LQAQDSTFLAPPKYIGPPQTIHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 181
Query: 57 IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
+ L T DG W+ ++ D + + P+ ++P L+ P L L + Q V
Sbjct: 182 VVLSTSGDGGETWEEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 237
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
GV + E E P + P + + +G+++C V S W +W E
Sbjct: 238 ----AGVWFLEIED--PDMGQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 291
Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
V+ D G++W + G +P + + + + ++++ +L R+ GIG S S
Sbjct: 292 ARMIVSDDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDQSIWLLARTKYGIGE---SVST 348
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
D G TWS P+ + +P++ +L G LLL + R L +S D G +W
Sbjct: 349 DRGKTWSELAPSTIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDHGRNW 408
Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L L+E + SYP Q +DG +HI Y YNRT+ +H++L
Sbjct: 409 KGGLMLDERAGV--SYPDGQQDADGVIHIIYDYNRTKDRHILL 449
>gi|417349595|ref|ZP_12128223.1| hypothetical protein SeGA_2676 [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
gi|353572465|gb|EHC36105.1| hypothetical protein SeGA_2676 [Salmonella enterica subsp. enterica
serovar Gaminara str. A4-567]
Length = 144
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 59/115 (51%), Gaps = 1/115 (0%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + F CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS+
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSK 117
>gi|254368714|ref|ZP_04984727.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
holarctica FSC022]
gi|157121635|gb|EDO65805.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
holarctica FSC022]
Length = 372
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 130/304 (42%), Gaps = 27/304 (8%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E AP+ KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKYIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
N+ ++ +QP + L R + I R+ +E+ D GL+WS KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDSG 286
Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
I +K+++G +LLAYN T SR L +A D+ W + T + E SYP
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPYQ 345
Query: 299 DGSV 302
D +
Sbjct: 346 DNII 349
>gi|417302934|ref|ZP_12090011.1| hypothetical protein RBWH47_00392 [Rhodopirellula baltica WH47]
gi|327540766|gb|EGF27333.1| hypothetical protein RBWH47_00392 [Rhodopirellula baltica WH47]
Length = 511
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 151/343 (44%), Gaps = 49/343 (14%)
Query: 10 LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
L A++++F P P ++ HA T + V +G L A Y G + +
Sbjct: 154 LQAQDYSFLASPKYIGPPQTVHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPSEDANNY 213
Query: 57 IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
+ L T DG W ++ D + + P+ ++P L+ P L L + Q V
Sbjct: 214 VVLSTSGDGGETWNEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 269
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
GV + E E P + P + + +G+++C V S W +W E
Sbjct: 270 ----AGVWFLEIED--PDVEQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 323
Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
V+ D G++W + G +P + + + + ++++ +L R+ GIG S S
Sbjct: 324 ARMIVSDDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDQSIWLLARTKYGIGE---SVST 380
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
D G TWS P+ + +P++ +L G LLL + R L +S D G +W
Sbjct: 381 DRGKTWSELAPSTIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDHGKNW 440
Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L L+E + SYP Q +DG +HI Y YNRT+ +H++L
Sbjct: 441 KGGLMLDERAGV--SYPDGQQDTDGVIHIIYDYNRTKDRHILL 481
>gi|67458777|ref|YP_246401.1| hypothetical protein RF_0385 [Rickettsia felis URRWXCal2]
gi|67004310|gb|AAY61236.1| unknown [Rickettsia felis URRWXCal2]
Length = 142
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 21/127 (16%)
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALS 268
+ S S + G+ WS +P LLNP+S ID + L D LLL YN V SR +L VA S
Sbjct: 16 IHRSISINNGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLTYNRVINNRKSRNILSVATS 75
Query: 269 KDDGDSWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKH 315
D+G +WH +T+ + L+ E+SYP +I + D + +H+ YT+NR +KH
Sbjct: 76 YDEGLNWH-PITIRNSIYPEGDIEYSNILSEEYSYPTIIMSPDNNEIHVIYTFNRINLKH 134
Query: 316 VV--LLP 320
V LLP
Sbjct: 135 KVFQLLP 141
>gi|421613255|ref|ZP_16054341.1| hypothetical protein RBSH_04149 [Rhodopirellula baltica SH28]
gi|408495849|gb|EKK00422.1| hypothetical protein RBSH_04149 [Rhodopirellula baltica SH28]
Length = 376
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 151/336 (44%), Gaps = 35/336 (10%)
Query: 10 LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
L A++ +F P P ++ HA T + V +G L A Y G + +
Sbjct: 19 LQAQDSSFLAPPKYVGPPQTVHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 78
Query: 57 IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFY-----KIGQEVQKW 107
+ L T +G W+ ++ D + N P+ ++P L+ P L L + +G W
Sbjct: 79 VVLSTSGNGGETWEEVLVVDPDENGPVRAYDPELWIAPDGKLRLIWAQAVGHLGTIAGVW 138
Query: 108 SGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
++ + + E +++ GI+ KP++L G +S + M V+V
Sbjct: 139 FLEIEDPDVEQPVYGEAKRVTDGIM---MCKPLVLSTGEWALPASTWRETDESARMIVSV 195
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W + G +P + + + + + ++ +L R+ GIG S S D TW
Sbjct: 196 DQGKTWAQRGGCNVPKDVRAFDEHIFVERNDGSIWLLARTKYGIGE---SVSTDRRKTWP 252
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSWHDALTLE 282
P+++ +P++ +L G LLL + R L +S DDG++W L L+
Sbjct: 253 ELTPSKIAHPSARFFIRRLASGNLLLVKHGPIEKKTGRSHLMAFVSNDDGETWKGGLMLD 312
Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
E + SYP Q ++G +HI Y YNRT+ +H++
Sbjct: 313 ERAGV--SYPDGQQDTNGVIHIVYDYNRTKDRHILF 346
>gi|340029880|ref|ZP_08665943.1| hypothetical protein PaTRP_14293 [Paracoccus sp. TRP]
Length = 395
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 90/367 (24%), Positives = 140/367 (38%), Gaps = 78/367 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
+P HA+ + D G + A+FGG+ EG D+ I G W +P+
Sbjct: 28 SPMVQNHAAFLSLADDGALICAWFGGTLEGKSDISIHASVLLPGADAWGPAQRLSHDPDH 87
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSERE----QLPPGIL 132
NPV+F P+ GL LF+ Q C M + GV + E LP G
Sbjct: 88 SEQNPVIFAAPNGGLWLFH-TSQPSGNQDQCRIRMAALHRNGVAITAEEGRFLDLPRGCF 146
Query: 133 GPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIYIPN 183
P+ L + C + WN + M V+ D G+SWR +P+
Sbjct: 147 ---VRAPVRLRGDGAWLLPIFRCVQRPGQKWNGSHDTAAMGVSTDGGQSWRLED---VPD 200
Query: 184 ESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
SV + PV R + + R S+S +GG +W+ T + N NS I
Sbjct: 201 SVGSVHMSPVDLGEGRMAAFYRRRQSDFVHR---SDSANGGHSWAAPVATDVPNNNSSIA 257
Query: 243 GVKLKDGRLLLAYNTVS-------------------------------------RGVLKV 265
V L+DGR+ + N V+ R + +
Sbjct: 258 AVTLRDGRIAMICNPVNAAMSPDRRTSLYDELGEDHDRPDADPTGGCVPVWGVPRAPVAI 317
Query: 266 ALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQI 313
+S D G ++ +E+ E SYP +++ DG++H+ YTY+R I
Sbjct: 318 CVSSDGGQTFPRRYLVEDGPGTCTSNDSTDGRNKEMSYPWLLEGPDGTLHLAYTYHRRAI 377
Query: 314 KHVVLLP 320
K++ L P
Sbjct: 378 KYLQLQP 384
>gi|398806469|ref|ZP_10565376.1| hypothetical protein PMI15_04242 [Polaromonas sp. CF318]
gi|398088078|gb|EJL78649.1| hypothetical protein PMI15_04242 [Polaromonas sp. CF318]
Length = 372
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 143/361 (39%), Gaps = 76/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG---RW--QSPIIADEEPNVPMW 82
HA+++VE+ G +F GS EGA DV I D RW +S + + +W
Sbjct: 2 HAASLVELRDGRLRAVWFSGSREGAGDVTI-QSAVTDATGLRWGAESTLFDRQALQQGLW 60
Query: 83 -------NPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPG-- 130
NPV+ + P +G L + + + W+G RS ++G +WS +L
Sbjct: 61 RYVKKIGNPVIARAP-DGSLNLWMVNVSLGGWAGSSITWARSTDEGASWSAPRRLVTSPF 119
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I K P++ ENG E +G + ++ + + IP S
Sbjct: 120 LNISTLVKGTPVIFENGETGLPVYHEFVAKFGEVLRISPQG----QVVDKVRIPGSRTS- 174
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+QPV + + +V MRS N + S + D G TW A + NP+S + G
Sbjct: 175 LQPVVLVSGPQKAQVYMRSGNATA-LMASSTTDAGKTWLPAHASAWPNPDSALAGTMTNS 233
Query: 249 GRLLLAYNTVSRGVLKVAL------------------------SKDDGDSWHDALTLE-- 282
G LA N R +AL ++ + LT E
Sbjct: 234 GTQWLALNPAPRNRETLALLQTATGGSFDGVTPWIVESSANPQARTSMGDYERLLTQELK 293
Query: 283 -----------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
++ + EFSYP ++Q+ DG +H+ YT++RT+IKHV L
Sbjct: 294 ARGASDGEAQAYVASARRQLCGGQSCSQEFSYPYLLQSRDGYLHLLYTWHRTRIKHVRLD 353
Query: 320 P 320
P
Sbjct: 354 P 354
>gi|67901886|ref|XP_681199.1| hypothetical protein AN7930.2 [Aspergillus nidulans FGSC A4]
gi|40740394|gb|EAA59584.1| hypothetical protein AN7930.2 [Aspergillus nidulans FGSC A4]
gi|259480678|tpe|CBF73543.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 333
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 132/336 (39%), Gaps = 72/336 (21%)
Query: 50 EGAPDVKIWLQT--FKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKW 107
EG PD+ I+ F + W I + NPV+F+ PS L L Y Q +
Sbjct: 2 EGKPDISIYFSRLPFGNRTWSEAIKVTHDETRSEQNPVMFQHPSGELWLLYTSQQGGNQD 61
Query: 108 SGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLCGS---SVESWNSW- 159
S +KR S++ G WS+ + PG + I+L+NG+ + + VE W
Sbjct: 62 SAVVKRTISFDDGNAWSDPTVIFDEPGTF--IRQPVIILKNGIFVIPTFKCRVELGAKWI 119
Query: 160 ----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
S + + D G++W + PI ES + L RS
Sbjct: 120 GNDDISAIRTSDDQGQTWSEI-PI---AESTGCVHMEIQRLKTGEYLALYRS--RWADFI 173
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------------- 259
S + GL W+ KPT L NPN+GI L GR+++ YN S
Sbjct: 174 YSSTSPDGLAWTAPKPTSLPNPNAGICFDVLPSGRVVVVYNHSSRKNALGRREGLYDEII 233
Query: 260 -----------------------RGVLKVALSKDDGDSWH-------DALTL----EENL 285
R L VA S D+G +W D L E+ L
Sbjct: 234 EGKDSRPNQKPRPDGKEAFWGAPRAPLSVAWSDDEGRTWQHRTLEEGDGYCLTNNSEQKL 293
Query: 286 AMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
E SYP+++ DGS+HI +T+ R IK+V + N
Sbjct: 294 NRELSYPSMVVGEDGSIHIAFTFWRQTIKYVQIGEN 329
>gi|326430587|gb|EGD76157.1| hypothetical protein PTSG_00864 [Salpingoeca sp. ATCC 50818]
Length = 376
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 147/342 (42%), Gaps = 39/342 (11%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGS-CEGAPDVKIWLQTFKDG--RW 67
V+ + F N++ S H S I + G + A+ G S EG D I+ + KDG W
Sbjct: 39 VSHHWVFQTNASIVYS-HMSMIEALPDGTLVAAFQGSSLSEGCNDQAIFFTSSKDGGVTW 97
Query: 68 QSPIIADEEPNVPMWNPVL-FKLPSNGLLLFYKIGQEVQKWS----GCMKRSY------- 115
+P + + + +W PVL F +N L LF+ + Q G KRSY
Sbjct: 98 -APAVVAQNDTLALWGPVLRFDKATNKLWLFFSASTKYQNRDKPGCGSFKRSYPGGEVRY 156
Query: 116 ----NKGVTWSEREQLPP-----GILGPSKNKPILLENGLLLCGSSVESWNSW-----GS 161
+ G TW+ + P + + N I+ G + E + S
Sbjct: 157 VTSTDGGATWTNPVVILPFEARGQVSKVTANHMIITATGRWMLPFWQEPRTVYDKGLSAS 216
Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
+ V+ D G +W G I+ N S +I+ + +L L R+ G + S S D
Sbjct: 217 GVLVSDDNGTTWTTRGSIH--NNSTWLIENTLHQFQDSSLIQLFRT--QAGYIYESWSTD 272
Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYN--TVSRGVLKVALSKDDGDSWHDA 278
G TW+ A P+ LLNPNS + + +G +LAYN +R L +A S G +W
Sbjct: 273 NGNTWTTAMPSSLLNPNSKVCLTRGSVNGSAILAYNPSKTARTPLVLAQSSLTGHTWTTT 332
Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
+L+ + M F YP + + G V TY+ N ++ + + P
Sbjct: 333 ASLDNDPNMSFEYPTPLLHA-GRVFTTYSANSSKGIKLAISP 373
>gi|381405652|ref|ZP_09930336.1| sialidase [Pantoea sp. Sc1]
gi|380738851|gb|EIB99914.1| sialidase [Pantoea sp. Sc1]
Length = 398
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 146/360 (40%), Gaps = 76/360 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ ++ + G L +FGG+ EG D+ +W + G RW E+P NPV
Sbjct: 36 HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLQKGSDRWSEAQKLSEDPTRSEQNPV 95
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
LF P L L + + + + + ++S++ G +W + L PG + ++
Sbjct: 96 LFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDLGRSWGAIDTLLDKPGTF--IRQPIVV 153
Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
L NG + C + E W N S ++++ D GRSWR + +P+ + V V
Sbjct: 154 LPNGNWLLPVFYCLTPPGEKWVGNYDVSAVKISEDKGRSWRD---VSVPDSTGCVHMNVT 210
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ +L L RS + S S D G +WS + L N NS I L++G L L
Sbjct: 211 L-LEDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEALSLPNNNSSIQVTTLRNGHLAL 268
Query: 254 AYNTVS-------------------------------------------RGVLKVALSKD 270
+N +S R + +A+S D
Sbjct: 269 VFNAMSAEGASERRLSLYDEIEDDEEDGEVAVAAEAVVHSGRTAFWGAPRAPMTLAISTD 328
Query: 271 DGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
G SW L+E L EFSYP++ + G +HI YT+ R IK+V +
Sbjct: 329 GGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKEGLHGELHIAYTWYRQAIKYVRI 388
>gi|417511439|ref|ZP_12176059.1| hypothetical protein LTSESEN_3058 [Salmonella enterica subsp.
enterica serovar Senftenberg str. A4-543]
gi|353643034|gb|EHC87321.1| hypothetical protein LTSESEN_3058 [Salmonella enterica subsp.
enterica serovar Senftenberg str. A4-543]
Length = 138
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 7/136 (5%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS------ 122
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122
Query: 123 EREQLPPGILGPSKNK 138
+ LP G + S+ K
Sbjct: 123 NDDILPRGPVARSRKK 138
>gi|399039745|ref|ZP_10735254.1| putative neuraminidase (sialidase) [Rhizobium sp. CF122]
gi|398062158|gb|EJL53939.1| putative neuraminidase (sialidase) [Rhizobium sp. CF122]
Length = 395
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 142/363 (39%), Gaps = 74/363 (20%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G W P ++
Sbjct: 35 SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGATHWSQPEKMSDDSAK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
NP++F P + L Y Q G + + S + G T+ L I G
Sbjct: 95 SEQNPLIFNAPDGNVWLLY-TSQTSGNQDGSVVKCRISEDGGKTFGVVRVLC-DIPGTFV 152
Query: 137 NKPILLENG------LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+PI++ + C G E W + + + ++ D G+SW +P+ +
Sbjct: 153 RQPIIVNKTGSWLLPVFRCIGLPGERWTGDVDTAGVLISRDQGKSWAM---AEVPDSIGA 209
Query: 188 V-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
V + VP N R V S S DGG WS +P L N NS I KL
Sbjct: 210 VHMNIVPIDDDNLVAFYRNRFAES---VLYSRSSDGGEHWSAPRPIGLPNNNSSIQATKL 266
Query: 247 KDGRLLLAYNT---------------------------------------VSRGVLKVAL 267
++G + + YN V R L +A
Sbjct: 267 ENGMIAMVYNHSNALTSDARRQSLYDEIEGDEETKVAPAPTVERRKAVWGVPRAPLSLAF 326
Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
S+D G S+ + L+ +++ EFSYP++++ +DG+VH+ YTY R IK+
Sbjct: 327 SRDGGASFPRRIDLDTGDGYCLSNNSKDSVNREFSYPSIVEGADGAVHVAYTYYRRAIKY 386
Query: 316 VVL 318
V L
Sbjct: 387 VRL 389
>gi|89255875|ref|YP_513237.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. holarctica LVS]
gi|115314364|ref|YP_763087.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. holarctica OSU18]
gi|156501859|ref|YP_001427924.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
subsp. holarctica FTNF002-00]
gi|254367239|ref|ZP_04983267.1| bNR/Asp-box repeat protein [Francisella tularensis subsp.
holarctica 257]
gi|290953330|ref|ZP_06557951.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
subsp. holarctica URFT1]
gi|422938340|ref|YP_007011487.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. holarctica FSC200]
gi|423050201|ref|YP_007008635.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
subsp. holarctica F92]
gi|89143706|emb|CAJ78905.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
holarctica LVS]
gi|115129263|gb|ABI82450.1| possible neuraminidase [Francisella tularensis subsp. holarctica
OSU18]
gi|134253057|gb|EBA52151.1| bNR/Asp-box repeat protein [Francisella tularensis subsp.
holarctica 257]
gi|156252462|gb|ABU60968.1| putative neuraminidase with BNR/Asp-box repeat [Francisella
tularensis subsp. holarctica FTNF002-00]
gi|407293491|gb|AFT92397.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
subsp. holarctica FSC200]
gi|421950923|gb|AFX70172.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
subsp. holarctica F92]
Length = 372
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 129/304 (42%), Gaps = 27/304 (8%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
AN FK HAS++ +D F + ++ E A + KI + + G+W P++
Sbjct: 53 ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETATNTKIVVAVAEKVAGKWHFNEIKPVM 111
Query: 73 ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
+E + NP+++ L+F S + S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKYIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L IL S + I L+N E N G W D + +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
N+ ++ +QP + L R + I R+ +E+ D GL+WS KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDSG 286
Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
I +K+++G +LLAYN T SR L +A D+ W + T + E SYP
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPYQ 345
Query: 299 DGSV 302
D +
Sbjct: 346 DNII 349
>gi|384220758|ref|YP_005611924.1| hypothetical protein BJ6T_70870 [Bradyrhizobium japonicum USDA 6]
gi|354959657|dbj|BAL12336.1| hypothetical protein BJ6T_70870 [Bradyrhizobium japonicum USDA 6]
Length = 401
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 71/361 (19%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ ++ + G +FGG+ EG D+ ++ G +W ++
Sbjct: 36 SPCVQNHAANLMPLANGDLACVWFGGTQEGMSDISVYFSRLARGATQWSPAEKLSDDSGR 95
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSK- 136
NPVLF P L L + + + ++R S + G +W E L P G
Sbjct: 96 SEQNPVLFPAPDGTLWLMWTAQLAGNQDTALIRRRLSRDNGKSWGPIETLFPEQRGRGTF 155
Query: 137 -NKPIL-LENG-----LLLCGSSV-ESWN--SWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
+PI+ L+NG + C S+ WN S ++++ D+G +WR + +P S
Sbjct: 156 IRQPIVALDNGDWLLPVFYCHSTPGRKWNGDEDTSAVKISSDSGATWRD---VDVPG-SR 211
Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
+ + +L L RS + S S D G TWS T L N NS I +L
Sbjct: 212 GCVHMCIARLDDSSLIALYRS-RWADTIYQSRSTDHGRTWSAPVQTVLPNNNSSIQVTRL 270
Query: 247 KDGRLLLAYNTVS----------------------------------------RGVLKVA 266
+G L L +N S R + +A
Sbjct: 271 GNGHLALVFNDSSARDATGRRLSLYDEIEDDLPPVLAGDGKRTDERTAFWGAPRAPMTLA 330
Query: 267 LSKDDGDSWH-------DALTLEEN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
+S D G +W D + N E SYP+V + +DG++H+ +T++R IK+
Sbjct: 331 VSADGGRTWSKRNVETGDGYCMTNNSRDQTNRELSYPSVKETADGAIHMAFTFHRRAIKY 390
Query: 316 V 316
V
Sbjct: 391 V 391
>gi|452747977|ref|ZP_21947766.1| hypothetical protein B381_09501 [Pseudomonas stutzeri NF13]
gi|452008126|gb|EME00370.1| hypothetical protein B381_09501 [Pseudomonas stutzeri NF13]
Length = 387
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 140/363 (38%), Gaps = 82/363 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
HAS++ + G + +F GS EGA DV++ W + +S A +
Sbjct: 23 VHASSVTSLPDGGLMAVWFAGSREGAADVQVRGARFDAIAAEWGEELVLATRESTQKATQ 82
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP--- 129
+ + NPV+ P N L LFY + V W+G S + G TWS QL
Sbjct: 83 KHIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSAINAMYSDDLGQTWSPPRQLITTPF 141
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNES 185
I +N P+ ++G + E + ++ ++ + G+S G N+S
Sbjct: 142 LNISTLVRNAPVFHQDGTIGLPVYHEFLGKFAEYLYISPNGEVIGKSRISKG-----NDS 196
Query: 186 LS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
L + P+ A LR S + RV S + D G TWS P NPNS + V
Sbjct: 197 LQPTVVPMDERHAVAMLRYAGASHH---RVLASRTADAGRTWSEPYPIDPANPNSALAAV 253
Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE------------------- 283
+ LL+A N + G ++ L + D D W + L+E
Sbjct: 254 ATPNHGLLVALNDLEDGRFRLRLMETDSSLDLWKPVIDLDESPDPEGNPFTPEVYREIIG 313
Query: 284 --------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQI 313
NL E+ YP I++SDG H+ Y++N T I
Sbjct: 314 DKFRLSSGPRRLSLVDEFLTNLDQRVCKTHGCDFEYEYPYFIRSSDGLYHLVYSWNNTFI 373
Query: 314 KHV 316
KHV
Sbjct: 374 KHV 376
>gi|417383912|ref|ZP_12149463.1| hypothetical protein LTSEJOH_3062 [Salmonella enterica subsp.
enterica serovar Johannesburg str. S5-703]
gi|353610535|gb|EHC63463.1| hypothetical protein LTSEJOH_3062 [Salmonella enterica subsp.
enterica serovar Johannesburg str. S5-703]
Length = 133
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 62/131 (47%), Gaps = 1/131 (0%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS L
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122
Query: 129 PGILGPSKNKP 139
+ P + P
Sbjct: 123 NDDILPRRAVP 133
>gi|218508465|ref|ZP_03506343.1| putative glycosyl hydrolase protein [Rhizobium etli Brasil 5]
Length = 340
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 138/349 (39%), Gaps = 86/349 (24%)
Query: 44 YFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIG 101
+FGG+ EG D+ I++ G RW P ++P NP++F P L Y
Sbjct: 2 WFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEKSEQNPLIFNAPDGKTWLLY-TS 60
Query: 102 QEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGPSKNKPILLENG-----LLLC-G 150
Q G + + S + G T+ + L PG + + ++ + G + C G
Sbjct: 61 QTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF--VRQQIVVNDRGDWLLPIFRCVG 118
Query: 151 SSVESWNSWGSWMEVTV--DAGRSWR------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
+ W+ V + D G SW+ G +++ L V F+ NR
Sbjct: 119 LEGQRWSGDADTAAVLISRDGGGSWQMRDIPDSIGAVHMNILPLGGGDMVAFYR-NRFAE 177
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------ 256
++ S S DGG TWS +PT+L N NS I L +G + + YN
Sbjct: 178 TILSS----------RSADGGETWSPPEPTELPNNNSSIQATVLDNGGIAMVYNHSNATT 227
Query: 257 ---------------------------------TVSRGVLKVALSKDDGDSWHDALTLE- 282
V R L +A+S+D G ++ + L+
Sbjct: 228 SDARRQSLYDEIEGDEAGENAAVVADAGRKAVWGVPRAPLSLAISRDGGTTFPHRIDLDT 287
Query: 283 -----------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
++L EFSYP+VI+ SDG++H+ TY R IK+V L P
Sbjct: 288 GDGFCLSNNSKDSLNREFSYPSVIEGSDGTLHVACTYYRRAIKYVRLAP 336
>gi|238482243|ref|XP_002372360.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220700410|gb|EED56748.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 332
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 90/331 (27%), Positives = 135/331 (40%), Gaps = 69/331 (20%)
Query: 50 EGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKW 107
EG PD+ I+L + G W I + NPVLF+ P+ L L Y +
Sbjct: 2 EGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPVLFRTPTGDLWLLYTSQHAGNQD 61
Query: 108 SGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLENG---LLLCGSSVESWNSW--- 159
S +KR +K G+TW + E L P + I+L++G + + VE W
Sbjct: 62 SAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLDDGAWVIPVFKCRVEPGERWLGN 121
Query: 160 --GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
S + V+ D G++W + IP ES + + + L RS + ++
Sbjct: 122 NDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQRLKDCSYLGLFRS-RWADHIYLA 176
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
S DG L+WS +PT L NPN+GI L GR+++ YN S
Sbjct: 177 TSPDG-LSWSPPQPTILPNPNAGICFDVLPSGRVVVVYNHSSKLDATGRRQGLYDDIADG 235
Query: 260 ---------------------RGVLKVALSKDDGDSWHDAL-----------TLEENLAM 287
R L VA S D G +W + E+ L
Sbjct: 236 VDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWERRVLEDGDGYCMTNNSEKKLNR 295
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
E SYP+++ +G +HI YT+ R +IK+V L
Sbjct: 296 ELSYPSMV-LDEGKIHIAYTFWRQRIKYVQL 325
>gi|417531554|ref|ZP_12186236.1| hypothetical protein LTSEURB_2816 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
gi|353664664|gb|EHD03026.1| hypothetical protein LTSEURB_2816 [Salmonella enterica subsp.
enterica serovar Urbana str. R8-2977]
Length = 137
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 57/114 (50%), Gaps = 1/114 (0%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
LV + P + CHAST+V + G + A+F G EG+ D IWL ++ W
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63
Query: 69 SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
+P WNPVLF PS+ L LFYK+G +V W S ++G TWS
Sbjct: 64 TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWS 116
>gi|392421090|ref|YP_006457694.1| hypothetical protein A458_10170 [Pseudomonas stutzeri CCUG 29243]
gi|390983278|gb|AFM33271.1| hypothetical protein A458_10170 [Pseudomonas stutzeri CCUG 29243]
Length = 387
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 140/363 (38%), Gaps = 82/363 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
HAS++ + G + +F GS EGA DV++ W + +S A +
Sbjct: 23 VHASSVTSLPDGGLMAVWFAGSREGAADVQVRGARFDAIAAEWGEELVLATRESTQKATQ 82
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP--- 129
+ + NPV+ P N L LFY + + W+G S + G TWS QL
Sbjct: 83 KHIRKLGNPVIALAPDNRLWLFY-VSVSIGGWAGSAINAMYSDDLGETWSTPRQLITTPF 141
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVD---AGRSWRKYGPIYIPNES 185
I +N P+ ++G + E + ++ ++ + G+S G N+S
Sbjct: 142 LNISTLVRNAPVFHQDGTIGLPVYHEFLGKFAEYLYISPNGEVTGKSRISKG-----NDS 196
Query: 186 LS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
L + P+ A LR S + RV S + D G TWS P NPNS + V
Sbjct: 197 LQPTVVPMDEQHAVAMLRYAGASHH---RVLASRTADAGRTWSKPYPIDPANPNSALAAV 253
Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE------------------- 283
+ LL+A N + G ++ L + D D W + L+E
Sbjct: 254 ATPNHGLLVALNDLEDGRFRLRLMETDSSLDLWKPVIDLDESPDPEGNPFTPEVYREIIG 313
Query: 284 --------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQI 313
NL E+ YP I+++DG H+ Y++N T I
Sbjct: 314 DKFRLSSGPRRLSLVDEFLTNLDKRVCKTHGCDFEYEYPYFIRSADGLYHLVYSWNNTFI 373
Query: 314 KHV 316
KHV
Sbjct: 374 KHV 376
>gi|357638395|ref|ZP_09136268.1| BNR/Asp-box repeat protein [Streptococcus urinalis 2285-97]
gi|418416853|ref|ZP_12990052.1| hypothetical protein HMPREF9318_00800 [Streptococcus urinalis
FB127-CNA-2]
gi|357586849|gb|EHJ56257.1| BNR/Asp-box repeat protein [Streptococcus urinalis 2285-97]
gi|410874671|gb|EKS22602.1| hypothetical protein HMPREF9318_00800 [Streptococcus urinalis
FB127-CNA-2]
Length = 310
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 130/292 (44%), Gaps = 51/292 (17%)
Query: 67 WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY------KIGQEVQKWSGCM--KRSYNKG 118
W I + N NP F+ P + L Y + G++ +++ + ++S ++G
Sbjct: 2 WSEAEIVSKGENRSEQNPAFFRSPDGEIWLIYTSQLSRQEGKDNMQFTSIIMVQKSSDEG 61
Query: 119 VTWSEREQLPPGILGPSKNKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSW 173
VTW + E L P S+ +L NG +C S E + + V+ D G++W
Sbjct: 62 VTWGDPEVLFPEEGTFSRQAIQILSNGRWIFATWICEDSPEGLTNDPTEFRVSDDTGQTW 121
Query: 174 RKYGPIYIPNES----LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
+ + +P+ + ++I+ P H L MRS V +SES D G +WS
Sbjct: 122 KI---VRMPDSAGRVHANLIEVEPGH-----LVAFMRS-RFADNVYISESFDYGDSWSVP 172
Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW 275
+ T L N N+ I +KL+ G + LAYN + R + V++++D G +W
Sbjct: 173 EKTVLSNNNASISAIKLQTGEIALAYNDNAIENPEFGKVAWPGLRNPVAVSITEDFGKTW 232
Query: 276 HDALTLE--ENLA--------MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
E EN ++ YP + Q+ +G +H+ Y Y NR IK+V
Sbjct: 233 PIGRVFEQAENFIGAENRTNNKQYEYPTLYQSQNGDLHLVYAYRNRLCIKYV 284
>gi|320107998|ref|YP_004183588.1| hypothetical protein AciPR4_2821 [Terriglobus saanensis SP1PR4]
gi|319926519|gb|ADV83594.1| hypothetical protein AciPR4_2821 [Terriglobus saanensis SP1PR4]
Length = 412
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/348 (25%), Positives = 139/348 (39%), Gaps = 63/348 (18%)
Query: 24 FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
F S HA+ ++ + G L F G+ EG +V I + G W+ ++ D E
Sbjct: 57 FSSSHAANLLLLRNGDILCFSFSGTREGDSNVAIVVSRLPKGSQTWEPAVLVDREEGKSY 116
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKW---SGCMKRSYNKGV-TWSEREQLPPGILGPSKN 137
NPV + ++G + Y Q K S +K + G TWS+ E + +
Sbjct: 117 QNPVPIE-DAHGRIWLYNTSQSANKGQADSQVLKVYSDDGAKTWSKPEVVFDKAGSYLRQ 175
Query: 138 KPILLENGLLL-------CGSSVESWNSWGSWMEVTVDAGRSWRK---YGPIYIPNESLS 187
++ ENG LL ++ S ++V+ DAG++WR+ G + S+
Sbjct: 176 PTVMGENGELLLPIYYSTSSGITNGADTNYSVVDVSKDAGKTWRECRVTGSNGLVQMSIV 235
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + R+ R + I R + C+ W+ T L N N+ I LK
Sbjct: 236 KLRPKSYVAFFRS-----RYADFIYRSTSTNGCE----WTAPTKTVLPNNNASIQATLLK 286
Query: 248 DGRLLLAYN----------------TVSRGVLKVALSKDDGDSWHDALTLEENLAM---- 287
DG + +A++ T R L VALS D G +W LE A
Sbjct: 287 DGHIAIAFDNTQGKSPEKLAAHEPQTGPRAPLSVALSSDAGLTWTSVRDLEIPDAAVRAS 346
Query: 288 -----------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
E+SYP+++Q G + + YTY R IK V+
Sbjct: 347 VVNSPGVGDTIQFPGEEEYSYPSILQNPSGQILVAYTYRRVAIKAVLF 394
>gi|339058255|ref|ZP_08648757.1| BNR/Asp-box repeat protein [gamma proteobacterium IMCC2047]
gi|330720539|gb|EGG98821.1| BNR/Asp-box repeat protein [gamma proteobacterium IMCC2047]
Length = 387
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 147/325 (45%), Gaps = 33/325 (10%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
A+S + H++ +V++ G+ ++GG+ EGA DV I+ + W + E
Sbjct: 54 ASSGVTREVHSAVMVKLGNGNLRAFWYGGAREGAKDVSIYTAELDLQSNEWLDERVLVER 113
Query: 77 PNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSER 124
+ NPV F+ L LF+ + V W+G S + G +W+
Sbjct: 114 HATAQQLGIYLRKIGNPVTFRDGDERLWLFF-VSVSVGGWAGSTINFMISDDDGQSWTTP 172
Query: 125 EQLPPGIL----GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
++L L K +P L +G + E +G + ++ +G K +
Sbjct: 173 KRLIASPLFNLSTLVKGEPFLYTDGSIGLPVYHEMIGKFGELIHLS-SSGEVLDK---VR 228
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES--CDGGLTWSYAKPTQLLNPN 238
+ + +S +QP+ ++ + R+ MR+ + E+ D G +W+ L NPN
Sbjct: 229 LMSGRVS-LQPIVLPSSGQHARIYMRNATDAPDSTLFETWTTDAGESWAPVVRNDLPNPN 287
Query: 239 SGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGDSWHDALTLEEN-----LAMEFSY 291
+ I G++L DG LLL N R L + S DDGDSW E+ +F+Y
Sbjct: 288 AAITGLRLDDGGLLLVINNQHDKRNNLSLMHSVDDGDSWQFLYQFEQEENTPEFEHQFAY 347
Query: 292 PAVIQASDGSVHITYTYNRTQIKHV 316
P++ Q+SDG H+ YT+N+ +IKH+
Sbjct: 348 PSLRQSSDGDFHVVYTWNKERIKHI 372
>gi|87310619|ref|ZP_01092747.1| hypothetical protein DSM3645_26524 [Blastopirellula marina DSM
3645]
gi|87286600|gb|EAQ78506.1| hypothetical protein DSM3645_26524 [Blastopirellula marina DSM
3645]
Length = 515
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 126/296 (42%), Gaps = 36/296 (12%)
Query: 44 YFGGSCEGAPDVKIWLQTFKDGRWQSP----IIADEEPNVPMWNPVLFKLPSNGLLLFYK 99
Y GG E + + + + +G+ S I D+E V ++P + PS L LF+
Sbjct: 52 YGGGVTEDRHNYILLVSSDDEGKTWSENKVVIDPDKEGPVRAFDPCPWVDPSGKLWLFW- 110
Query: 100 IGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSW 159
Q G + +T P P + P L+ NG+++C + + W
Sbjct: 111 ----CQATRGGGGDPHTFAMTTEN-----PDDAQPKWSAPQLVHNGVMMCKPTALANGDW 161
Query: 160 ----GSW-------MEVTVDAGRSWRKYGPIYIPN-ESLSVIQPVPFHTANRTLRVLMRS 207
W + + D G++W G +P + + +P+ N L L+R+
Sbjct: 162 LLPTAIWGRDHSCRVVASTDQGKTWSLRGTANVPAAKDRNCDEPMIVQRKNGELWQLVRT 221
Query: 208 FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA-----YNTVSRGV 262
GIG S S DGG TW+ P + + S +L+ G LLL + R
Sbjct: 222 SYGIGE---SHSTDGGKTWTAVTPWNIRHTASRFFIRRLQSGNLLLVKHGPLHEKTGRSH 278
Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L LSKDDG SW L L+E + SYP ++ DG V++TY Y+R K++ +
Sbjct: 279 LTAYLSKDDGQSWEGGLLLDERSGV--SYPDGAESQDGVVYLTYDYDRQGQKYIYM 332
>gi|149177101|ref|ZP_01855708.1| hypothetical protein PM8797T_26875 [Planctomyces maris DSM 8797]
gi|148843993|gb|EDL58349.1| hypothetical protein PM8797T_26875 [Planctomyces maris DSM 8797]
Length = 542
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 101/205 (49%), Gaps = 17/205 (8%)
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
W+E L PG++ NKPI+ +NG L +++ + M V+ D GR+W G
Sbjct: 159 WTEPRALFPGVM---INKPIVTKNGEWLMPAAIWRQDDSCRVM-VSKDQGRTWALRGAAN 214
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
+P + + +P+ + ++ +L+R+ + GIG S S D G +W+ K L S
Sbjct: 215 VPKDRRNCDEPMLVERQDGSVWLLVRTSSYGIGD---SVSTDAGRSWTEVK-DHLQQTTS 270
Query: 240 GIDGVKLKDGRLLL-AYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
+L G LLL +N + R L LS DDG SW L L+ + SYP
Sbjct: 271 RFHIQRLASGNLLLIKHNGIDERSRGRNHLTAYLSADDGKSWKGGLLLDTRDMV--SYPD 328
Query: 294 VIQASDGSVHITYTYNRTQIKHVVL 318
QA DG++++ Y +NR KH++L
Sbjct: 329 ATQAPDGTIYVIYDWNRADEKHILL 353
>gi|266620455|ref|ZP_06113390.1| putative BNR/Asp-box repeat-containing domain protein [Clostridium
hathewayi DSM 13479]
gi|288867941|gb|EFD00240.1| putative BNR/Asp-box repeat-containing domain protein [Clostridium
hathewayi DSM 13479]
Length = 389
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/286 (25%), Positives = 121/286 (42%), Gaps = 48/286 (16%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQK-----WSGCMKRSYNKGVTWSERE 125
++ +P +++P L+ P L +FY + W + +TWS
Sbjct: 80 VVEHPDPECRIYDPNLWIAPDKKLWMFYTQARGFNDGRSGVWVTVCDQPDTDPLTWSAPR 139
Query: 126 QLPPGILGPSKNKPILLENG--LLLCGSSVESWNS-----------WGSWMEVTVDAGRS 172
++ GI+ NKPI+ G L C ++ S S + + D G++
Sbjct: 140 RIANGIM---MNKPIITTKGEWLFPCAIWCDTSGSVPAERHGLEQEQFSNVYASSDKGKT 196
Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
G IPN S + + + +L +L+R+F+GIG S S DGG TW+ + +
Sbjct: 197 ISLRGHADIPNRSFD--ENMIVEKKDGSLWMLVRTFDGIGE---SFSTDGGYTWTPGQKS 251
Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV--------------------SRGVLKVALSKDDG 272
+ P S +LK GRLLL + R L LS+DDG
Sbjct: 252 HIDGPCSRFHISRLKSGRLLLINHYQFDQRIDLEDIMHQGNVKKWKGRSHLSALLSEDDG 311
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W +L L+E E SYP +A DG +++TY + R + +++
Sbjct: 312 QTWPYSLLLDER--NEVSYPDAKEADDGFIYVTYDHERVTEREILM 355
>gi|398879007|ref|ZP_10634110.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM67]
gi|398197553|gb|EJM84530.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM67]
Length = 432
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 144/386 (37%), Gaps = 86/386 (22%)
Query: 11 VAEEFTFPANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-- 62
VAE+ P ++ F S H+S + + G + +F G+ EGA DV++ F
Sbjct: 45 VAEQVAKPIYTSRFASSGLVDFVHSSAVTALPDGSLMTVWFAGTREGAADVQVRSARFDA 104
Query: 63 KDGRW---------QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--- 110
+ G W QS + + + NPV+ P N L LFY + + W+G
Sbjct: 105 QTGEWGAEQVLATRQSTQLGTGKYIRKLGNPVVALAPDNRLWLFY-VSVSMGGWAGSAVN 163
Query: 111 MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWM 163
+ S + G WS QL I P ++ P+ +G + E + ++
Sbjct: 164 VMVSEDYGRQWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFSEYL 220
Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
++ D G K+ + N + P+ R + +L + N RV + + D G
Sbjct: 221 YLSAD-GEVIDKFRISHGNNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAG 276
Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTL 281
TWS P NPNS + + + LL+A N + G K++L D D W L
Sbjct: 277 QTWSEPYPLDPSNPNSSLAAIATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDL 336
Query: 282 -------------------------------------------------EENLAMEFSYP 292
E+ + E+ YP
Sbjct: 337 DKSPDPEGDTFSPEAYKEIIGREFRGSSGALRQPLEAQFLNNLDSRVCKEQACSFEYEYP 396
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
I++ DG H+ Y++N T IKHV
Sbjct: 397 YFIRSPDGMYHLVYSWNNTFIKHVTF 422
>gi|424825293|ref|ZP_18250280.1| hypothetical protein CAB1_0654 [Chlamydophila abortus LLG]
gi|333410392|gb|EGK69379.1| hypothetical protein CAB1_0654 [Chlamydophila abortus LLG]
Length = 357
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 138/304 (45%), Gaps = 23/304 (7%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP 80
S+ ++ STIVE G LV + + K+ + GRW P + +
Sbjct: 31 SSDNRTSGMSTIVETG-GRLLVVW-----QDCDSNKLIGRRCDQGRWSEPYEIFPQASGL 84
Query: 81 MWNPVLFKLPSNGLLLFYK---IGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKN 137
+PVL ++ S + LFY+ +G+ + K + S+N GV W E LPPGI G ++N
Sbjct: 85 FSHPVLVRISSQEIWLFYRKQQVGESISK--PYVAFSFNSGVHWLEHRLLPPGIEGTTRN 142
Query: 138 KPIL--LENGLLLCGSSV-ESWNS---WGSWMEVTVDAGRSWR-KYGPIYIPNESLSVIQ 190
P + + N + + SV ES + SW+E+ RSW + GPI + + I+
Sbjct: 143 PPYVDRIRNRVYIPSFSVYESPDQELIGDSWVEIYNFFSRSWEGRMGPILGGSRNQVSIE 202
Query: 191 PVPFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
P + L + R +F +C+S S + G++WS K + ++ I ++G
Sbjct: 203 PAIVKLTSNKLALFFRNAFVEESYICLSVSTNQGISWSKLKNLPWRSYDNSISVQSGRNG 262
Query: 250 RLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
+L L N+ G L S ++G +W+ ++ +++ P+ G +H+ YT
Sbjct: 263 KLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSID---PSTYLDRQGCMHLVYTS 319
Query: 309 NRTQ 312
Q
Sbjct: 320 KDAQ 323
>gi|398882671|ref|ZP_10637637.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM60]
gi|398198338|gb|EJM85296.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM60]
Length = 432
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 144/386 (37%), Gaps = 86/386 (22%)
Query: 11 VAEEFTFPANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-- 62
VAE+ P ++ F S H+S + + G + +F G+ EGA DV++ F
Sbjct: 45 VAEQVAKPIYTSRFASSGLVDFVHSSAVTALPDGSLMTVWFAGTREGAADVQVRSARFDA 104
Query: 63 KDGRW---------QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--- 110
+ G W QS + + + NPV+ P N L LFY + + W+G
Sbjct: 105 QTGEWGAEQVLATRQSTQLGTGKYIRKLGNPVVALAPDNRLWLFY-VSVSMGGWAGSAVN 163
Query: 111 MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWM 163
+ S + G WS QL I P ++ P+ +G + E + ++
Sbjct: 164 VMVSEDFGRQWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFSEYL 220
Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
++ D G K+ + N + P+ R + +L + N RV + + D G
Sbjct: 221 YLSAD-GEVIDKFRISHGNNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAG 276
Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTL 281
TWS P NPNS + + + LL+A N + G K++L D D W L
Sbjct: 277 QTWSEPYPLDPSNPNSSLAAIATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDL 336
Query: 282 -------------------------------------------------EENLAMEFSYP 292
E+ + E+ YP
Sbjct: 337 DKSPDPEGDTFSPEAYKEIIGREFRGSSGALRQPLEAQFLDNLDSRVCKEQACSFEYEYP 396
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
I++ DG H+ Y++N T IKHV
Sbjct: 397 YFIRSPDGMYHLVYSWNNTFIKHVTF 422
>gi|405379113|ref|ZP_11033017.1| putative neuraminidase (sialidase) [Rhizobium sp. CF142]
gi|397324370|gb|EJJ28731.1| putative neuraminidase (sialidase) [Rhizobium sp. CF142]
Length = 394
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 142/368 (38%), Gaps = 85/368 (23%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G W P ++P
Sbjct: 35 SPCIQNHAANLAFLADGTLTCVWFGGTMEGMGDISIYMSRLAPGTDCWSEPEKMSDDPAK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL---- 132
NP++F P + L Y Q G + + S + G +++ PP IL
Sbjct: 95 SEQNPLIFNAPDGRVWLLY-TSQTSGDQDGAVVKCRISEDGGKSFA-----PPTILCDLP 148
Query: 133 GPSKNKPILLENG-------LLLCGS-SVESWNSWGSWMEVTV--DAGRSWRKYGPIYIP 182
G + I++ NG + C S + W+ V + D G+SW+ +
Sbjct: 149 GTFVRQQIVV-NGKGDWLLPIFRCISLPGQRWSGDADTAGVLISHDGGKSWQ----MKEI 203
Query: 183 NESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
SL + VP +R + ++ R +S D G TWS +PT L N NS
Sbjct: 204 EGSLGAVHMNIVPLE-GDRMIAFFRDRYSENIRASLS--LDQGETWSAPEPTNLPNNNSS 260
Query: 241 IDGVKLKDGRLLLAYNT--------------------------------------VSRGV 262
I V L D R+ + YN V R
Sbjct: 261 IQAVGLTDRRIAMVYNHSNAASSDARRHSLYDEIESDKAPVAEISSMPARKAIWGVPRAP 320
Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
L +A+S D G + L+ + L EFSYP++ Q DG++H+ YTY R
Sbjct: 321 LSLAVSTDTGGYFERKFDLDTGDGYCLSNNSKDMLNREFSYPSITQGPDGALHVAYTYYR 380
Query: 311 TQIKHVVL 318
IK+V L
Sbjct: 381 RAIKYVRL 388
>gi|398994208|ref|ZP_10697120.1| hypothetical protein PMI22_01711 [Pseudomonas sp. GM21]
gi|398132879|gb|EJM22131.1| hypothetical protein PMI22_01711 [Pseudomonas sp. GM21]
Length = 432
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 100/390 (25%), Positives = 149/390 (38%), Gaps = 87/390 (22%)
Query: 3 KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
KD K + F A+S H+S I + G + +F GS EGA DV++ F
Sbjct: 46 KDEAAKPIYTSRF---ASSGLVDFVHSSAITALPDGSLMAVWFAGSREGAADVQVRSARF 102
Query: 63 K--DGRW-QSPIIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC- 110
G W + ++A + + NPV+ P N L LFY + + W+G
Sbjct: 103 NVSTGEWGEEQVMATRDSTQQSTRKYIRKLGNPVIALAPDNRLWLFY-VSVSMGGWAGSA 161
Query: 111 --MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGS 161
+ S + GV WS QL I P ++ P+ +G + E +
Sbjct: 162 VNVMVSDDFGVNWSAPRQL---ITSPFLNISTLVRSAPVFHVDGSIGLPVYHEFLGKFAE 218
Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSES 219
++ ++ D G KY I N SL QP VP R + +L + RV S +
Sbjct: 219 YLYLSAD-GEVIDKYR-ISHGNHSL---QPAVVPL-DGQRGIALLRYAGETHHRVLASRT 272
Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW-- 275
D G TWS P NPNS + V + +L+A N + G K++L D + W
Sbjct: 273 EDAGQTWSEPYPLDPSNPNSSLAAVATPEHGMLVALNDLQEGRFKLSLYLTDAKMEDWRL 332
Query: 276 -HD-------------ALTLEENLAMEF-------------------------------- 289
HD L +E + +EF
Sbjct: 333 VHDLDKSPDPEGDPFSPLAYKEIIGIEFRGSSGKLRQPLEAQFLANLDNRVCKNDRCEFE 392
Query: 290 -SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
YP I++ DG H+ Y++N T IKHV
Sbjct: 393 YEYPYFIRSPDGMYHLVYSWNNTFIKHVTF 422
>gi|424854452|ref|ZP_18278810.1| glycosyl hydrolase [Rhodococcus opacus PD630]
gi|356664499|gb|EHI44592.1| glycosyl hydrolase [Rhodococcus opacus PD630]
Length = 412
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
AP HA+ + + G +F G+ EG PD+ +W W P+ +P
Sbjct: 35 APQVQNHAANLTVLPDGSLGCVWFAGTQEGVPDISVWFSGLAPDADAWSEPVQLSGDPTR 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQE-VQKWSGCMKR-SYNKGVTWSEREQLPP--GILGPS 135
NPVL + + L + Q + M+R S + G TW E L P G
Sbjct: 95 SEQNPVLHVTNTETVWLMWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETDAGGVF 154
Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+P++ L G LL G + S M ++ D G SWR+ + +P S
Sbjct: 155 VRQPVVALPTGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDQGNSWRE---VTVPG-S 209
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ + A+ TL L RS + S S D G TW+ + T+L N NS I V
Sbjct: 210 VGCVHMNIGRLADGTLVALYRS-RWADSIYRSTSTDDGDTWTEPEATELPNNNSSIQFVV 268
Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
L D RL L +N S
Sbjct: 269 LPDDRLALVFNESSAADATARRTSLYDEIDDDGLADGVPEQADTELEPLDDGDSRSAFWG 328
Query: 260 --RGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDGSVHIT 305
R + +A+S+D G +W D L N L E+SYP++ +DG +H+
Sbjct: 329 APRAPMTLAMSEDGGRTWPVRRDIEIGDGYCLSNNSRDGLNREYSYPSITAGNDGRLHVA 388
Query: 306 YTYNRTQIKHVVL 318
+T R I++V L
Sbjct: 389 FTRFRQAIEYVEL 401
>gi|331656696|ref|ZP_08357658.1| hypothetical protein ECKG_00503, partial [Escherichia coli TA206]
gi|331054944|gb|EGI26953.1| hypothetical protein ECKG_00503 [Escherichia coli TA206]
Length = 307
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 109/247 (44%), Gaps = 23/247 (9%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
LF P N L L + + + ++ +S + G TW E L PG +PI
Sbjct: 93 LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149
Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+L+NG + C + E W N S ++++ D G SWR + +P +SL +
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
N TL L RS + +S S D G +WS + T+L N NS I L G L
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264
Query: 253 LAYNTVS 259
L YN +S
Sbjct: 265 LVYNAMS 271
>gi|387824033|ref|YP_005823504.1| BNR/Asp-box repeat-containing protein [Francisella cf. novicida
3523]
gi|328675632|gb|AEB28307.1| BNR/Asp-box repeat protein [Francisella cf. novicida 3523]
Length = 307
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 122/278 (43%), Gaps = 34/278 (12%)
Query: 40 FLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPIIADEEPNV-------PMWNPVL 86
+ ++ G E AP+ KI L T + G+W II ++ + NP++
Sbjct: 8 LFITWYSGDQETAPNTKIMLATAEKVAGKWHFDEIRSIIDRQKFQSILKKHIHHLGNPII 67
Query: 87 FKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGL 146
+ L+F S + S + G TWS+ P IL + +L
Sbjct: 68 YSQGKKLWLVFTSSSGGWTTSSLNIMYSNDLGKTWSQ----PKTILSSN-----ILNFST 118
Query: 147 LLCGSSVESWNSWGS---WMEVTVDAGR--SWRKYGPIYIPNESLS---VIQPVPFHTAN 198
L GS++E N+ + + E GR + + G + +E + +QP +
Sbjct: 119 LTRGSAIELDNNRFAIPVYKEFNNLNGRWFIFNQNGELIFVSEMTNDGVNLQPTVVPLSK 178
Query: 199 RTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN- 256
L R + I ++ +E+ D G++WS KPTQL NP++GI +K+++G +LLAYN
Sbjct: 179 THALALYRQMHSPIKKIYTNETADSGISWSKVKPTQLDNPDAGIAAIKIQNG-ILLAYNN 237
Query: 257 -TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
T SR L +A D+ W + T E SYPA
Sbjct: 238 ATNSRADLSLAFKADNSQQWQNIYTFPNKTKAELSYPA 275
>gi|226364119|ref|YP_002781901.1| hypothetical protein ROP_47090 [Rhodococcus opacus B4]
gi|226242608|dbj|BAH52956.1| hypothetical protein [Rhodococcus opacus B4]
Length = 409
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ--TFKDGRWQSPIIADEEPNV 79
AP HA+ + + G +F G+ EG PD+ +W T W P+ +
Sbjct: 32 APQVQNHAANLTVLPDGSLACVWFAGTQEGVPDISVWFSRLTPDADTWSEPVQLSHDSTR 91
Query: 80 PMWNPVLFKLPSNGL-LLFYKIGQEVQKWSGCMKR-SYNKGVTWSEREQLPPGIL--GPS 135
NPVL + + LL+ Q + M+R S + G TW E L P G
Sbjct: 92 SEQNPVLHVTNTGTVWLLWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETEAGGVF 151
Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+P+ L +G LL G + S M ++ D G SWR+ + +P S
Sbjct: 152 VRQPVAALPSGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDDGDSWRE---VVVPG-S 206
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ + A+ TL L RS + S S D G TW+ T+L N NS + V
Sbjct: 207 VGCVHMNIGRLADGTLVALYRS-RWADSIYRSTSTDDGDTWTEPVATELPNNNSSVQFVV 265
Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
L D RL L +N S
Sbjct: 266 LPDDRLALVFNESSAADATARRTSLYDEIDDDGLADEVPAAPDGEPEPLDDGDSRSAFWG 325
Query: 260 --RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHIT 305
R + +A+S+D G +W LE + L EFSYP++ +DG +H+
Sbjct: 326 APRAPMTLAISEDGGRTWPLRRDLETGDGYCLSNNSRDGLNREFSYPSITTGNDGRLHVA 385
Query: 306 YTYNRTQIKHVVL 318
+T R I++V L
Sbjct: 386 FTRFRQAIEYVEL 398
>gi|171913862|ref|ZP_02929332.1| hypothetical protein VspiD_21825 [Verrucomicrobium spinosum DSM
4136]
Length = 936
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 114/260 (43%), Gaps = 37/260 (14%)
Query: 86 LFKLPSNGLLLFYKIGQ-----EVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
L+ P+ L LFY W+ + + TWSE ++ G+ + NKPI
Sbjct: 129 LWTDPTGKLWLFYDQSMGYYDGRAGVWAITCENPDAENPTWSEPRRIWHGL---TLNKPI 185
Query: 141 LLENGLLLCGSSVESWNSWG----------------SWMEVTVDAGRSWRKYGPIYIPNE 184
+L+NG L S+ + + G + + V+ D G++W + G + +P
Sbjct: 186 VLKNGEWLLPISLWTRDRIGPAELRDGFKELDEFRMANIFVSTDQGKTWNRRGGVAVPET 245
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
V L ++ R+ GI S S D G TW+ A+P+ + N +S
Sbjct: 246 DFDEHMFVELKDGR--LWLMARTKYGIAE---SFSSDQGRTWTAAQPSTIQNVSSRFFFR 300
Query: 245 KLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
+L G LLL N R + LS D+G SW L ++E + SYP QA
Sbjct: 301 RLASGNLLLVKNGPLNERLKGRSHMTAYLSDDEGKSWKGGLIIDERGGV--SYPDGFQAP 358
Query: 299 DGSVHITYTYNRTQIKHVVL 318
DG+++IT+ +NR + + ++
Sbjct: 359 DGTIYITHDHNRDEDREILF 378
>gi|449133855|ref|ZP_21769369.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula europaea 6C]
gi|448887495|gb|EMB17870.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula europaea 6C]
Length = 733
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 96/206 (46%), Gaps = 27/206 (13%)
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
G + NKP +L NG + S++ G++ V+ D G +W + G
Sbjct: 499 GVTLNKPTVLSNGDWMLPVSLDERGGLGAFRNCFQNLDPLRGANVFVSKDQGETWHRRGR 558
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
PN + + + +L +L R+ GI S S DGG TWS A +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRKGI---MESTSVDGGQTWSAAVASKVKHPN 613
Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
+ +L+ GRLLL + + R L LS D+G +W L L+E + SYP
Sbjct: 614 ARFHVRRLQSGRLLLIKHGEAIDSHRGRVALSAWLSDDEGQTWFGGLMLDERKGI--SYP 671
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
Q+S+GS++I+Y NR V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697
>gi|71906197|ref|YP_283784.1| glycosyl hydrolase [Dechloromonas aromatica RCB]
gi|71845818|gb|AAZ45314.1| Glycosyl hydrolase, BNR repeat [Dechloromonas aromatica RCB]
Length = 375
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 4/124 (3%)
Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN- 256
+R L VL + G R +S + +GG TW A L P++ + ++L GRLLLA N
Sbjct: 243 HRGLAVLRDNRAGTSRATLS-TTNGGQTWETASELALPAPDAPVALLRLASGRLLLAGNP 301
Query: 257 TVSRGVLKVALSKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+ L++ LS DDG +W +E + EF+ PA++Q DG +H+TYT+ + QI+
Sbjct: 302 QQGKEALQLWLSADDGQTWAMKRIVEAASDGGAEFADPALLQGRDGRIHLTYTWRQQQIR 361
Query: 315 HVVL 318
+V
Sbjct: 362 YVAF 365
>gi|412991299|emb|CCO16144.1| BNR/Asp-box repeat family protein [Bathycoccus prasinos]
Length = 375
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 84/146 (57%), Gaps = 7/146 (4%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
++ D GR+W+ +E+ +I+ +N ++ L R+ +G + S S DGG
Sbjct: 167 ISKDNGRTWKPSESSLTDSETW-LIEGTMAELSNSSVLHLFRT--SVGEIYKSLSHDGGR 223
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLE 282
TW+ A T L NPNS I ++L DGRL LAYN R L VA+S+DDG++WH A +E
Sbjct: 224 TWTKASSTGLPNPNSKIHLLRLADGRLALAYNHDRKYRANLYVAISEDDGENWHLAAIVE 283
Query: 283 E-NLAMEFSYPAVIQASDGSVHITYT 307
+ + + ++YP +++ + +++ Y+
Sbjct: 284 KGSESSMYAYPTMVE-DECKLYVAYS 308
>gi|452001230|gb|EMD93690.1| glycoside hydrolase family 33 protein [Cochliobolus heterostrophus
C5]
Length = 384
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 146/381 (38%), Gaps = 103/381 (27%)
Query: 17 FPANSAPFKSC--HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPII 72
F +N P + HAS ++++ G L A+FGGS EG D+ I L K G W +P
Sbjct: 17 FASNYIPASTVQVHASNLLQLPNGDLLCAWFGGSQEGLSDICIHLSRLKKGSRVWSTPQK 76
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL-- 127
++P NPVLF +P L Q + R S ++G TWS L
Sbjct: 77 ISDDPERSEQNPVLFLVPGTAALWVMYTAQPAGNQDKAVVRYRVSSDEGQTWSLAHNLFN 136
Query: 128 --------PPGILGPS---------KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
P +L +N P G GS S ++ T D G
Sbjct: 137 DEGLFIRQPVTVLKDGTWVLPAWYCRNPP-----GFRWVGSDDI------SVVKYTRDGG 185
Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLR----VLMRSFNGIGRVCMSESCDGGLTW 226
++W++ G +PN +V H + LR + + MS S + + W
Sbjct: 186 KTWQEKG---VPNSIGAV------HMNIKALRDGSYIAFYRSRWADNIYMSTSTN-CIDW 235
Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------------------------- 257
+ KPT L NPNSGI L D RL++ YN
Sbjct: 236 TEPKPTSLPNPNSGICFEVLSDDRLIMIYNDSQFEKGQAKREGLYDDITPKDDKRVNQRG 295
Query: 258 --------VSRGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAV-IQ 296
V R ++ V +S D G +W + TL+E E SYP++ ++
Sbjct: 296 STKDAVWGVPRKIMTVGVSSDGGRTWK-SKTLQEGDGFCMTNNSRTKENRELSYPSIWVE 354
Query: 297 ASDGS-VHITYTYNRTQIKHV 316
D V++ +T+ R +IK +
Sbjct: 355 RGDKDVVNVAFTFWRQRIKFI 375
>gi|426409730|ref|YP_007029829.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. UW4]
gi|426267947|gb|AFY20024.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. UW4]
Length = 431
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/372 (22%), Positives = 138/372 (37%), Gaps = 80/372 (21%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
A+S H+S + + G + A+F GS EGA DV++ F K G W + +
Sbjct: 58 ASSGLVDFVHSSAVTALPDGSLMTAWFAGSREGAADVQVRSARFDAKTGEWGAERVLATR 117
Query: 77 PNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
+ + NPV+ P L LFY + + W+G + S + G WS
Sbjct: 118 NSTQQGTQRYIRKLGNPVIALAPDKRLWLFY-VSVSMGGWAGSAVNVMVSDDFGENWSAP 176
Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
QL I P ++ P+ +G + E + ++ ++ D G KY
Sbjct: 177 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GDVIDKYR 232
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ N + P+ R + +L + N RV + + D G WS P + NP
Sbjct: 233 ISHGKNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAGQNWSEPYPLEPWNP 289
Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN----------- 284
NS + V + LL+A N + G K++L D D+W L+++
Sbjct: 290 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDEKMDNWRSLRDLDKSPDPEGDPFSPQ 349
Query: 285 --------------------------------------LAMEFSYPAVIQASDGSVHITY 306
E+ YP I++ DG H+ Y
Sbjct: 350 AYKKIIGNEFRGSSGALRQPLEAQFLNNLDTRVCNPRGCEFEYEYPYFIRSPDGMYHLVY 409
Query: 307 TYNRTQIKHVVL 318
++N T IKHV
Sbjct: 410 SWNNTFIKHVTF 421
>gi|432337240|ref|ZP_19588687.1| hypothetical protein Rwratislav_19854 [Rhodococcus wratislaviensis
IFP 2016]
gi|430775820|gb|ELB91296.1| hypothetical protein Rwratislav_19854 [Rhodococcus wratislaviensis
IFP 2016]
Length = 408
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
AP HA+ + + G +F G+ EG PD+ +W W P+ ++P
Sbjct: 31 APQVQNHAANLTVLPDGSLGCVWFAGTQEGVPDISVWFSRLAPDADAWSEPVQLSDDPTR 90
Query: 80 PMWNPVLFKLPSNGL-LLFYKIGQEVQKWSGCMKR-SYNKGVTWSEREQLPP--GILGPS 135
NPVL + + LL+ Q + M+R S + G TW E L P G
Sbjct: 91 SEQNPVLHVTNTETVWLLWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETDAGGVF 150
Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
+P++ L G LL G + S M ++ D G SWR+ + +P S
Sbjct: 151 VRQPVVALPTGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDHGDSWRE---VTVPG-S 205
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ + A TL L RS + S S D G TW+ + T+L N NS I V
Sbjct: 206 VGCVHMNIGRLAGGTLVALYRS-RWADSIYRSVSTDDGDTWTEPEATELPNNNSSIQFVV 264
Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
L RL L +N S
Sbjct: 265 LPGDRLALVFNESSAANATARRTSLYDEIDDDGLADGVPEQADPELEPLDDGDSRSAFWG 324
Query: 260 --RGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDGSVHIT 305
R + +A+S+D G +W D L N L E+SYP++ +DG +H+
Sbjct: 325 APRAPMTLAMSEDGGRTWPVRRDIEIGDGYCLSNNSRDGLNREYSYPSITAGNDGRLHVA 384
Query: 306 YTYNRTQIKHVVL 318
+T R I++V L
Sbjct: 385 FTRFRQAIEYVEL 397
>gi|407362319|ref|ZP_11108851.1| hypothetical protein PmanJ_00935 [Pseudomonas mandelii JR-1]
Length = 432
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 140/372 (37%), Gaps = 80/372 (21%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADE 75
A+S H+S + + G + +F GS EGA DV++ F K G W + ++A
Sbjct: 59 ASSGLVDFVHSSAVTALPDGSLMTVWFAGSREGAADVQVRSARFDSKTGEWGAEQVLATR 118
Query: 76 EPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
+ + NPV+ P N + LFY + + W+G + S + G WS
Sbjct: 119 DSTQQGTRKYIRKLGNPVVALAPDNRVWLFY-VSVSMGGWAGSAVNVMVSDDFGEHWSAP 177
Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
QL I P ++ P+ +G + E + ++ ++ D G K+
Sbjct: 178 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GDVIDKFR 233
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ N + P+ R + +L + N RV + + D G TWS P NP
Sbjct: 234 ISHGMNSLQPAVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAGQTWSEPYPLDPSNP 290
Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE------------- 282
NS + V + LL+A N + G K++L D D W L+
Sbjct: 291 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDLDKSPDPLGAPFSPE 350
Query: 283 --------------------------ENL----------AMEFSYPAVIQASDGSVHITY 306
+NL E+ YP I++ DG H+ Y
Sbjct: 351 AYKEIIGREFRGSSGARRHSLEAEFLDNLDNRVCKTQGCEFEYEYPYFIRSPDGMYHLVY 410
Query: 307 TYNRTQIKHVVL 318
++N T IKHV
Sbjct: 411 SWNNTFIKHVTF 422
>gi|417305276|ref|ZP_12092248.1| hypothetical protein RBWH47_02271 [Rhodopirellula baltica WH47]
gi|327538370|gb|EGF25042.1| hypothetical protein RBWH47_02271 [Rhodopirellula baltica WH47]
Length = 114
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 58/111 (52%), Gaps = 26/111 (23%)
Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWHDALTLE 282
NPNSG D V L+DGR +L YN R +L +A+S D G +W LTLE
Sbjct: 3 NPNSGTDAVTLQDGRQVLIYNHSEGLVKRKDAPDLKPRRILNLAISSD-GKTWKPVLTLE 61
Query: 283 ENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
E+SYPA+IQ SDG +++ YTYNR +KH V+ P+
Sbjct: 62 HETGPHPKDPERRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 112
>gi|443643049|ref|ZP_21126899.1| BBNR/Asp-box repeat-containing glycosyl hydrolase [Pseudomonas
syringae pv. syringae B64]
gi|443283066|gb|ELS42071.1| BBNR/Asp-box repeat-containing glycosyl hydrolase [Pseudomonas
syringae pv. syringae B64]
Length = 432
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 135/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + GV+WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGVSWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|398945574|ref|ZP_10671821.1| hypothetical protein PMI27_05672 [Pseudomonas sp. GM41(2012)]
gi|398156586|gb|EJM45002.1| hypothetical protein PMI27_05672 [Pseudomonas sp. GM41(2012)]
Length = 433
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 141/370 (38%), Gaps = 80/370 (21%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADE 75
A+S H+S + + G + +F GS EGA DV++ F K G W + ++A
Sbjct: 60 ASSGLVDFVHSSAVTALPDGSLMTVWFAGSREGAADVQVRSARFDPKTGEWGAEKVLATR 119
Query: 76 EPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
+ + NPV+ P N + LFY + + W+G + S + G +WS
Sbjct: 120 DSTQQGTRKYIRKLGNPVVALAPDNRVWLFY-VSVSMGGWAGSAVNVMVSDDFGESWSAP 178
Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
QL I P ++ P+ +G + E + ++ ++ D G K+
Sbjct: 179 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GNVIDKFR 234
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ N + P+ R + +L + + RV + + D G TWS P NP
Sbjct: 235 ISHGNNSLQPTVVPL---DGQRGIALLRYAGDTHHRVLATRTEDAGQTWSEPYPLDPSNP 291
Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE------------- 282
NS + V + LL+A N + G K++L D D W L+
Sbjct: 292 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDLDKSPDPAGAPFSPE 351
Query: 283 --------------------------ENL----------AMEFSYPAVIQASDGSVHITY 306
+NL E+ YP I++ DG H+ Y
Sbjct: 352 AYKEIIGREFRGSSGALRQPLEAQFLDNLDNRVCKSQGCEFEYEYPYFIRSPDGMYHLVY 411
Query: 307 TYNRTQIKHV 316
++N T IKHV
Sbjct: 412 SWNNTFIKHV 421
>gi|391231583|ref|ZP_10267789.1| hypothetical protein OpiT1DRAFT_04191 [Opitutaceae bacterium TAV1]
gi|391221244|gb|EIP99664.1| hypothetical protein OpiT1DRAFT_04191 [Opitutaceae bacterium TAV1]
Length = 581
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 94/188 (50%), Gaps = 11/188 (5%)
Query: 137 NKPILLENGLLLCGSS-VESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
NKP +L +G + SS ++S + + G+S+ G + +P ++ + +
Sbjct: 199 NKPTVLSDGTWVWPSSPLKSPRGLPVRPLYSNNNGKSFFYKGAVPVPPQTRNADEYQITE 258
Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
+++L +L R GIG S S DGG TW+ P+Q+ + S +L+ G+LLL
Sbjct: 259 RKDKSLWLLNRMKYGIGE---SFSTDGGTTWTEMAPSQIAHITSRFYITRLQSGKLLLVK 315
Query: 256 N-----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
+ V R L LS DDG++W L ++E + SYP QA DG ++I Y Y R
Sbjct: 316 HGAIKSNVGRSQLMAFLSDDDGNNWTGGLVIDER--NDLSYPDGTQAKDGRIYIIYDYQR 373
Query: 311 TQIKHVVL 318
+ K +++
Sbjct: 374 HREKEILM 381
>gi|399518684|ref|ZP_10759638.1| BNR/Asp-box repeat protein [Pseudomonas pseudoalcaligenes CECT
5344]
gi|399113178|emb|CCH36196.1| BNR/Asp-box repeat protein [Pseudomonas pseudoalcaligenes CECT
5344]
Length = 425
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/377 (23%), Positives = 142/377 (37%), Gaps = 84/377 (22%)
Query: 18 PANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQS 69
PA +A F S H+++I + G + A+F G+ EGA DV+I F G+W +
Sbjct: 45 PAMAAHFASSDLEDFVHSASITGLPGGDLMAAWFAGTREGAADVQIRAARFDASSGQWSA 104
Query: 70 P-IIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNK 117
++A E + NPV+ P N L LFY + + W+G S +
Sbjct: 105 ERVLASRESTQHAVGKHIRKLGNPVISLAPDNRLWLFY-VSVSLGGWAGSAVNAMVSDDL 163
Query: 118 GVTWSEREQLPPGILGPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
G TWS QL + P N P+ +G + E + ++ ++ +
Sbjct: 164 GDTWSAPRQL---VTSPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFPEYLHLSANG- 219
Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
G I S+ V R + +L S RV S S D G WS +
Sbjct: 220 ---EVLGKHRIGKGRYSLQPTVVALDDKRAVALLRYSGEEHHRVLASYSEDTGRNWSQPE 276
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEEN---- 284
P + NPNS + V DG LL+A N + G ++ L D + +W + L+E+
Sbjct: 277 PVEPSNPNSSLAAVGRPDGSLLVAMNDLEDGRFRLTLYATDANLSNWRNLGELDESPNPW 336
Query: 285 -------------------------------------------LAMEFSYPAVIQASDGS 301
+ ++ YP ++ +G+
Sbjct: 337 GEPIARDEFPAVVSEKYLASGGKPEQQATFLERVSARMCSDDGCSFDYEYPYFVRDREGA 396
Query: 302 VHITYTYNRTQIKHVVL 318
H+ Y++N IKH+
Sbjct: 397 YHLVYSWNDMFIKHLTF 413
>gi|330501037|ref|YP_004377906.1| BNR/Asp-box repeat-containing protein [Pseudomonas mendocina NK-01]
gi|328915323|gb|AEB56154.1| BNR/Asp-box repeat-containing protein [Pseudomonas mendocina NK-01]
Length = 426
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/362 (24%), Positives = 136/362 (37%), Gaps = 78/362 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSP-IIADEEPNV---- 79
H+++I + G + A+F GS EGA DV+I F G+W + ++A E
Sbjct: 61 VHSASITGLPGGDLMAAWFAGSREGAADVQIRAARFDAASGQWSAERVLATRESTQRAVG 120
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L LFY + + W+G S + G TWS QL +
Sbjct: 121 KHIRKLGNPVISLAPDNRLWLFY-VSVSLGGWAGSAVNAMVSDDLGETWSAPRQL---VT 176
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ + G I
Sbjct: 177 SPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFPEYLHLSTNG----EVLGKYRIGKGR 232
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
S+ V NR + +L + RV S S D G +WS +P + NPNS + V
Sbjct: 233 YSLQPTVVPLDENRAVALLRYAGEEHHRVLASYSEDAGRSWSRPQPVEPSNPNSSLAAVG 292
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG---------------DSW-----HD-------- 277
DG LL+A N + G ++ L D + W HD
Sbjct: 293 RPDGSLLVAMNDLEDGRFRLTLYATDAKLGNWRTLGELDETPNPWGEPISHDEFPTVVSE 352
Query: 278 ------------ALTLE---------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
A LE E ++ YP + +G+ H+ Y++N IKH+
Sbjct: 353 KYRASGGRPELQARFLEQVTARMCSAEGCTFDYEYPYFTRDREGTYHLVYSWNDMFIKHL 412
Query: 317 VL 318
Sbjct: 413 AF 414
>gi|229592500|ref|YP_002874619.1| hypothetical protein PFLU5114 [Pseudomonas fluorescens SBW25]
gi|229364366|emb|CAY52136.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
Length = 422
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 135/362 (37%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
H+S++ + G + +F GS EGA DV++ F + G W + ++A E
Sbjct: 58 VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDARSGEWGAEQVLATRESTRDGTQ 117
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P L +FY + V W+ + S + G +WS QL +
Sbjct: 118 RYIRKLGNPVIALGPDKRLWMFY-VSVSVGGWATSAINLMVSDDLGASWSAPRQL---VT 173
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 174 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKNSL 232
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P+ R + +L + + RV S + D G TWS P + NPNS + V
Sbjct: 233 QPTIVPL---DGQRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVA 289
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEEN------------------- 284
+D LL+A N + G K++L D W + L+E+
Sbjct: 290 TQDDGLLVALNDLQDGRFKLSLYGTDAQLSQWRTVVQLDESPDPLGQPFSPEAYKEIIGE 349
Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
E+ YP ++ DG H+ Y++N T IK
Sbjct: 350 GFRASSGARRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 409
Query: 315 HV 316
HV
Sbjct: 410 HV 411
>gi|149177497|ref|ZP_01856100.1| hypothetical protein PM8797T_15456 [Planctomyces maris DSM 8797]
gi|148843647|gb|EDL58007.1| hypothetical protein PM8797T_15456 [Planctomyces maris DSM 8797]
Length = 938
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 95/207 (45%), Gaps = 28/207 (13%)
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSW--------------MEVTVDAGRSWRKYGP 178
G + NKP +L G + S++ + +G + + V+ D G +W++ G
Sbjct: 700 GVTLNKPTVLSKGEWMLPISLDQRSGFGPFKGCFQELDSQRGANVFVSTDQGATWQRRGA 759
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSY-AKPTQLLNP 237
PN + + + +L +L R+ GI + S DGG TW+ +P Q+ P
Sbjct: 760 AVFPNPDWH--EHMIVERKDGSLWMLARTRKGIMQTI---STDGGRTWANPTQPPQIRQP 814
Query: 238 NSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
N+ +L GRLLL + R L LS DDG +W L ++E + SY
Sbjct: 815 NARFHIRRLASGRLLLIKHGDKIDSHQGRVQLSAWLSDDDGLTWQGGLVIDERKGI--SY 872
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
P QA DGS++I+Y NR+ ++L
Sbjct: 873 PDGFQAPDGSIYISYDRNRSTDGEILL 899
>gi|422653452|ref|ZP_16716218.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
actinidiae str. M302091]
gi|330966501|gb|EGH66761.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
actinidiae str. M302091]
Length = 432
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 134/363 (36%), Gaps = 82/363 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS----PIIADEEPNVP-- 80
H+S + + G + +F GS EGA DV+I F D R + ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRF-DARTEEWGGEQVLATRESTQTGT 125
Query: 81 ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGI 131
+ NPV+ P N L LFY + V W+G S + G +WS QL +
Sbjct: 126 GKYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---V 181
Query: 132 LGPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 182 TSPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNS 240
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+ P+ A LR +F+ RV S + DGG TWS P NPNS + V
Sbjct: 241 LQPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAV 297
Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL-------------- 279
+ LL+A N + G K++L D D D L
Sbjct: 298 GVPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIG 357
Query: 280 --------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQI 313
L + + EF YP I++ DG H+ Y++N T I
Sbjct: 358 QGFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFI 417
Query: 314 KHV 316
KHV
Sbjct: 418 KHV 420
>gi|218659020|ref|ZP_03514950.1| putative glycosyl hydrolase protein [Rhizobium etli IE4771]
Length = 324
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 69/158 (43%), Gaps = 51/158 (32%)
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------------- 257
+ S S DGG +WS +PT+L N NS I L DG + + YN
Sbjct: 163 ILSSRSADGGESWSPPEPTELPNNNSSIQATVLDDGGIAMVYNHSNATMSDARRQSLYDE 222
Query: 258 -----------------------VSRGVLKVALSKDDGDSWHDALTLE------------ 282
V R L +A+S+D G ++ + L+
Sbjct: 223 IEGDVAGESAAIVADAGRKAVWGVPRAPLSLAISRDGGRTFPQRMDLDTGDGFCLSNNSK 282
Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
++L EFSYP+VIQ SDG++H+ YTY R IK+V L P
Sbjct: 283 DSLNREFSYPSVIQGSDGTLHVPYTYYRRAIKYVRLAP 320
>gi|66046737|ref|YP_236578.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
syringae B728a]
gi|63257444|gb|AAY38540.1| Glycosyl hydrolase, BNR repeat protein [Pseudomonas syringae pv.
syringae B728a]
Length = 432
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 131/359 (36%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQAGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRLSRGNNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE----------- 282
LL+A N + G K++L D D TLE
Sbjct: 302 KGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGFR 361
Query: 283 -------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 362 SSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|167519867|ref|XP_001744273.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777359|gb|EDQ90976.1| predicted protein [Monosiga brevicollis MX1]
Length = 332
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 116/262 (44%), Gaps = 34/262 (12%)
Query: 84 PVLFKLP-SNGLLLFY------------KIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
P LF P SN L LFY K G+ + G R S + G TWS +
Sbjct: 74 PSLFLQPDSNKLWLFYSESTQYDYRNDSKCGEYGHSYPGGEVRYRTSLDGGHTWSSATTI 133
Query: 128 ----PPGILGPSKNKPILLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGP 178
G + +L+ NG L E + S + ++ D G++W YG
Sbjct: 134 LTFQDRGHIAKVTANTMLVRNGRWLLPFWQEQHTVYDIGPGCSGVLISDDQGQNWTPYGC 193
Query: 179 IYIPNESLS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ N S + +I+ N ++ L R+ IG + S S D G TWS A+PT L NP
Sbjct: 194 V---NRSDTWLIENTLGTINNGSILQLFRT--QIGYLYASVSNDNGETWSNAQPTPLQNP 248
Query: 238 NSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
++ + L+DGRL+LAYN R L +A ++ DG SW TLE N++ + YP +
Sbjct: 249 DAKVCMTNLRDGRLMLAYNPSKTQRLPLDLAYTR-DGVSWTHYATLESNMSTTYQYPTPV 307
Query: 296 QASDGSVHITYTYNRTQIKHVV 317
Q + + RT IK +
Sbjct: 308 QLGNEVLTSYSADTRTGIKLAI 329
>gi|225420243|ref|ZP_03762546.1| hypothetical protein CLOSTASPAR_06586 [Clostridium asparagiforme
DSM 15981]
gi|225041060|gb|EEG51306.1| hypothetical protein CLOSTASPAR_06586 [Clostridium asparagiforme
DSM 15981]
Length = 395
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 139/331 (41%), Gaps = 54/331 (16%)
Query: 31 TIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS-----PIIADEEPNVPMWNPV 85
+I +KG V +F G + L T D R ++ +I +P +++P
Sbjct: 42 SIEVTEKGRIFVNFFSGQGAEVGGNFLMLCT-SDNRGETFRSCVTVIEHPDPLCRIYDPC 100
Query: 86 LFKLPSNGLLLFYKIGQEVQK-----WSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
L+ P+ L + Y + W+ K + W+ ++ GI+ NKPI
Sbjct: 101 LWISPAGELWMIYNQTKGFNDCRSGVWATVCKAPDAPRLEWTPPRRIANGIM---INKPI 157
Query: 141 LLENGLLL---------CGSSVESWNSWG----SWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +G L CG+ + S + + D G ++ G +P S
Sbjct: 158 VDASGDWLFPCAIWRDECGAPPTERHGLEQEQFSNVYASTDGGETFSLRGSADVPGRSFD 217
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ + + TL +L+R+F GIG S S DGG TW+ + + + P S +LK
Sbjct: 218 --EHMIVEKKDGTLWMLVRTFEGIGE---SFSHDGGRTWTPGRKSHIDGPCSRFHIRRLK 272
Query: 248 DGRLLLA-----------YNTVSRGVLK---------VALSKDDGDSWHDALTLEENLAM 287
GRLL+ N + +G +K LS+DDG +W L L+E
Sbjct: 273 SGRLLMINHFHFGERIDLNNILQQGNVKSWKGRSHLTAMLSEDDGQTWPYTLLLDER--N 330
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+ +YP + +DG +++TY Y R + + +++
Sbjct: 331 DAAYPDAKEGADGYIYVTYDYERVKQREILM 361
>gi|422633915|ref|ZP_16699027.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
pisi str. 1704B]
gi|330944496|gb|EGH46484.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
pisi str. 1704B]
Length = 378
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 13 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 72
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 73 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 131
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 132 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKF-RISRGNNSLQP 189
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 190 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 246
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 247 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 306
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 307 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 366
>gi|440724117|ref|ZP_20904455.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP34876]
gi|440725576|ref|ZP_20905841.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP34881]
gi|440358322|gb|ELP95694.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP34876]
gi|440368513|gb|ELQ05546.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP34881]
Length = 432
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|422297881|ref|ZP_16385507.1| BNR/Asp-box repeat-containing protein [Pseudomonas avellanae BPIC
631]
gi|407990587|gb|EKG32645.1| BNR/Asp-box repeat-containing protein [Pseudomonas avellanae BPIC
631]
Length = 432
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F + W ++A E
Sbjct: 67 VHSSAVTALPGGDLMAVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L LFY + V W+G S + G +WS QL +
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVSAMVSSDMGASWSAPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P+ A LR +F+ RV S + DGG TWS P NPNS + V
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLSAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
+ LL+A N + G K++L D D D L
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQ 358
Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
L + + EF YP I++ DG H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|424068662|ref|ZP_17806111.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
gi|407996831|gb|EKG37287.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
Length = 432
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDEQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|422665529|ref|ZP_16725400.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
aptata str. DSM 50252]
gi|330975946|gb|EGH76012.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
aptata str. DSM 50252]
Length = 432
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGESWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|302188253|ref|ZP_07264926.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
syringae 642]
Length = 432
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 133/360 (36%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGANWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GKGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|432893409|ref|ZP_20105421.1| hypothetical protein A31K_02554 [Escherichia coli KTE165]
gi|431424389|gb|ELH06485.1| hypothetical protein A31K_02554 [Escherichia coli KTE165]
Length = 240
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 86/207 (41%), Gaps = 54/207 (26%)
Query: 163 MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
++++ D G SWR + +P +SL + + TL L RS + +S S D
Sbjct: 31 VKISADGGHSWRD---VEVP-QSLGCVHMNITMLHDGTLVALFRS-RWADNIYISHSVDN 85
Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----------------------- 259
G +WS + T+L N NS I L G L L YN +S
Sbjct: 86 GESWSVPQATELPNNNSSIQVTTLASGELALVYNAMSAAGAVERRASLYDEIDDGDDSRK 145
Query: 260 --------------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPA 293
R + VA+S D G SW L+E L EFSYP+
Sbjct: 146 EPTAVGRSAFWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYCMTNNSLEKLNREFSYPS 205
Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
+ Q+ DG++HI YT+ R IK+V + P
Sbjct: 206 IKQSPDGTLHIAYTWWRQAIKYVRISP 232
>gi|421609054|ref|ZP_16050257.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula baltica SH28]
gi|408500157|gb|EKK04613.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula baltica SH28]
Length = 733
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 103/219 (47%), Gaps = 30/219 (13%)
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------V 165
TWS ++ G+ + NKP +L +G + S++ G + + V
Sbjct: 489 TWSAPRRIWHGV---TLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFV 545
Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
+ D G +W + G PN + + + +L +L R+ NGI S S D G T
Sbjct: 546 SKDQGETWHRRGRATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKT 600
Query: 226 WSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDAL 279
WS A +++ +PN+ +L+ GRLLL + + R L LS+D+G +W L
Sbjct: 601 WSAAVASKIKHPNARFHVRRLQSGRLLLIKHGDAIDSHHGRVGLSAWLSEDEGQTWLGGL 660
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L+E + SYP Q+S+GS++I+Y NR V++
Sbjct: 661 MLDERKGI--SYPDGFQSSNGSIYISYDRNRATDGEVLM 697
>gi|424073080|ref|ZP_17810499.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
gi|407996670|gb|EKG37131.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
Length = 432
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHVDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|62185252|ref|YP_220037.1| hypothetical protein CAB639 [Chlamydophila abortus S26/3]
gi|62148319|emb|CAH64086.1| hypothetical protein CAB639 [Chlamydophila abortus S26/3]
Length = 293
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 121/262 (46%), Gaps = 17/262 (6%)
Query: 63 KDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYK---IGQEVQKWSGCMKRSYNKGV 119
++GRW P + + +PVL ++ S + LFY+ +G+ + K + S+N GV
Sbjct: 3 REGRWSEPYEIFPQASGLFSHPVLVRISSQEIWLFYRKQQVGESISK--PYVAFSFNSGV 60
Query: 120 TWSEREQLPPGILGPSKNKPIL--LENGLLLCGSSV-ESWNSW---GSWMEVTVDAGRSW 173
W E LPPGI G ++N P + + N + + SV ES + SW+E+ RSW
Sbjct: 61 HWLEHRLLPPGIEGTTRNPPYVDRIRNRVYIPSFSVYESPDQELIGDSWVEIYNFFSRSW 120
Query: 174 R-KYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKP 231
+ GPI + + I+P + L + R+ + +C+S S + G++WS K
Sbjct: 121 EGRMGPILGGSRNQVPIEPAIVKLTSNKLALFFRNASVEESYICLSVSTNQGISWSKLKN 180
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFS 290
+ ++ I ++G+L L N+ G L S ++G +W+ ++ +++
Sbjct: 181 LPWRSYDNSISVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSID-- 238
Query: 291 YPAVIQASDGSVHITYTYNRTQ 312
P+ +H YT Q
Sbjct: 239 -PSTYLDRQECMHHVYTSKDAQ 259
>gi|384251712|gb|EIE25189.1| neuraminidase [Coccomyxa subellipsoidea C-169]
Length = 568
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 130/327 (39%), Gaps = 59/327 (18%)
Query: 28 HASTIVEVDKGHFLVAYFGGS--CEGAPDVKIWLQTFKDG--RW-QSPIIADEEPNVPMW 82
H +T+ + G + + + EG D I+ D W + ++ + N+P W
Sbjct: 71 HMATLAHMPNGSIIAQWQAATKWFEGVADQAIYYAISNDHGLTWGPTQLMPGTQDNLPAW 130
Query: 83 NPVLFKLPSNGLLLFYKIGQEVQKW------------SGCMKRSYNKGVTWSEREQLPPG 130
PVL+ L F + Q Q W S + S + G TW+ L
Sbjct: 131 GPVLYSANGTMHLWFSRSHQPCQ-WLSESGVCWAPGGSLMYQTSADNGTTWTNATAL--- 186
Query: 131 ILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------------------------V 165
P E G+ + S+N+ G W+ +
Sbjct: 187 -------DPWSAEGGIAKLACNGLSFNAAGEWLLPYWRELGGGQQCQKVPKLHGVAGVLI 239
Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTAN-RTLRVLMRSFNG-IGRVCMSESCDGG 223
T D G++W+K + N + + + +TA +R+ ++ F G S S DGG
Sbjct: 240 TGDQGKTWKK-AEVPKDNSTWMIENAIAAYTAGVNGMRMQLQIFRTRTGSSMQSWSSDGG 298
Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGDSWHDALTL 281
W++ + L NPNS +D + L D L+ A+N R L++A+S+D G +W + +
Sbjct: 299 YNWTFPSNSTLPNPNSRMDVIALPDNVLVTAFNDSPNRRSPLRLAISRDGGRNWTRSALI 358
Query: 282 EENLAMEFSYPAVI-QASDGSVHITYT 307
E+ F YPA++ A V + YT
Sbjct: 359 EDEPDGSFHYPALLYDAVQDQVMVIYT 385
>gi|418055776|ref|ZP_12693830.1| hypothetical protein HypdeDRAFT_2330 [Hyphomicrobium denitrificans
1NES1]
gi|353210054|gb|EHB75456.1| hypothetical protein HypdeDRAFT_2330 [Hyphomicrobium denitrificans
1NES1]
Length = 394
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 128/316 (40%), Gaps = 32/316 (10%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP------ 80
H +++E G L ++ EGA + ++ F W + V
Sbjct: 67 VHGPSLIETPTG-LLAFWYRAVYEGAANAELVSARFDGDHWSPTAVVTNSNTVTRDIGLT 125
Query: 81 ---MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG---- 130
+ NPV F+ N + LF+ + W+ C + RS + G TWS ++L
Sbjct: 126 VKSLANPVPFRRSPNEIWLFFA-ASRLSGWATCEIVLIRSVDNGRTWSPAQRLYASPFLN 184
Query: 131 ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
+ +K+ PI L + ++ +++ V + R+ R + + N Q
Sbjct: 185 MSHLTKSLPIRLSGDRI----ALPAYHEMNRKYPVMLVLDRNGRVIDKLRMGNGGTVGYQ 240
Query: 191 PVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
P T T +R N R+ MS + D G TW+ P L NP I ++ D
Sbjct: 241 PAIVATGPTTAIAFVRRLNSFRPQRILMSHTVDAGRTWTPPTPIDLPNPGGPIAAIRYDD 300
Query: 249 GRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTL-EENLAME---FSYPAVIQASDGSV 302
R+LLA+N +K+A S DG S T+ ++N +E +YP +I++ G
Sbjct: 301 TRILLAFNDDPDHESNVKLAFSNLDGTSLRRIGTVAQKNDRLEGDAVAYPFLIESEPGQF 360
Query: 303 HITYTYNRTQ--IKHV 316
+ ++ Q I HV
Sbjct: 361 DVVFSRPPPQHAINHV 376
>gi|213968573|ref|ZP_03396715.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato T1]
gi|213926506|gb|EEB60059.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato T1]
Length = 432
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F + W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L LFY + V W+G S + G +WS QL +
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P+ A LR +F+ RV S + DGG TWS P NPNS + V
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
+ LL+A N + G K++L D D D L
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQ 358
Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
L + + EF YP I++ DG H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|28869101|ref|NP_791720.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
tomato str. DC3000]
gi|28852341|gb|AAO55415.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato str.
DC3000]
Length = 432
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F + W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L LFY + V W+G S + G +WS QL +
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P+ A LR +F+ RV S + DGG TWS P NPNS + V
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
+ LL+A N + G K++L D D D L
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEVYKEVIGQ 358
Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
L + + EF YP I++ DG H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|422658889|ref|ZP_16721320.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. lachrymans
str. M302278]
gi|331017513|gb|EGH97569.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. lachrymans
str. M302278]
Length = 432
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F + W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L LFY + V W+G S + G +WS QL +
Sbjct: 127 KYIRKLGNPVIALAPDNPLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P+ A LR +F+ RV S + DGG TWS P NPNS + V
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
+ LL+A N + G K++L D D D L
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEVYKEVIGQ 358
Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
L + + EF YP I++ DG H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|417303900|ref|ZP_12090941.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain [Rhodopirellula baltica WH47]
gi|327539850|gb|EGF26453.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain [Rhodopirellula baltica WH47]
Length = 733
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 97/206 (47%), Gaps = 27/206 (13%)
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
G + NKP +L +G + S++ G + + V+ D G +W + G
Sbjct: 499 GVTLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFVSKDQGETWHRRGR 558
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
PN + + + +L +L R+ NGI S S D G TWS A +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKTWSAAVTSKIKHPN 613
Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
+ +L+ GRLLL + + R L LS+D+G +W L L+E + SYP
Sbjct: 614 ARFHIRRLQSGRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGLMLDERKGI--SYP 671
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
Q+S+GS++I+Y NR V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697
>gi|32476765|ref|NP_869759.1| hypothetical protein RB11055 [Rhodopirellula baltica SH 1]
gi|32447311|emb|CAD77137.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 733
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 97/206 (47%), Gaps = 27/206 (13%)
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
G + NKP +L +G + S++ G + + V+ D G +W + G
Sbjct: 499 GVTLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFVSKDQGETWHRRGR 558
Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
PN + + + +L +L R+ NGI S S D G TWS A +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKTWSAAVTSKIKHPN 613
Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
+ +L+ GRLLL + + R L LS+D+G +W L L+E + SYP
Sbjct: 614 ARFHVRRLQSGRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGLMLDERKGI--SYP 671
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
Q+S+GS++I+Y NR V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697
>gi|373850784|ref|ZP_09593585.1| hypothetical protein Opit5DRAFT_1639 [Opitutaceae bacterium TAV5]
gi|372476949|gb|EHP36958.1| hypothetical protein Opit5DRAFT_1639 [Opitutaceae bacterium TAV5]
Length = 441
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 91/187 (48%), Gaps = 25/187 (13%)
Query: 137 NKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIPNESLSV 188
NKP +L +G ++L +SW ++ D G++W KY P P + +
Sbjct: 198 NKPTVLSDGTWFRPVVLGPEQSSRLHSW-----ISRDRGKTWEFLSKYEPSAPPEVARTW 252
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW---SYAKPTQLLNPNSGIDGVK 245
+P+ + +L +L RS +GI ++ ES D G TW SY + +N I +
Sbjct: 253 CEPMVIERRDGSLWMLFRSKDGIMQI---ESRDKGRTWENESYFTKERGINTRFFI--TR 307
Query: 246 LKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L+ GRLL N +R L LS+DDG +W L L+E + SYP +QA DGS++
Sbjct: 308 LQSGRLLFVVNDHPKARANLTAMLSEDDGKTWPHKLVLDERNPV--SYPDGMQAPDGSIY 365
Query: 304 ITYTYNR 310
+ Y R
Sbjct: 366 VIYDRGR 372
>gi|398868058|ref|ZP_10623483.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM78]
gi|398234577|gb|EJN20442.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM78]
Length = 432
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 144/387 (37%), Gaps = 83/387 (21%)
Query: 4 DCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK 63
D +K + F A+S H++ + + G + +F G+ EGA DV++ F
Sbjct: 47 DKEVKPIYTSRF---ASSGLVDFVHSAAVTALPDGSLMAVWFAGTREGAADVQVRSARFN 103
Query: 64 --DGRW-QSPIIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC-- 110
G W + ++A + + NPV+ P N L LFY + + W+G
Sbjct: 104 LSTGEWGEEQVMATRDSTQQSTRKYIRKLGNPVIALAPDNRLWLFY-VSVSMGGWAGSAV 162
Query: 111 -MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSW 162
+ S + G WS QL I P ++ P+ +G + E + +
Sbjct: 163 NVMVSDDFGGHWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEY 219
Query: 163 MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
+ ++ D G K+ + N I P+ R + +L + + RV S + D
Sbjct: 220 LYLSAD-GVVIDKFRISHGNNSLQPAIVPL---DGQRGIALLRYAGDTHHRVLASRTEDA 275
Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW---HD 277
G WS P NPNS + V D LL+A N + G K++L D D W HD
Sbjct: 276 GQNWSEPYPLVPSNPNSSLAAVATPDRGLLVALNDLQEGRFKLSLYVTDAKMDDWRLLHD 335
Query: 278 ----------------------------------ALTLE------------ENLAMEFSY 291
AL + + E+ Y
Sbjct: 336 LDKSPDPEGDPYTPLVYKEIIGKQFRGSSGKLRQALEAQFLANLDKRVCKSQECEFEYEY 395
Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
P I++ DG H+ Y++N T IKHV
Sbjct: 396 PYFIRSPDGMYHLVYSWNNTFIKHVTF 422
>gi|440745293|ref|ZP_20924589.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP39023]
gi|440372969|gb|ELQ09747.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
BRIP39023]
Length = 432
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRDSTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGSSWSPPWQLVTTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
LL+A N + G K++L D D TLE
Sbjct: 301 GKGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360
Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ E+ YP I++ DG H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|325107979|ref|YP_004269047.1| hypothetical protein Plabr_1413 [Planctomyces brasiliensis DSM
5305]
gi|324968247|gb|ADY59025.1| hypothetical protein Plabr_1413 [Planctomyces brasiliensis DSM
5305]
Length = 429
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 14/166 (8%)
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
G+ + V+ D G++W + G + +PN + ++ + +L +L+R+ G ++ S
Sbjct: 230 GASILVSTDEGKTWTRRGGVKMPNPNW--MEHRAIEREDGSLWMLVRTTKG---AMLTIS 284
Query: 220 CDGGLTWSYAK-PTQLLNPNSGIDGVKLKDGRLLLAY------NTVSRGVLKVALSKDDG 272
DGG +WS + P + +P + KLK GR LL N R L LS D+G
Sbjct: 285 QDGGKSWSTPEFPEGIKHPAARFFLEKLKSGRWLLVKHGKTIDNHEGRSHLSAWLSADEG 344
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W L L+E + SYP QA DGS++I+Y + R HV L
Sbjct: 345 KTWQGGLMLDERRGV--SYPDGFQADDGSIYISYDWQRGSKGHVNL 388
>gi|407694959|ref|YP_006819747.1| hypothetical protein B5T_01121 [Alcanivorax dieselolei B5]
gi|407252297|gb|AFT69404.1| hypothetical protein B5T_01121 [Alcanivorax dieselolei B5]
Length = 376
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 88/369 (23%), Positives = 144/369 (39%), Gaps = 76/369 (20%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRW--QSPIIAD 74
A+ A HA+++VE+ G +F GS EGA DV I F G W ++ ++
Sbjct: 4 ASDAETALVHAASMVELPDGRLRAFWFAGSREGAQDVGIHSSVFDPAAGAWSDETTVVTR 63
Query: 75 EE------PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWS-ER 124
++ +V + L NG + + + W+ + S+++G +W +R
Sbjct: 64 DQIREGWGRHVRKLGNAVPVLDDNGRMRLFVVAVSFGGWAASRLVVLESWDQGESWQFDR 123
Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L S K P+ +G + E +G + V D R I
Sbjct: 124 ALTTSPFLNISTLVKTPPVRFADGSVGLPVYHEMIGKFGEMLRVGEDN----RILDNARI 179
Query: 182 PNESLSVIQPVPF----HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
IQP+ H A LR S + G + S++ DGG W+ + L NP
Sbjct: 180 -GHGRKAIQPLVLVDSPHRATAFLR--NESDDHPGMLYRSDTYDGGSEWTPLVSSGLPNP 236
Query: 238 NSGIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHDAL-----------TLEEN 284
++ + GV L +G LL N+V R L + ++D G +W D + LEEN
Sbjct: 237 SAAVGGVALSEGHWLLVANCNSVERDDLCMRETRDSGRTWTDRIYFHNRVSWRGGDLEEN 296
Query: 285 LAM-----------------------------------EFSYPAVIQASDGSVHITYTYN 309
+ ++ YP VIQ S+G +H+ YT+N
Sbjct: 297 RFLNMIGEELDGTFGVIRDQPYLQRVEHNKCFARGCEFQYDYPYVIQTSNGDLHVLYTWN 356
Query: 310 RTQIKHVVL 318
++ I+H L
Sbjct: 357 KSAIRHAWL 365
>gi|171913941|ref|ZP_02929411.1| Glycosyl hydrolase, BNR repeat [Verrucomicrobium spinosum DSM 4136]
Length = 631
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 143/319 (44%), Gaps = 32/319 (10%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWL---QTFKDGR 66
A + TF +P + S +V + D+ + Y S + +W+ +T G+
Sbjct: 294 AARDITF---DSPHDTVLGSGLVLLKDRTTMMAVYSTPSSYYSKPGTVWIAARRTLDGGK 350
Query: 67 -WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC-----------MKRS 114
W+ + P+ +P + + +G + + +G + W G RS
Sbjct: 351 SWEMEQVVARHPDCQPSHPSVL-VSKDGTIHVFYLGFKKWAWHGVNPASDTASDIWTTRS 409
Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSS--VESWNSWGSWMEVTVDAGRS 172
++G TWS +++ G G + N + +G L+ S V + S + ++ D GRS
Sbjct: 410 KDQGSTWSVPQRIFTGYSG-ATNGAVETADGKLVVPFSHYVNNPGRLVSRVAISADGGRS 468
Query: 173 WRKYGPIYI--PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
W P+ + + ++P + + +L+R+ + + S DGGL W+
Sbjct: 469 WDLGAPLDMGGAGDHEGALEPAVLPLNDGRIWMLIRTTRKV--FWEAYSSDGGLNWTVPG 526
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAME 288
PT + ++ +L+DGR+ L +N S+G L++ALS DDG SW + L ++
Sbjct: 527 PTTIDASSAPASLTRLRDGRVALVWNQRSKGRRELQLALSSDDGKSWSRPVPLVRGKSV- 585
Query: 289 FSYPAVIQA-SDGSVHITY 306
+YP+VI+ SDG + I Y
Sbjct: 586 -TYPSVIEGPSDGDLWIGY 603
>gi|283782130|ref|YP_003372885.1| hypothetical protein Psta_4378 [Pirellula staleyi DSM 6068]
gi|283440583|gb|ADB19025.1| hypothetical protein Psta_4378 [Pirellula staleyi DSM 6068]
Length = 813
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 94/208 (45%), Gaps = 29/208 (13%)
Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWG----------------SWMEVTVDAGRSWRKY 176
G + NKP +L+NG L S+ + + G + + V+ D G +W +
Sbjct: 202 GCTLNKPTVLKNGEWLLPVSLWTRDRIGPAILKEAHKNLDVIRMANVFVSTDEGATWTRR 261
Query: 177 GPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
G + P + + + L +L R+ GI S S DGG +WS +P+++ N
Sbjct: 262 GGVEFPGTDFD--EHMIVEKNDGRLWMLARTKQGISE---SFSSDGGKSWSKPEPSKIAN 316
Query: 237 PNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
P++ +L G LLL N R + LS DDG +W L +++ E S
Sbjct: 317 PSARFFIRRLTSGNLLLVKNGPLDVRLPRRSSMTAYLSTDDGATWPHQLLIDDR--AEVS 374
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YP +++ DG +H+ Y +NR ++L
Sbjct: 375 YPDAVESPDGVIHLLYDWNRHTDAEILL 402
Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 70/161 (43%), Gaps = 10/161 (6%)
Query: 165 VTVDAGRSWRKYGPIYI-PNESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
++ D G++W ++G I + P+E + AN + +++R+ G + +ES DG
Sbjct: 641 ISSDGGKTWTEHGNIRLTPDERYHGWAENNIVELANGRIAMIIRADRLGGMLFYAESKDG 700
Query: 223 GLTW-SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDAL 279
G TW YA T++ NP S L + + +N S R L + +S D +W
Sbjct: 701 GKTWPEYASVTRVPNPGSKATLYPLGGNNVAMLHNPNSKHRSPLSLWISFDGMKTWPYQR 760
Query: 280 TLE----ENLAMEFSYPAVIQASDGS-VHITYTYNRTQIKH 315
L + +YP + D +H + NR + H
Sbjct: 761 VLHPESVDGPQGRINYPDGFVSQDKQFLHFAFDDNRHRAVH 801
>gi|196232246|ref|ZP_03131100.1| hypothetical protein CfE428DRAFT_4266 [Chthoniobacter flavus
Ellin428]
gi|196223614|gb|EDY18130.1| hypothetical protein CfE428DRAFT_4266 [Chthoniobacter flavus
Ellin428]
Length = 782
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 118/286 (41%), Gaps = 45/286 (15%)
Query: 67 WQSP---IIADEEPNVPMWNPVLFKL---PSNGLLLF------YKIGQEVQKWSGCMKRS 114
W P I + PN P ++ L P+ L LF Y G++ + C
Sbjct: 97 WSKPRVVIDPTDPPNAPQRRALVGNLWTDPTGKLWLFFDQSLGYFDGRDGDWFITCDNPD 156
Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSW------------ 162
++ V WS+ + G + NKP +L NG L ++ + + GS
Sbjct: 157 ADEPV-WSKPTRFADGC---TLNKPTVLSNGDWLLPVALWTRDHIGSASLKEAHHELDPI 212
Query: 163 ----MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
+ + D G++W + G + P V L +L R+ GI S
Sbjct: 213 RMANVFASTDQGKTWARRGGVAFPGSEFDEHMIVELRDGR--LWMLARTKKGISE---SF 267
Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDG 272
S D G+TW P+ + NP++ +L G LLL N R L LS DDG
Sbjct: 268 STDHGVTWGEPHPSAIQNPSARFFIRRLASGNLLLVKNGPIETRLPRRSSLTAFLSTDDG 327
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W L L++ + SYP +QA DG+++++Y +NR ++L
Sbjct: 328 KTWGKGLLLDDRAVV--SYPDGVQAPDGTIYLSYDWNRHTDAEILL 371
>gi|218516178|ref|ZP_03513018.1| putative glycosyl hydrolase protein [Rhizobium etli 8C-3]
Length = 300
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 68/158 (43%), Gaps = 51/158 (32%)
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------------- 257
+ S S DGG WS +PT+L N NS I L +G + + YN
Sbjct: 139 ILSSRSADGGENWSPPEPTELPNNNSSIQATVLDNGGIAMVYNHSNADMSDARRQSLYDE 198
Query: 258 -----------------------VSRGVLKVALSKDDGDSWHDALTLE------------ 282
V R L +A+S+D G ++ + L+
Sbjct: 199 IEGDEAGENAAVVADAGRKAVWGVPRAPLSLAISRDGGRTFPHHMDLDTGDGFCLSNNSK 258
Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
++L EFSYP+VIQ SDG++H+ YTY R IK+V L P
Sbjct: 259 DSLNREFSYPSVIQGSDGTLHVAYTYYRRAIKYVRLAP 296
>gi|412990692|emb|CCO18064.1| putative exo-alpha-sialidase [Bathycoccus prasinos]
Length = 656
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/335 (27%), Positives = 139/335 (41%), Gaps = 62/335 (18%)
Query: 25 KSCHASTI----VEVDKGHFLVAYFGGS-CEGAPDVKIWLQTFKD--GRWQSPIIADEEP 77
K H +TI V K F VA+ S EG I + +D W P
Sbjct: 244 KYSHMATIGRWNVNATKKGFAVAWQSSSGIEGIRGQSILISYAEDVDQPWSKPKRVSSMQ 303
Query: 78 NV--PMWNPVLFKLPS--NGLLLFYKIGQEV-----QKWSGC-------------MKRSY 115
NV +W+PVLF P+ NGL LFY Q + SG + +S
Sbjct: 304 NVKTAVWSPVLFNPPNEENGLWLFYTESQNCLRPARENKSGSKIPPRFSPGGDVKVTKSR 363
Query: 116 NKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW--NSWGSWME--------- 164
+ G TWSE +++ S P ++ N + + + +W W E
Sbjct: 364 DGGKTWSEPQEVHAQSKEDSPGMPKVIANRITVH-TQTGNWVLPYWKEKAESNACRQPKS 422
Query: 165 -------VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
V+ D G+SW+ G I + L I+ + +L++ R+ G +V +
Sbjct: 423 PASHGVLVSKDQGKSWQPKGKIRVGGTWL--IEGAVVERRDYSLQLFFRTSKG--KVYTA 478
Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR-LLLAYNTVS--------RGVLKVALS 268
S DGG TW+ A PT + NP+S +D D R +++A+N + R L+V++S
Sbjct: 479 FSWDGGDTWTGANPTTVPNPDSKMDATSFPDTREIVVAFNDATDKNKINKGRCRLRVSIS 538
Query: 269 KDDGDSWHDALTLEE-NLAMEFSYPAVIQASDGSV 302
D G + LEE +L M YP VI D ++
Sbjct: 539 CDGGVGFKTITELEEGSLKMYIHYPTVITDGDRAI 573
>gi|440717316|ref|ZP_20897806.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula baltica SWK14]
gi|436437502|gb|ELP31128.1| secreted protein containing Inosine/uridine-preferring nucleoside
hydrolase domain protein [Rhodopirellula baltica SWK14]
Length = 733
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 102/219 (46%), Gaps = 30/219 (13%)
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------V 165
TWS ++ G+ + NKP +L +G + S++ G + + V
Sbjct: 489 TWSAPRRIWHGV---TLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFV 545
Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
+ D G +W + G PN + + + +L +L R+ NGI S S DGG
Sbjct: 546 SKDRGETWHRRGRATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDGGKN 600
Query: 226 WSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDAL 279
WS A +++ +PN+ +L+ RLLL + + R L LS+D+G +W L
Sbjct: 601 WSAAVASKIKHPNARFHIRRLQSSRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGL 660
Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L+E + SYP Q+S+GS++I+Y NR V++
Sbjct: 661 MLDERKGI--SYPDGFQSSNGSIYISYDRNRATDGEVLM 697
>gi|416015491|ref|ZP_11563065.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. B076]
gi|416028065|ref|ZP_11571190.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. race 4]
gi|320325049|gb|EFW81118.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. B076]
gi|320327846|gb|EFW83852.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 432
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|298157627|gb|EFH98707.1| BNR/Asp-box repeat protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
Length = 432
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|422598475|ref|ZP_16672736.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|330988753|gb|EGH86856.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
lachrymans str. M301315]
Length = 432
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|289648789|ref|ZP_06480132.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 2250]
Length = 432
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|307107037|gb|EFN55281.1| hypothetical protein CHLNCDRAFT_134209 [Chlorella variabilis]
Length = 410
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 73/145 (50%), Gaps = 7/145 (4%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
++ D+G +W+ G I + ++ P + L ++ R+ GR MS S D G+
Sbjct: 250 ISRDSGATWKASGDIE--DARTWLVNPTLEEGSKGQLIMMFRT--STGRTYMSSSADKGV 305
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLE 282
TWS L NPNS V + DG++L +N R L +ALS +D SW +E
Sbjct: 306 TWSRPTYNSLPNPNSPFSTVTI-DGQVLCVFNNSQTIRAPLALALSVNDCKSWEPLAMVE 364
Query: 283 ENLAMEFSYPAVIQASDGSVHITYT 307
E+ FS P++++ +D +V + YT
Sbjct: 365 EDATGNFSCPSIVEWADDTVKVAYT 389
>gi|395500535|ref|ZP_10432114.1| hypothetical protein PPAM2_30810 [Pseudomonas sp. PAMC 25886]
Length = 417
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
H+S++ + G + +F GS EGA DV++ F K W ++A E
Sbjct: 53 VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEQVLATRESTRDGTH 112
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L +FY + V W+ + S + G WS QL I
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTHWSAPRQL---IT 168
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVVDKFRISRGKNSL 227
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P R + +L + RV S + D G TWS P + NPNS + V
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
+ LL+A N + G K++L D + W + L+++
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDANLNQWRTVVELDQSPDPLGQPFSPEAYKEIIGE 344
Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
E+ YP ++ DG H+ Y++N T IK
Sbjct: 345 GFRESSGARRLPLEQRFLSSLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404
Query: 315 HV 316
HV
Sbjct: 405 HV 406
>gi|410089991|ref|ZP_11286592.1| BNR/Asp-box repeat-containing protein [Pseudomonas viridiflava
UASWS0038]
gi|409762627|gb|EKN47637.1| BNR/Asp-box repeat-containing protein [Pseudomonas viridiflava
UASWS0038]
Length = 432
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 86/362 (23%), Positives = 131/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSP-IIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F W S ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAATEEWGSEHVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
+ NPV+ P L LFY + V W+G S + G +WS QL +
Sbjct: 127 KYIRKLGNPVIALAPDKRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 183 SPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFAEYLYLSPD-GDVIDKFRISRGKNSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+ P+ A LR + + RV S + DGG TWS P NPNS + V
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETHH---RVLASRTEDGGQTWSEPHPLSPSNPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE--------------------- 282
+ LL+A N + G K++L D D W + L+
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYGTDEQMDIWRPVIDLDKSPDPLGTPFSLEAYKEVIGE 358
Query: 283 ----------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
+ E+ YP I++ +G H+ Y++N T IK
Sbjct: 359 GFRSSSGARRQPMEAEFLSNLDQRVCSPQGCDFEYEYPYFIRSPEGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|395799123|ref|ZP_10478405.1| hypothetical protein A462_27693 [Pseudomonas sp. Ag1]
gi|395336810|gb|EJF68669.1| hypothetical protein A462_27693 [Pseudomonas sp. Ag1]
Length = 417
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
H+S++ + G + +F GS EGA DV++ F K W ++A E
Sbjct: 53 VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEQVLATRESTRDGTH 112
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L +FY + V W+ + S + G WS QL I
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTNWSAPRQL---IT 168
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSPD-GAVVDKFRISRGKNSL 227
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P R + +L + RV S + D G TWS P + NPNS + V
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
+ LL+A N + G K++L D + W + L+++
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDAKLNQWRTVVELDQSPDPLGQPFSPEAYKEIIGE 344
Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
E+ YP ++ DG H+ Y++N T IK
Sbjct: 345 GFRASSGALRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404
Query: 315 HV 316
HV
Sbjct: 405 HV 406
>gi|421139204|ref|ZP_15599246.1| BNR/Asp-box repeat protein [Pseudomonas fluorescens BBc6R8]
gi|404509579|gb|EKA23507.1| BNR/Asp-box repeat protein [Pseudomonas fluorescens BBc6R8]
Length = 417
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
H+S++ + G + +F GS EGA DV++ F K W ++A E
Sbjct: 53 VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEHVLATRESTRDGTH 112
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P N L +FY + V W+ + S + G WS QL I
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTNWSAPRQL---IT 168
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ N
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGVPVYHEFMGKFAEYLYLSPD-GAVVDKFRISRGKNSL 227
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P R + +L + RV S + D G TWS P + NPNS + V
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
+ LL+A N + G K++L D + W + L+++
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDAKLNQWRTVVELDQSPDPLGQPFSPAAYKEIIGE 344
Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
E+ YP ++ DG H+ Y++N T IK
Sbjct: 345 GFRASSGALRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404
Query: 315 HV 316
HV
Sbjct: 405 HV 406
>gi|422584317|ref|ZP_16659428.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
gi|330869135|gb|EGH03844.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
aesculi str. 0893_23]
Length = 432
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GELIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ A LR +F+ RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|431927356|ref|YP_007240390.1| hypothetical protein Psest_2228 [Pseudomonas stutzeri RCH2]
gi|431825643|gb|AGA86760.1| hypothetical protein Psest_2228 [Pseudomonas stutzeri RCH2]
Length = 431
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 138/366 (37%), Gaps = 88/366 (24%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
HAS++ + G + +F GS EGA DV+I W + +S A
Sbjct: 67 VHASSVTSLPGGKLMAVWFAGSREGAADVQIRGARYDALTAEWGEELVLATRESTQQATR 126
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP--- 129
+ + N V+ P N L LFY + + W+G S + G +WS +QL
Sbjct: 127 KYIRKLGNSVIALAPDNRLWLFY-VSVSIGGWAGSAINTMYSDDFGQSWSTPKQLITTPF 185
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVD---AGRSWRKYGPIYIPNES 185
I +N P+ +G + E + ++ ++ G+S I + +S
Sbjct: 186 LNISTLVRNAPVFHRDGTIGLPVYHEFLGKFAEYLYISPTGDVVGKS-----RISMGTDS 240
Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIG--RVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
L QP VP N + M + G+ RV S + D G TWS P NPNS +
Sbjct: 241 L---QPTVVPMDEQNA---IAMLRYAGVTHHRVLASRTEDAGRTWSKPFPIDPANPNSAL 294
Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN--------------- 284
V + LL+A N + G ++ L + + + W + L+E+
Sbjct: 295 AAVATPNHGLLVALNDLEDGRFRLRLMETNSSLELWKPVIDLDESPDPEGNPFSPQAYRE 354
Query: 285 ----------------------------------LAMEFSYPAVIQASDGSVHITYTYNR 310
E+ YP I+++DG H+ Y++N
Sbjct: 355 IIGDKFRLSSGPRRLSLVDEFLTNLDQRVCKTHGCDFEYEYPYFIRSADGLYHLVYSWNN 414
Query: 311 TQIKHV 316
T IKHV
Sbjct: 415 TFIKHV 420
>gi|398960857|ref|ZP_10678356.1| hypothetical protein PMI25_00011 [Pseudomonas sp. GM30]
gi|398153510|gb|EJM42009.1| hypothetical protein PMI25_00011 [Pseudomonas sp. GM30]
Length = 431
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 135/362 (37%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
H+S++ + G + +F GS EGA DV+I F W + ++A E
Sbjct: 67 VHSSSVTALPGGDLMAVWFAGSREGAADVQIRTARFDANTAEWGAEQVLATRESTRDGTR 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P L +FY + V W+ + S + G WS QL I
Sbjct: 127 RYIRKLGNPVIALAPDQRLWMFY-VSVSVGGWATSAINLMVSDDFGRNWSVPRQL---IT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ +
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKHSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P+ R + +L + + RV S + D G TWS P + NPNS + V
Sbjct: 242 QPTIVPL---DGKRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEE-------------------- 283
D LL+A N + G K++L D + +W + + L++
Sbjct: 299 TGDNGLLVALNDLQDGRFKLSLYGTDANLSNWRNVIELDQSPDPLGQPFAPEAYRAIIGE 358
Query: 284 -------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQIK 314
NL E+ YP + SDG H+ Y++N T IK
Sbjct: 359 GFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRDSDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|344997734|ref|YP_004800588.1| hypothetical protein SACTE_0092 [Streptomyces sp. SirexAA-E]
gi|344313360|gb|AEN08048.1| conserved hypothetical protein [Streptomyces sp. SirexAA-E]
Length = 396
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/326 (23%), Positives = 132/326 (40%), Gaps = 50/326 (15%)
Query: 33 VEVDKGHFLVA--YFGGSCEGAPDVKIWLQTFKDGR-WQSP--IIADEEPNVPMWNPVLF 87
VE +G L A Y G E + + + DG W P ++ ++P V +++P L+
Sbjct: 34 VERTRGGRLYANWYTGMETETGGNFVVITSSDDDGTTWTGPRFVVEHDDPEVRVYDPCLW 93
Query: 88 KLPSNGLLLFYKIGQE-----VQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILL 142
+ P+ L + ++ + W + W+ ++ GI+ NKP +L
Sbjct: 94 RDPAGRLWATWNQSRDFFDGRIGVWVATSGNPDDDEPVWTAPRRIANGIM---MNKPTVL 150
Query: 143 ENGLLL-------CGSSVESWNSWG---SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
+G L C + E S + V+ D G ++ G +PN S + +
Sbjct: 151 ADGTWLFPAAIWACHTPTEEHALEAERFSNVYVSTDEGETFAYLGGADVPNRSFD--EHM 208
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L +L+R F+G+G S S DGG TWS + + + P S +L GRLL
Sbjct: 209 VVEKRDGELWMLVRCFDGVGE---SFSKDGGRTWSPGRRSHIDGPCSRFHIRRLPSGRLL 265
Query: 253 LAYNT--------------------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
+ + R L LS DDG +W L L++ + SYP
Sbjct: 266 MINHADFGDRRSREEIEEQGNVKEWKGRTNLTAFLSDDDGATWPHRLLLDDR--DDVSYP 323
Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
+DG + + Y ++R + V L
Sbjct: 324 DAAVGADGRIFVVYDHDRFGDRAVYL 349
>gi|171910491|ref|ZP_02925961.1| hypothetical protein VspiD_04945 [Verrucomicrobium spinosum DSM
4136]
Length = 650
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 117/283 (41%), Gaps = 39/283 (13%)
Query: 67 WQSPIIADEEPN-VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
W P+ + P + +P+ + P L +FYK Q K +G + + T++ R
Sbjct: 91 WSKPVAVIQGPEGTAIADPLPWVDPQGRLWIFYK--QVTPKTAGQAESGFAG--TFAIRA 146
Query: 126 QLP---------PGILGPSKN---KPILLENGLLLCGSSVESWNSWGSWME-------VT 166
P P ++G KP++L +G L + + ++W M VT
Sbjct: 147 DDPEKAEPQWTVPQLIGEGGTLFGKPLVLPDGRWLAPFFLSARSAWREQMAGKETGVIVT 206
Query: 167 VDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
DAG +W G I E + + + ++ + +R+ G+ S S D G TW
Sbjct: 207 SDAGATWAWQGGTSIAPELRNFSEATLAPRRDGSILMAIRTQKGL---YESTSPDAGKTW 263
Query: 227 SYAKPTQLLNPNSGIDG--VKLKDGRLLLAYN----TVS----RGVLKVALSKDDGDSWH 276
S A P +P S ++LK G LL Y+ T S R L LS D+G +W
Sbjct: 264 SEAVPMAGFSPGSATRACLLRLKSGAFLLVYHEPETTASGKYPRAKLTAWLSDDEGRTWS 323
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
L L+E SYP QA DG + ITY R + +L
Sbjct: 324 HKLLLDER--ARVSYPDATQAPDGRIFITYDLGRYEPADKAIL 364
>gi|424923654|ref|ZP_18347015.1| neuraminidase (sialidase) [Pseudomonas fluorescens R124]
gi|404304814|gb|EJZ58776.1| neuraminidase (sialidase) [Pseudomonas fluorescens R124]
Length = 431
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 135/362 (37%), Gaps = 80/362 (22%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
H+S++ + G + +F GS EGA DV+I + W + ++A E
Sbjct: 67 VHSSSVTALPGGDLMAVWFAGSREGAADVQIRTARYDANSAEWGAEQVLATRESTRDGTQ 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P L +FY + V W+ + S + G WS QL I
Sbjct: 127 RYIRKLGNPVIALAPDQRLWMFY-VSVSVGGWATSAINVMVSDDVGRNWSAPRQL---IT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P N P+ +G + E + ++ ++ D G K+ +
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKHSL 241
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
I P+ R + +L + + RV S + D G TWS P + NPNS + V
Sbjct: 242 QPTIVPL---DGRRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVG 298
Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE-------------------- 283
D LL+A N + G K++L D ++W + + L++
Sbjct: 299 TDDNGLLVALNDLQDGRFKLSLYGTDASLNNWRNVIELDQSPDPLGQPFAPDAYKAIIGE 358
Query: 284 -------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQIK 314
NL E+ YP + SDG H+ Y++N T IK
Sbjct: 359 GFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRGSDGLYHLVYSWNNTFIK 418
Query: 315 HV 316
HV
Sbjct: 419 HV 420
>gi|110834776|ref|YP_693635.1| hypothetical protein ABO_1915 [Alcanivorax borkumensis SK2]
gi|110647887|emb|CAL17363.1| conserved hypothetical protein [Alcanivorax borkumensis SK2]
Length = 436
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 84/368 (22%), Positives = 135/368 (36%), Gaps = 73/368 (19%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
A+ A + HA+++VE+ G +F G+ EGA DV I F + G W I++
Sbjct: 52 ASDADTRMVHAASMVELPDGRLRAFWFAGTREGAADVSINSAIFDPQLGTWSDEIVSVTR 111
Query: 77 PNVPM-WNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSERE 125
+ W + KL +G + + + W+ + S + G +W
Sbjct: 112 EQISQGWGRHIRKLGNALPVLDEDGRMRLFVVAVSFGGWAASRLVVLESSDMGASWEFDT 171
Query: 126 QLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L I K PI +G + E +G + + D R G I
Sbjct: 172 SLVATPFLNISTLVKTPPIHYSDGTIGLPVYHEMIGKFGEILRLDRDN----RILGKARI 227
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFN--GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
IQP+ N +R+ N G + S S DGG TW+ + + L NP+S
Sbjct: 228 -GHGRKAIQPLVLVDNNNHAVSFLRNENEPNNGYLYQSSSNDGGWTWTPLEYSPLDNPSS 286
Query: 240 GIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHD-------------------- 277
+ GV + G L+A N R L + ++D G W D
Sbjct: 287 ALGGVAMGPGHWLVATNCNREERDDLCIRETRDSGQHWEDRWFFHNREQWRGNALEPGTF 346
Query: 278 -----------------ALTLE----------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
L LE A ++ YP +++ +G +H+ YT+N+
Sbjct: 347 VDLVDDEFAGETIPEDRRLLLERVKANKCHGDHGCAFQYDYPYMVRTRNGDLHMLYTWNK 406
Query: 311 TQIKHVVL 318
T I+H L
Sbjct: 407 TAIRHAWL 414
>gi|71734501|ref|YP_275606.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|71555054|gb|AAZ34265.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. phaseolicola
1448A]
Length = 432
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 85/359 (23%), Positives = 133/359 (37%), Gaps = 74/359 (20%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+ P+ R + +L + + RV S + DGG TWS +P NPNS + V +
Sbjct: 245 VVPL---DGQRAVALLRYAGETLHRVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301
Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361
Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
F Y P I++ DG H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420
>gi|402703439|ref|ZP_10851418.1| hypothetical protein RhelC_04125 [Rickettsia helvetica C9P9]
Length = 111
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 19/103 (18%)
Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDS 274
+ G+ WS +P LLNP+S ID + L D LLL YN V SR +L VA S D+G +
Sbjct: 2 INNGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLTYNRVINNHKSRNILSVATSYDEGLN 61
Query: 275 WHDALTLEEN------------LAMEFSYPAVIQASDGS-VHI 304
WH +T+ + L+ E+SYPA+I + D + +H+
Sbjct: 62 WH-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHV 103
>gi|373853013|ref|ZP_09595813.1| hypothetical protein Opit5DRAFT_3868 [Opitutaceae bacterium TAV5]
gi|372475242|gb|EHP35252.1| hypothetical protein Opit5DRAFT_3868 [Opitutaceae bacterium TAV5]
Length = 454
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 89/199 (44%), Gaps = 22/199 (11%)
Query: 128 PPGIL--GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIP 182
PP L G NKP +L +G + + V+ + G++W K+ P P
Sbjct: 196 PPQFLAYGVCINKPTVLSDGTWFRTVVLGPEDPSRLHFYVSRNKGKTWEFLSKHTPSAPP 255
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+ + +P+ + +L L RS I ++ ES DGG TW K GID
Sbjct: 256 EIAKAWCEPMVIERRDGSLWTLFRSERSIKQI---ESSDGGKTW---KNESDFTTERGID 309
Query: 243 G----VKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
+L+ GRLL N SR L LS DDG +W L L+E A+ SYP +Q
Sbjct: 310 TRFFITRLQSGRLLFVVNDHPKSRANLTAMLSDDDGKTWSHKLVLDERNAV--SYPDGLQ 367
Query: 297 ASDGSVHITY---TYNRTQ 312
DG+++I Y YN+ Q
Sbjct: 368 TPDGNIYIIYDRGRYNKDQ 386
>gi|225163403|ref|ZP_03725721.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
gi|224801991|gb|EEG20269.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
Length = 434
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 86/183 (46%), Gaps = 17/183 (9%)
Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIPNESLSVIQPVP 193
NKP +L NG L + + ++ D G++W+ K+ P P + + + +
Sbjct: 192 NKPTVLANGDWLRPVVASREDPARIQIYISRDKGKTWQFLSKHTPGQAPEIAKTYCESML 251
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG----VKLKDG 249
+ +L +L RS N I ++ ES D G +W + GID +L+ G
Sbjct: 252 IERRDGSLWMLFRSKNSIKQI---ESFDSGKSWQNESD---FSTERGIDTRFFITRLQSG 305
Query: 250 RLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
+LLL N +R L LS DDG +W L L+E + SYP +QA DGS+++ Y
Sbjct: 306 KLLLVINDHPKARANLTAMLSDDDGKTWPHKLVLDERNPV--SYPDGVQAPDGSIYVIYD 363
Query: 308 YNR 310
+ R
Sbjct: 364 HGR 366
>gi|70731208|ref|YP_260949.1| BNR/Asp-box repeat-containing protein [Pseudomonas protegens Pf-5]
gi|68345507|gb|AAY93113.1| BNR/Asp-box repeat protein [Pseudomonas protegens Pf-5]
Length = 431
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/370 (22%), Positives = 134/370 (36%), Gaps = 96/370 (25%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
H+S++ + G + +F GS EGA DV++ F + G W + ++A E
Sbjct: 67 VHSSSVTALPGGDLMAVWFAGSREGAADVQVRSARFDAQSGEWGAEQVLATRESTRVGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
+ NPV+ P L +FY + V W+ + S + G +WS QL +
Sbjct: 127 RYIRKLGNPVIALGPDQRLWMFY-VSVSVGGWATSAINLMVSDDLGRSWSAPRQL---VT 182
Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAG--------RSWRKYG 177
P N P+ +G + E + ++ ++ D R
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSADGAVIDKFRISRGKHSLQ 242
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
P +P + R + +L + + +V S + D G TWS P Q NP
Sbjct: 243 PTIVPLDE------------RRAVAMLRYAGDTHHKVLASRTEDAGQTWSEPYPLQPSNP 290
Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW------------------HD 277
NS + V D LL+A N + G K++L D W H+
Sbjct: 291 NSSLAAVGTDDQGLLVALNDLQDGRFKLSLYGTDAGLSQWRPLVELDQSPDPLGQPFSHE 350
Query: 278 A-----------------LTLEENLA--------------MEFSYPAVIQASDGSVHITY 306
A L LE+ E+ YP + +DG H+ Y
Sbjct: 351 AYKDIIGEGFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRGTDGLYHLVY 410
Query: 307 TYNRTQIKHV 316
++N T IKHV
Sbjct: 411 SWNNTFIKHV 420
>gi|149916410|ref|ZP_01904929.1| BNR/Asp-box repeat protein [Roseobacter sp. AzwK-3b]
gi|149809680|gb|EDM69534.1| BNR/Asp-box repeat protein [Roseobacter sp. AzwK-3b]
Length = 373
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 136/313 (43%), Gaps = 34/313 (10%)
Query: 20 NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW---LQTFKDGRWQ-------- 68
+ P + S + ++ + +F GS E PDV+I L+ ++ G WQ
Sbjct: 52 DHTPGEGMAHSPAITINGDEVGILWFQGSSEAQPDVEIHRAVLRRYETG-WQAGPPEPYL 110
Query: 69 -SPIIADE-EPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWS--GCMKRSYNKG-VT 120
S + D EP V + N V + + L Y V W+ +KG
Sbjct: 111 TSAALGDAFEPRQLVVTLGNTVQNEAAQDAL---YATVVSVGGWAMASVADVRMDKGNPV 167
Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGS---SVESWNSWGSWMEVTVDAGRSWRKYG 177
W+ + L P + N+ +L+++ ++ ++ ++ GS V V R
Sbjct: 168 WARKLNLSPFL-----NRSMLVKSPMVAYSDGSMALPAYFEMGSTYGVLVRTDAQGRARD 222
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ + + IQP+ +R F+ G+V +S + DGG TWS A+ T + +P
Sbjct: 223 LRRMTGKRIKPIQPMIVVLDGARALAFLRDFDNSGQVYVSRTQDGGRTWSEAEATGIAHP 282
Query: 238 NSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
++ + + + G +L+A N S +L +ALS+D+G SW TL+ + YP +
Sbjct: 283 SAPVAALPVSGGHILMAMNGGADSADLLHLALSEDEGGSWRVIHTLDRQPG-DARYPMLR 341
Query: 296 QASDGSVHITYTY 308
+ DG + + Y++
Sbjct: 342 RLPDGQILLAYSH 354
>gi|365129132|ref|ZP_09340823.1| hypothetical protein HMPREF1032_02587 [Subdoligranulum sp.
4_3_54A2FAA]
gi|363621763|gb|EHL72956.1| hypothetical protein HMPREF1032_02587 [Subdoligranulum sp.
4_3_54A2FAA]
Length = 386
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 130/329 (39%), Gaps = 69/329 (20%)
Query: 40 FLVAYFGGSCEGAPDVKIWLQTFKDG---RWQSPIIADEEPNVPMWNPVLFKLPSNGLLL 96
F+ Y G E ++ I + G R + ++ +P+ +++P L+ P L +
Sbjct: 43 FVNFYSGQDAEVGGNIMILCISDDQGKTFRSCAAVVEHPDPSCRIYDPNLWIDPLGRLWM 102
Query: 97 FYKIGQEVQKWSGCMKRSYNKGVT--WSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
Y R +N G + W+ + P + P + P + NG+++ V
Sbjct: 103 TY-----------AQARGFNDGRSGVWAAICEQPDADV-PQWSAPRRIANGIMMNKPLVT 150
Query: 155 SWNSW----GSWME---------------------VTVDAGRSWRKYGPIYIPNESLSVI 189
S N W W + ++D G + G +P S
Sbjct: 151 SRNEWLFPCAIWRDDSGSVPTERHGLENEQFSNVYASIDEGHTIALRGHADVPARSFD-- 208
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
+ + + TL +L+R+F+GIG S S DGG TWS + + + P S +L G
Sbjct: 209 EHMLVEKRDGTLWMLVRTFDGIGE---SFSSDGGRTWSPGQKSHIDGPCSRFFIRRLSSG 265
Query: 250 RLLLAYNT--------------------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
RLL+ + R L LS+DDG +W L L+E E
Sbjct: 266 RLLMINHYKFSHRIDLDDIMRQGNVKRWQGRSHLTAMLSEDDGATWPYTLLLDER--DEV 323
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SYP +A DG +++TY + R + +++
Sbjct: 324 SYPDAKEADDGYIYVTYDHQRVTKREILM 352
>gi|407801650|ref|ZP_11148494.1| hypothetical protein S7S_00727 [Alcanivorax sp. W11-5]
gi|407025087|gb|EKE36830.1| hypothetical protein S7S_00727 [Alcanivorax sp. W11-5]
Length = 434
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 146/377 (38%), Gaps = 83/377 (22%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRW--QS 69
+ F ++ A ++ HA+++VE+ G +F G+ EG+ DV I F G W +
Sbjct: 42 QLNFASDGATMQT-HAASMVELPDGRIRAFWFAGTREGSADVNIHSSVFDPATGDWTGEQ 100
Query: 70 PIIADEEPN------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVT 120
++ E+ + V V+ + S+G L Y + W+ + S + G T
Sbjct: 101 VMLTREQVSEGLGRYVRKLGNVVPAVDSDGRLRLYMVVVSFGGWAASRLAVAESADDGST 160
Query: 121 WSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
W + L + P K PI E+G L E +G + + D
Sbjct: 161 WRISDGL---VTSPFFNLSTLVKAPPIRYEDGSLGLPVYHEFAAKFGEVLRL--DEDNRI 215
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKP 231
+ I ESL QP+ + +R+ N G + S++ D G +WS
Sbjct: 216 LERSRIGKLRESL---QPLVLVNGPDSAVAFLRNANNSRPGILWRSDTRDTGHSWSSLDD 272
Query: 232 TQLLNPNSGIDGVKL-KDGRLLLA-YNTVSRGVLKVALSKDDGDSWHDALTLEENLAM-- 287
QL NP S + GV+L KD LL+A N V R L + ++D G +W + ++ A+
Sbjct: 273 GQLPNPGSAVGGVRLGKDHWLLVANNNAVERDDLYILETRDHGQTWQTLHAMHDDAALRN 332
Query: 288 ------------------------------------------------EFSYPAVIQASD 299
+F YP VI+AS+
Sbjct: 333 APVSVDGFRDMVRTALSSIDEGPTTKDQRDRVMQAAVRNNCRSDFCVSQFDYPYVIRASN 392
Query: 300 GSVHITYTYNRTQIKHV 316
G +HI Y++ ++ I H
Sbjct: 393 GDIHILYSWKKSLIAHA 409
>gi|422675853|ref|ZP_16735192.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
syringae pv. aceris str. M302273]
gi|330973566|gb|EGH73632.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
syringae pv. aceris str. M302273]
Length = 321
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 109/264 (41%), Gaps = 25/264 (9%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSP-IIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 13 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQAGTG 72
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 73 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 131
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
I + P+ +G + E + ++ ++ D G K+ I N SL
Sbjct: 132 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKF-RISRGNNSLQP 189
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+ P+ A LR +F+ RV S + DGG +WS P NPNS + V +
Sbjct: 190 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 246
Query: 248 DGRLLLAYNTVSRGVLKVALSKDD 271
LL+A N + G K++L D
Sbjct: 247 GKGLLVALNDLREGRFKLSLYGTD 270
>gi|78211616|ref|YP_380395.1| hypothetical protein Syncc9605_0061 [Synechococcus sp. CC9605]
gi|78196075|gb|ABB33840.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 415
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 144/361 (39%), Gaps = 77/361 (21%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
HA + + + GS EG + ++L + GRW P + NP+
Sbjct: 44 HAPQLCWIRPDLLGCVWMAGSGEGNAGMSVFLSLLGAEGGRWSEPQRISRDQERSEQNPL 103
Query: 86 LFKLPSNGLLLFYKIGQEVQ----------------KWSGCMK--RSYNKGV------TW 121
LF S+G L Q V+ +W+ ++ R G+ TW
Sbjct: 104 LFV--SDGCLHLIHSAQLVRDPEDRAAVDASSSFSMQWTAILRHQRLALDGLDPSDLETW 161
Query: 122 S-EREQLPPGILGP---SKNKPILLENG--LLLCGSSVESWNSWG---SWMEVTVDAGRS 172
S E +P + +N P LENG LL S+E+ ++G S M +G+
Sbjct: 162 SAEAWSMPVDLFDDPAFCRNPPYPLENGHWLLPIYRSLEAGGAFGHDHSEMVRLDPSGQC 221
Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNG--IGRVCMSESCDGGLTWSYAK 230
+ P IP+ + V V A+R L++ F ++ S S DGG TWS +
Sbjct: 222 LDR--PFGIPDSTGRVHGSV---VASRGGEQLLQFFRSRLADQIYRSVSSDGGHTWSAPE 276
Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTV----------------------SRGVLKVALS 268
PTQL N NS I L GRL + +N +R L +A+S
Sbjct: 277 PTQLPNNNSSIQACWLASGRLAMIFNRFGFAPDPGASEEPLKWGEARWPRTRWPLSIAIS 336
Query: 269 KDDGDSW---HDALT-------LEENLAMEFSYPAVIQASDGSVHITYTY-NRTQIKHVV 317
DDG W D T + +L + +YP +I+ G +H+ Y++ R I++V
Sbjct: 337 DDDGLQWPWIRDIDTGFGFCGPMNWDLNGQLAYPTLIEGQPGELHVAYSWAGRQAIRYVT 396
Query: 318 L 318
L
Sbjct: 397 L 397
>gi|291276607|ref|YP_003516379.1| hypothetical protein HMU03750 [Helicobacter mustelae 12198]
gi|290963801|emb|CBG39637.1| Putative inner membrane protein [Helicobacter mustelae 12198]
Length = 382
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 38/321 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW---LQTFKDGRWQSPI-IAD------- 74
S H+STI + LVA+F GS EGA DV I+ L++ +WQ P I D
Sbjct: 59 SAHSSTITPIRDNSLLVAFFAGSREGAADVAIYGSILESAYSKKWQQPFKILDRFMLMQD 118
Query: 75 -EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGV----TWSEREQ 126
E + NPVL+ L L + +G + W+ Y NKG + +
Sbjct: 119 SREYIAKLGNPVLYTL--GDTLHLFVVGVSIGGWATSKIYHYSAENKGAIPTFVFQKALH 176
Query: 127 LPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIP 182
L P I + +P+ + G + ++ + + + DA + P
Sbjct: 177 LSPFLNISNLVRTRPVEITLSTQEKGFILPIYHELATKYPINLVFDARGNLLMTSQ---P 233
Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
N ++QP + + + R+ + +++CD L + PT L N ++ ++
Sbjct: 234 NHLNGLLQPSLTSISQTSCMLAYRANKKAKNILYTQTCDNNLKYGPLTPTNLKNYDNSLN 293
Query: 243 GVKLKDGRLLLAYNT----VSRGVLKVALSKDDGDSWHDALTLEENLAM---EFSYPAVI 295
L + ++ L +NT +RG K+ LSK + L ++ E SYP
Sbjct: 294 LFAL-NKQVYLLHNTPLHNSNRG--KLTLSKMQSPEVFEKLFDIDSTPTQKGEVSYPTAW 350
Query: 296 QASDGSVHITYTYNRTQIKHV 316
G +H+TYT++R I++V
Sbjct: 351 MDPHGILHVTYTFDRKFIRYV 371
>gi|418295150|ref|ZP_12907020.1| hypothetical protein PstZobell_17629 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
gi|379066503|gb|EHY79246.1| hypothetical protein PstZobell_17629 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
Length = 351
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 82/350 (23%), Positives = 133/350 (38%), Gaps = 84/350 (24%)
Query: 41 LVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPII---------ADEEPNVPMWNPVLFKL 89
+ +F GS EGA DV++ + G W S ++ A ++ + NPV+
Sbjct: 1 MAVWFAGSREGAADVQVRGARYDALTGEWGSELVLATRESTQEATQKHIRKLGNPVIALA 60
Query: 90 PSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP----GILGPSKNKPILL 142
P N L LFY + + W+G S + G TWS +QL I +N P+
Sbjct: 61 PDNKLWLFY-VSVSIGGWAGSAINAMYSDDLGETWSTPKQLITTPFLNISTLVRNAPVFH 119
Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNESLSVIQP--VPFHTA 197
+G + E + ++ ++ G+S G + +QP VP
Sbjct: 120 SDGTIGLPVYHEFLGKFAEYLYISPSGDVIGKSRISKG--------MDSLQPTVVPMDE- 170
Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
R + +L + +V S + D G TWS P NPNS + V + LL+A N
Sbjct: 171 QRAIAMLRYAGAAHHKVLASRTEDAGRTWSEPYPIDPANPNSSLAAVATPNHGLLVALND 230
Query: 258 VSRGVLKVALSKDDG--DSWHDALTLEEN------------------------------- 284
+ G ++ L + + + W L+E+
Sbjct: 231 LEDGRFRLRLMETNSGLELWKPVFDLDESPDPEGNPFSPQAYREIIGDKFRLSSGARRLP 290
Query: 285 -----LA-------------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
LA E+ YP I+++DG H+ Y++N T IKHV
Sbjct: 291 LVDEFLANLDRRVCSAHGCDFEYEYPYFIRSADGLYHLVYSWNNTFIKHV 340
>gi|451849312|gb|EMD62616.1| glycoside hydrolase family 33 protein [Cochliobolus sativus ND90Pr]
Length = 369
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 146/360 (40%), Gaps = 76/360 (21%)
Query: 17 FPANSAPFKSC--HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPII 72
F +N P + HAS ++++ G L A+FGGS EG D+ I L K G W +P
Sbjct: 17 FASNYIPASTVQVHASNLLQLPNGDLLCAWFGGSQEGLSDICIHLSRLKKGSRVWSTPQK 76
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP 129
++ NPVLF +P L Q + R S + G TWS + L
Sbjct: 77 ISDDSERSEQNPVLFLVPGTTTLWVMYTAQPAVNQDKAIVRYRVSSDGGQTWSLAQNLFD 136
Query: 130 GILGPSKNKPILLENGLLLCGSSVESW---NSWG---------SWMEVTVDAGRSWRK-- 175
+ LE+G + + +W N G S ++ T D G++W++
Sbjct: 137 DEGLFIRQPVTFLEDGTWV----LPAWYCRNPPGFCWVGSDDISVVKYTRDGGKTWQEKE 192
Query: 176 ----YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
G +++ ++L + F+ + + M + S +C + W+ KP
Sbjct: 193 VPNSIGAVHMNIKALLDGSYIAFYRSRWADNIYMST---------STNC---IDWTEPKP 240
Query: 232 TQLLNPNSGI---------------DGVKLKDGRLLLAYNT-------VSRGVLKVALSK 269
T L NPNSGI D + +D + + + V R ++ V +S
Sbjct: 241 TPLPNPNSGICFEFEEGQAKREGLYDDITPEDDKRINQRGSTRDAVWGVPRKMMTVGVSS 300
Query: 270 DDGDSWHD-ALTLEENLAM----------EFSYPA--VIQASDGSVHITYTYNRTQIKHV 316
D G +W L + + M E SYP+ V + ++ V++ +T+ R +IK +
Sbjct: 301 DGGRTWKSKTLQVGDGFCMTNNSRTKENRELSYPSIWVERNANDVVNVAFTFWRQRIKFI 360
>gi|420416897|ref|ZP_14916004.1| hypothetical protein HPNQ4044_0822 [Helicobacter pylori NQ4044]
gi|393035410|gb|EJB36455.1| hypothetical protein HPNQ4044_0822 [Helicobacter pylori NQ4044]
Length = 372
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 136/316 (43%), Gaps = 42/316 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF IG + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-IGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNES 185
+ KNKP+ +G G + ++ + + + D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNAL 232
Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+QP PF + ++ + M E+C W T L N N ++
Sbjct: 233 NHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNL 288
Query: 244 VKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
+ L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ +
Sbjct: 289 INL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSH 344
Query: 301 SVHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360
>gi|355576659|ref|ZP_09045914.1| hypothetical protein HMPREF1008_01891 [Olsenella sp. oral taxon 809
str. F0356]
gi|354816896|gb|EHF01411.1| hypothetical protein HMPREF1008_01891 [Olsenella sp. oral taxon 809
str. F0356]
Length = 187
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 77/163 (47%), Gaps = 27/163 (16%)
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+PN S + P + L LMRS + SES D G +W+ PT L N NSG
Sbjct: 2 VPN-SAGRVHPTVVELEDGHLVCLMRSREA-DWIYRSESTDFGDSWTEPVPTPLPNNNSG 59
Query: 241 IDGVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE---- 282
I VKL GR+ +AYN S R + VALS+D ++ +E
Sbjct: 60 ICAVKLASGRVAVAYNHSSAPQAYGQKGAWPGLRCPVSVALSEDGCKTFPLIRHIERGQG 119
Query: 283 ----ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHVVL 318
EN + M++ YP ++QA+DG +H+ Y Y R +K V L
Sbjct: 120 YVGDENRSNNMQYEYPCIMQAADGMLHLAYAYETRKGVKWVTL 162
>gi|188527573|ref|YP_001910260.1| hypothetical protein HPSH_03975 [Helicobacter pylori Shi470]
gi|188143813|gb|ACD48230.1| hypothetical protein HPSH_03975 [Helicobacter pylori Shi470]
Length = 372
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDCLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|373849439|ref|ZP_09592240.1| hypothetical protein Opit5DRAFT_0294 [Opitutaceae bacterium TAV5]
gi|372475604|gb|EHP35613.1| hypothetical protein Opit5DRAFT_0294 [Opitutaceae bacterium TAV5]
Length = 378
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 97/204 (47%), Gaps = 29/204 (14%)
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSW---GSWMEVTVDA 169
RS + G TWS+R P S + P+ L +G LL E W G W+ + D
Sbjct: 166 RSGDGGKTWSDRYAAPCN----SPHGPVALRDGRLLYAGK-ELWTEEKKVGVWL--SADD 218
Query: 170 GRSWRKYGPIY-IPNESLSVIQPV-PFHTANRTLRVLMRSFNGIGRVCMS-ESCDGGLTW 226
G +W++ + P E+++ + A+ TL V +R+ NG R + ES DGG TW
Sbjct: 219 GLTWKRAAWLEPRPGETVTDYHELSAIEAADGTLIVQIRNHNGGRRETLQCESHDGGATW 278
Query: 227 SYAKPTQLLNPNSGIDG-----VKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHDAL 279
+ + G+DG V+LKDGRL++ Y RG+ + +S D G +W +
Sbjct: 279 TLPRAI-------GVDGFPSHLVRLKDGRLVMTYGWREAPRGI-RGRVSADHGRTWSEEF 330
Query: 280 TLEENL-AMEFSYPAVIQASDGSV 302
L ++ + YP ++ +DG +
Sbjct: 331 VLTDDAPGWDVGYPGTVELADGQL 354
>gi|190894659|ref|YP_001984952.1| putative glycosyl hydrolase [Rhizobium etli CIAT 652]
gi|190700320|gb|ACE94402.1| putative glycosyl hydrolase protein [Rhizobium etli CIAT 652]
Length = 369
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 81/348 (23%), Positives = 131/348 (37%), Gaps = 86/348 (24%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++FK P L Y Q G + + S + G T+ + L PG
Sbjct: 95 SEQNPLIFKAPDGKTWLLY-TSQTSGNQDGAIVKCRISDDGGKTFGPVQILCDSPGTF-- 151
Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
+ + ++ + G + C G + W+ V + D G SW+ G ++
Sbjct: 152 VRQQIVVNDRGDWLLPIFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 211
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ L V F+ NR ++ S S DGG TWS +PT+L N NS
Sbjct: 212 MNILPLGGGDMVAFYR-NRFAETILS----------SRSADGGETWSPPEPTELPNNNSS 260
Query: 241 IDGVKLKDGRLLLAYN---------------------------------------TVSRG 261
I L +G + + YN V R
Sbjct: 261 IQATVLDNGGIAMVYNHSNATTSDARRQSLYDEIEGDEAGENAAVVADAGRKAVWGVPRA 320
Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQA 297
L +A+S+D G ++ + L+ ++L EFSYP+VI+
Sbjct: 321 PLSLAISRDGGTTFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSVIEG 368
>gi|420433839|ref|ZP_14932844.1| hypothetical protein HPHPH24_0875 [Helicobacter pylori Hp H-24]
gi|420507605|ref|ZP_15006114.1| hypothetical protein HPHPH24B_0780 [Helicobacter pylori Hp H-24b]
gi|420532939|ref|ZP_15031302.1| hypothetical protein HPHPM1_0779 [Helicobacter pylori Hp M1]
gi|420534600|ref|ZP_15032948.1| hypothetical protein HPHPM2_0749 [Helicobacter pylori Hp M2]
gi|420536308|ref|ZP_15034650.1| hypothetical protein HPHPM3_0779 [Helicobacter pylori Hp M3]
gi|420541499|ref|ZP_15039807.1| hypothetical protein HPHPM6_0901 [Helicobacter pylori Hp M6]
gi|420543018|ref|ZP_15041313.1| hypothetical protein HPHPM9_0702 [Helicobacter pylori Hp M9]
gi|393050254|gb|EJB51214.1| hypothetical protein HPHPH24_0875 [Helicobacter pylori Hp H-24]
gi|393117458|gb|EJC17959.1| hypothetical protein HPHPH24B_0780 [Helicobacter pylori Hp H-24b]
gi|393140570|gb|EJC40943.1| hypothetical protein HPHPM1_0779 [Helicobacter pylori Hp M1]
gi|393141240|gb|EJC41605.1| hypothetical protein HPHPM2_0749 [Helicobacter pylori Hp M2]
gi|393143956|gb|EJC44300.1| hypothetical protein HPHPM3_0779 [Helicobacter pylori Hp M3]
gi|393148119|gb|EJC48443.1| hypothetical protein HPHPM6_0901 [Helicobacter pylori Hp M6]
gi|393160081|gb|EJC60330.1| hypothetical protein HPHPM9_0702 [Helicobacter pylori Hp M9]
Length = 372
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|386750984|ref|YP_006224204.1| hypothetical protein HPSH417_02785 [Helicobacter pylori Shi417]
gi|384557242|gb|AFH97710.1| hypothetical protein HPSH417_02785 [Helicobacter pylori Shi417]
Length = 372
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|300022380|ref|YP_003754991.1| hypothetical protein Hden_0853 [Hyphomicrobium denitrificans ATCC
51888]
gi|299524201|gb|ADJ22670.1| hypothetical protein Hden_0853 [Hyphomicrobium denitrificans ATCC
51888]
Length = 394
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 131/330 (39%), Gaps = 44/330 (13%)
Query: 25 KSCHASTIVEVDKGHF-------------LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI 71
+ HA+TI D+ F L ++ EGA + +I F W
Sbjct: 51 QQIHAATITPPDEKDFVHGPSLIQTPAGLLAFWYRAVYEGAANAEIVSSRFDGTHWSPTA 110
Query: 72 IADEEPNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGV 119
+ + V + NPV F+ +N + LF+ + W+ C + RS + G
Sbjct: 111 VVKDSSAVTRDIGLTVKSLANPVAFRRSANEIWLFFA-ASRLSGWATCEIILVRSTDNGQ 169
Query: 120 TWSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
TW ++L + +K+ PI L +G + + N M V GR K
Sbjct: 170 TWERAQRLYASPFLNMSHLTKSLPIRL-SGDRIALPAYHEMNRKYPIMLVLDGNGRVVDK 228
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKPTQ 233
+ + N QP T+ T +R N + ++ + D G TW+ P
Sbjct: 229 ---VRMGNGGTVGYQPAIVPTSPTTAIAFVRRLNSFRPQSILITRTTDAGRTWTPVAPID 285
Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSR--GVLKVALSKDDGDSWHDALTL----EENLAM 287
L NP I ++ + R+LLA+N + +K+A S DG S+ TL ++
Sbjct: 286 LPNPGGPIAAIRYDETRILLAFNDDPKDESNVKLAFSNLDGTSFRRIGTLVQVNDQVKGD 345
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
+YP ++++ G + ++ R +HV+
Sbjct: 346 ATAYPFLLESEPGQFDVVFS--RPPPQHVI 373
>gi|420451898|ref|ZP_14950748.1| hypothetical protein HPHPA6_0645 [Helicobacter pylori Hp A-6]
gi|393069739|gb|EJB70534.1| hypothetical protein HPHPA6_0645 [Helicobacter pylori Hp A-6]
Length = 372
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLIGLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ ++ PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LFRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|87306692|ref|ZP_01088839.1| hypothetical protein DSM3645_10172 [Blastopirellula marina DSM
3645]
gi|87290871|gb|EAQ82758.1| hypothetical protein DSM3645_10172 [Blastopirellula marina DSM
3645]
Length = 421
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 76/161 (47%), Gaps = 14/161 (8%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
V+ D G+SW + G PN + + + +L +L R+ G+ S S DGG
Sbjct: 232 VSSDQGKSWERRGAATFPNPDWH--EHMIVEREDGSLWMLARTTKGL---MQSISTDGGE 286
Query: 225 TWSY-AKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHD 277
TW+ ++P + PN+ +L G++LL + R L LS D+G +W
Sbjct: 287 TWAKPSEPAGIQQPNARFHVRRLASGKILLIKHGDKIDSHHGRVQLSAWLSDDEGKTWTG 346
Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L L+E + SYP QA DG ++I+Y NR ++L
Sbjct: 347 GLVLDERKGV--SYPDGFQAPDGRIYISYDRNRATDGEILL 385
>gi|420408708|ref|ZP_14907864.1| hypothetical protein HPNQ4216_0918 [Helicobacter pylori NQ4216]
gi|393024124|gb|EJB25237.1| hypothetical protein HPNQ4216_0918 [Helicobacter pylori NQ4216]
Length = 372
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF IG + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-IGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420421925|ref|ZP_14921003.1| hypothetical protein HPNQ4110_0734 [Helicobacter pylori NQ4110]
gi|393038443|gb|EJB39477.1| hypothetical protein HPNQ4110_0734 [Helicobacter pylori NQ4110]
Length = 372
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + ++
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEDLSHHS 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKVLDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|407458234|ref|YP_006736539.1| hypothetical protein B601_0714 [Chlamydia psittaci WS/RT/E30]
gi|405785505|gb|AFS24251.1| hypothetical protein B601_0714 [Chlamydia psittaci WS/RT/E30]
Length = 357
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 134/310 (43%), Gaps = 35/310 (11%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL K+ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVKISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNE 184
G ++N P + + N + + +V SW+E+ +SW + GPI +
Sbjct: 137 EGTTRNPPYVDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSR 196
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+ I+P + L + R+ + +CM+ S + G +WS K + ++ I
Sbjct: 197 NQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAV 256
Query: 244 VKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
++G+L L N+ G L S ++G +W+ ++ +++ P+ G +
Sbjct: 257 QSGRNGKLYLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCM 313
Query: 303 HITYTYNRTQ 312
H+ YT Q
Sbjct: 314 HLVYTSKDDQ 323
>gi|420423565|ref|ZP_14922636.1| hypothetical protein HPHPA4_0790 [Helicobacter pylori Hp A-4]
gi|393041287|gb|EJB42303.1| hypothetical protein HPHPA4_0790 [Helicobacter pylori Hp A-4]
Length = 372
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSQSS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420489046|ref|ZP_14987643.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11]
gi|420522725|ref|ZP_15021149.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11b]
gi|393107796|gb|EJC08336.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11]
gi|393129682|gb|EJC30116.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11b]
Length = 372
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|408372942|ref|ZP_11170641.1| hypothetical protein A11A3_02647 [Alcanivorax hongdengensis A-11-3]
gi|407767294|gb|EKF75732.1| hypothetical protein A11A3_02647 [Alcanivorax hongdengensis A-11-3]
Length = 413
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 137/375 (36%), Gaps = 79/375 (21%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
TF A+ A HA+++VE+ G +F GS EGA DV I F ++G W
Sbjct: 41 RMTF-ASDADTTMVHAASMVELPDGRLRAFWFAGSREGASDVAIHSALFDPRNGSWADEH 99
Query: 72 IA-DEEPNVPMWNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVT 120
+A E W + KL NG + + + W+ + S + G +
Sbjct: 100 VAVTREQIADQWGRYVRKLGNAVPVLDDNGHMRLFVVAVSFGGWAASRLVVLGSDDLGTS 159
Query: 121 WSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
W + L I K PI +G + E +G + +
Sbjct: 160 WQFEDALTTSPFMNISTLVKAPPIRYSDGTVGLPVYHEMLGKFGEILRLDPQN----HIL 215
Query: 177 GPIYIPNESLSVIQPV-----PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
G I IQP+ P H A LR R NG + S S +GG W
Sbjct: 216 GKARI-GHGRKAIQPLVLVDSPTHAAA-FLRNEQRPNNG--HLYQSHSVNGGWDWQPLTY 271
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSW-------------H 276
++L NP+S + GV + G LLA NT R L + ++D G SW
Sbjct: 272 SRLTNPSSALGGVVIAPGHWLLATNCNTEERDDLCIRETRDGGASWTPRWFFHNRQAWRD 331
Query: 277 DALTLEENLAM---------------------------------EFSYPAVIQASDGSVH 303
D + L M ++ YP +++AS+G +H
Sbjct: 332 DQMQPRPFLNMIDNELDKTVQVDNPARYLDRVRENKCRRGECTFQYDYPYMLRASNGDLH 391
Query: 304 ITYTYNRTQIKHVVL 318
+ YT+N+ I+H L
Sbjct: 392 VLYTWNKAAIRHAWL 406
>gi|420440343|ref|ZP_14939299.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-30]
gi|420499684|ref|ZP_14998238.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-26]
gi|393056570|gb|EJB57481.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-30]
gi|393149664|gb|EJC49973.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-26]
Length = 372
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|386752570|ref|YP_006225789.1| hypothetical protein HPSH169_02985 [Helicobacter pylori Shi169]
gi|384558828|gb|AFH99295.1| hypothetical protein HPSH169_02985 [Helicobacter pylori Shi169]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESTLEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420479141|ref|ZP_14977790.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-34]
gi|393095383|gb|EJB95987.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-34]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420456911|ref|ZP_14955730.1| hypothetical protein HPHPA16_0760 [Helicobacter pylori Hp A-16]
gi|393074309|gb|EJB75069.1| hypothetical protein HPHPA16_0760 [Helicobacter pylori Hp A-16]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|406593611|ref|YP_006740790.1| hypothetical protein B712_0716 [Chlamydia psittaci NJ1]
gi|405789483|gb|AFS28225.1| hypothetical protein B712_0716 [Chlamydia psittaci NJ1]
Length = 357
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 136/315 (43%), Gaps = 45/315 (14%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL ++ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFRWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
G ++N P + + N + + S+N + SW+E+ +SW + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191
Query: 180 YIPNESLSVIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + I+P + T+NR + +CM+ S + G +WS K + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNRLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251
Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+ I ++G+L L N+ G L S ++G +W+ ++ +++ P+
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308
Query: 298 SDGSVHITYTYNRTQ 312
G +H+ YT Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323
>gi|420428574|ref|ZP_14927609.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-17]
gi|393046233|gb|EJB47213.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-17]
Length = 372
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|87311054|ref|ZP_01093178.1| probable sialidase [Blastopirellula marina DSM 3645]
gi|87286145|gb|EAQ78055.1| probable sialidase [Blastopirellula marina DSM 3645]
Length = 371
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/249 (25%), Positives = 113/249 (45%), Gaps = 19/249 (7%)
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG---ILGPSKNKPILL 142
L +L + LFY + ++ M+ S ++ TWSE PG + ++ I
Sbjct: 106 LLRLADGRIALFYLVKNSIKDCKMRMRTSSDEAETWSEPIDCMPGEENYFVVNNDRVIQT 165
Query: 143 ENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI--PNESLSVIQPVP 193
++G L+ +++ + G W ++ D G++WR+ I ++ + P
Sbjct: 166 KSGRLIAPAAIHVRD--GKWQGAADIVCFLSDDNGKTWRRSQQTVIGKRDDGSRFVTQEP 223
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDGRLL 252
+ R+LM + +GR + S DGG T+S +P +P S ++ G LL
Sbjct: 224 GVIELKDGRILMWIRSNLGRQAVCYSSDGGETFSDPQPWNFKSPVSPASIKRIPSTGDLL 283
Query: 253 LAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
L +N R L VA+S D+G +W A ++E + F Y A I D V ++Y +
Sbjct: 284 LVWNDNVKRRTPLNVAISADEGATWTHAKSIETDPNGWFCYTA-IHCVDDRVLLSY-WLT 341
Query: 311 TQIKHVVLL 319
Q+KH +L+
Sbjct: 342 EQLKHPLLI 350
>gi|384450754|ref|YP_005663354.1| alpha-rhamnosidase-like protein, putative [Chlamydophila psittaci
6BC]
gi|328914848|gb|AEB55681.1| alpha-rhamnosidase-like protein, putative [Chlamydophila psittaci
6BC]
Length = 319
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 130/301 (43%), Gaps = 35/301 (11%)
Query: 30 STIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPM 81
STIVE G LV + G C D+ IW + ++ +S + +
Sbjct: 2 STIVET-SGRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLFS-------- 48
Query: 82 WNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
+PVL ++ S + LFY+ Q + S + S+N G W E LPPGI G ++N P
Sbjct: 49 -HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGIEGTTRNPPY 107
Query: 141 L--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNESLSVIQPVP 193
+ + N + + +V SW+E+ +SW + GPI + + I+P
Sbjct: 108 VDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSRNQVPIEPAI 167
Query: 194 FHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
+ L + R+ + +CM+ S + G +WS K + ++ I ++G+L
Sbjct: 168 VKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAVQSGRNGKLY 227
Query: 253 LAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
L N+ G L S ++G +W+ ++ +++ P+ G +H+ YT
Sbjct: 228 LFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCMHLVYTSKDD 284
Query: 312 Q 312
Q
Sbjct: 285 Q 285
>gi|449071311|ref|YP_007438391.1| hypothetical protein AO9_03425 [Chlamydophila psittaci Mat116]
gi|449039819|gb|AGE75243.1| hypothetical protein AO9_03425 [Chlamydophila psittaci Mat116]
Length = 357
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL ++ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
G ++N P + + N + + S+N + SW+E+ +SW + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + I+P + L + R+ + +CM+ S + G +WS K + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251
Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+ I ++G+L L N+ G L S ++G +W+ ++ +++ P+
Sbjct: 252 NSIAVQSGRNGKLYLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308
Query: 298 SDGSVHITYTYNRTQ 312
G +H+ YT Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323
>gi|407460857|ref|YP_006738632.1| hypothetical protein B603_0723 [Chlamydia psittaci WC]
gi|405787140|gb|AFS25884.1| hypothetical protein B603_0723 [Chlamydia psittaci WC]
Length = 357
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL ++ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
G ++N P + + N + + S+N + SW+E+ +SW + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + I+P + L + R+ + +CM+ S + G +WS K + +
Sbjct: 192 LGGSRNQVPIEPAIVKLTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251
Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+ I ++G+L L N+ G L S ++G +W+ ++ +++ P+
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308
Query: 298 SDGSVHITYTYNRTQ 312
G +H+ YT Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323
>gi|329943014|ref|ZP_08291788.1| hypothetical protein G5Q_0689 [Chlamydophila psittaci Cal10]
gi|332287596|ref|YP_004422497.1| hypothetical protein CPSIT_0711 [Chlamydophila psittaci 6BC]
gi|384451748|ref|YP_005664346.1| hypothetical protein CPS0A_0729 [Chlamydophila psittaci 01DC11]
gi|384452722|ref|YP_005665319.1| hypothetical protein CPS0D_0725 [Chlamydophila psittaci 08DC60]
gi|384453701|ref|YP_005666297.1| hypothetical protein CPS0C_0726 [Chlamydophila psittaci C19/98]
gi|384454680|ref|YP_005667275.1| putative alpha-rhamnosidase-like protein [Chlamydophila psittaci
02DC15]
gi|392376827|ref|YP_004064605.1| hypothetical protein Cpsi_6531 [Chlamydophila psittaci RD1]
gi|407454220|ref|YP_006733328.1| hypothetical protein B595_0770 [Chlamydia psittaci 84/55]
gi|407455499|ref|YP_006734390.1| hypothetical protein B598_0713 [Chlamydia psittaci GR9]
gi|407456909|ref|YP_006735482.1| hypothetical protein B600_0767 [Chlamydia psittaci VS225]
gi|313848170|emb|CBY17171.1| hypothetical protein Cpsi_6531 [Chlamydophila psittaci RD1]
gi|325507153|gb|ADZ18791.1| hypothetical protein CPSIT_0711 [Chlamydophila psittaci 6BC]
gi|328814561|gb|EGF84551.1| hypothetical protein G5Q_0689 [Chlamydophila psittaci Cal10]
gi|334692482|gb|AEG85701.1| hypothetical protein CPS0C_0726 [Chlamydophila psittaci C19/98]
gi|334693458|gb|AEG86676.1| hypothetical protein CPS0A_0729 [Chlamydophila psittaci 01DC11]
gi|334694437|gb|AEG87654.1| putative alpha-rhamnosidase-like protein [Chlamydophila psittaci
02DC15]
gi|334695411|gb|AEG88627.1| hypothetical protein CPS0D_0725 [Chlamydophila psittaci 08DC60]
gi|405780979|gb|AFS19729.1| hypothetical protein B595_0770 [Chlamydia psittaci 84/55]
gi|405782042|gb|AFS20791.1| hypothetical protein B598_0713 [Chlamydia psittaci GR9]
gi|405784170|gb|AFS22917.1| hypothetical protein B600_0767 [Chlamydia psittaci VS225]
Length = 357
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 19/302 (6%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP 80
S+ ++ STIVE G LV + + K+ + G W P + +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVW-----QDCDSNKLIGRRCDLGIWSEPYEIFPQESGL 84
Query: 81 MWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
+PVL ++ S + LFY+ Q + S + S+N G W E LPPGI G ++N P
Sbjct: 85 FSHPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGIEGTTRNPP 144
Query: 140 IL--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNESLSVIQPV 192
+ + N + + +V SW+E+ +SW + GPI + + I+P
Sbjct: 145 YVDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSRNQVPIEPA 204
Query: 193 PFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
+ L + R+ + +CM+ S + G +WS K + ++ I ++G+L
Sbjct: 205 IVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAVQSGRNGKL 264
Query: 252 LLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
L N+ G L S ++G +W+ ++ +++ P+ G +H+ YT
Sbjct: 265 YLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCMHLVYTSKD 321
Query: 311 TQ 312
Q
Sbjct: 322 DQ 323
>gi|420448593|ref|ZP_14947473.1| hypothetical protein HPHPH44_0668 [Helicobacter pylori Hp H-44]
gi|393065947|gb|EJB66775.1| hypothetical protein HPHPH44_0668 [Helicobacter pylori Hp H-44]
Length = 372
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF GS EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGSKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420395231|ref|ZP_14894459.1| hypothetical protein HPCPY1124_0732 [Helicobacter pylori CPY1124]
gi|393014498|gb|EJB15670.1| hypothetical protein HPCPY1124_0732 [Helicobacter pylori CPY1124]
Length = 337
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 27 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEKLSYYS 86
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 87 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN +
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|406592550|ref|YP_006739730.1| hypothetical protein B711_0772 [Chlamydia psittaci CP3]
gi|406594360|ref|YP_006741832.1| hypothetical protein B599_0718 [Chlamydia psittaci MN]
gi|410858610|ref|YP_006974550.1| hypothetical protein BN356_6571 [Chlamydia psittaci 01DC12]
gi|405782792|gb|AFS21540.1| hypothetical protein B599_0718 [Chlamydia psittaci MN]
gi|405788422|gb|AFS27165.1| hypothetical protein B711_0772 [Chlamydia psittaci CP3]
gi|410811505|emb|CCO02158.1| hypothetical protein BN356_6571 [Chlamydia psittaci 01DC12]
Length = 357
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL ++ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
G ++N P + + N + + S+N + SW+E+ +SW + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191
Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
+ + I+P + L + R+ + +CM+ S + G +WS K + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251
Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+ I ++G+L L N+ G L S ++G +W+ ++ +++ P+
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308
Query: 298 SDGSVHITYTYNRTQ 312
G +H+ YT Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323
>gi|254779428|ref|YP_003057533.1| hypothetical protein HELPY_0795 [Helicobacter pylori B38]
gi|254001339|emb|CAX29324.1| Conserved hypothetical protein; putative signal peptide
[Helicobacter pylori B38]
Length = 358
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 48 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSKNA 107
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 165
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 166 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 220
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 276
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 277 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 332
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346
>gi|385217538|ref|YP_005779014.1| hypothetical protein HPF16_0779 [Helicobacter pylori F16]
gi|317177587|dbj|BAJ55376.1| hypothetical protein HPF16_0779 [Helicobacter pylori F16]
Length = 372
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKINRWSEAFIILTKEELSFYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|208434500|ref|YP_002266166.1| hypothetical protein HPG27_539 [Helicobacter pylori G27]
gi|208432429|gb|ACI27300.1| hypothetical protein HPG27_539 [Helicobacter pylori G27]
Length = 358
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 133/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 48 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFTILTKEDLSHHS 107
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKVYQFESALEPIRFKFARKLSLSP 165
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 166 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 220
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 276
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 277 L-NKELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNSHFID 333
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 334 IVYTYNRSHIKHI 346
>gi|420504219|ref|ZP_15002746.1| hypothetical protein HPHPP62_0678 [Helicobacter pylori Hp P-62]
gi|393154631|gb|EJC54912.1| hypothetical protein HPHPP62_0678 [Helicobacter pylori Hp P-62]
Length = 358
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 48 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSRHS 107
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 108 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 165
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 166 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 220
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 276
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 277 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 332
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346
>gi|420430364|ref|ZP_14929393.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-20]
gi|393047776|gb|EJB48748.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-20]
Length = 372
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 129/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420490736|ref|ZP_14989319.1| hypothetical protein HPHPP13_0839 [Helicobacter pylori Hp P-13]
gi|420524586|ref|ZP_15022993.1| hypothetical protein HPHPP13B_0844 [Helicobacter pylori Hp P-13b]
gi|393107649|gb|EJC08190.1| hypothetical protein HPHPP13_0839 [Helicobacter pylori Hp P-13]
gi|393130857|gb|EJC31281.1| hypothetical protein HPHPP13B_0844 [Helicobacter pylori Hp P-13b]
Length = 372
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-----WSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFRFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420401548|ref|ZP_14900742.1| hypothetical protein HPCPY6081_0398 [Helicobacter pylori CPY6081]
gi|393019137|gb|EJB20281.1| hypothetical protein HPCPY6081_0398 [Helicobacter pylori CPY6081]
Length = 372
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN +
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|338740003|ref|YP_004676965.1| hypothetical protein HYPMC_3184 [Hyphomicrobium sp. MC1]
gi|337760566|emb|CCB66399.1| conserved protein of unknown function [Hyphomicrobium sp. MC1]
Length = 394
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 68/340 (20%), Positives = 134/340 (39%), Gaps = 51/340 (15%)
Query: 11 VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
++ E P + F H ++ V + G ++ EGA D ++ F W
Sbjct: 54 LSSEIVTPKGESEF--VHGASAVAMPNGGLQAFWYRAPYEGANDAELVSARFDGKEWSPT 111
Query: 71 IIADEEPNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKG 118
+ V + NPV F+ + LF+ + + W+ C + RS++ G
Sbjct: 112 HVVTTSARVSHDLAITIKSLANPVPFRRSDKEIWLFFATSR-LSGWATCEIGLIRSFDNG 170
Query: 119 VTWSEREQLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR 174
TW E L + +K+ P L +G + E + + + D R
Sbjct: 171 ATWGPAESLYATPFFNMSHLTKSTPFLFSDGRIGLPVYQEMNQKFPVLLVLDADGHVIDR 230
Query: 175 K---------YGPIYI---PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
+ Y P+ + P+ +++ ++P+ +R R + ++ + DG
Sbjct: 231 RRMGKAGKVGYQPMIVATGPSTAIAFVRPL----KSRRPR----------EILITRTTDG 276
Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALT 280
G TW+ +PT L NP I ++ +L+A+N ++ + +AL+ DG ++ +
Sbjct: 277 GQTWTPVEPTNLPNPGGPISAIRYDATHILMAFNDDAKMERNITLALADLDGKTFRRSGV 336
Query: 281 LE----ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+ + +YP +IQ+ G + Y+ I HV
Sbjct: 337 IARMDTDRQDHIVAYPFLIQSGPGQFDVMYSRPIKTINHV 376
>gi|420506413|ref|ZP_15004928.1| hypothetical protein HPHPP74_1239 [Helicobacter pylori Hp P-74]
gi|393115918|gb|EJC16428.1| hypothetical protein HPHPP74_1239 [Helicobacter pylori Hp P-74]
Length = 372
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKTNRWSEAFTILTKEELSKNA 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420436991|ref|ZP_14935975.1| hypothetical protein HPHPH28_0700 [Helicobacter pylori Hp H-28]
gi|393053305|gb|EJB54249.1| hypothetical protein HPHPH28_0700 [Helicobacter pylori Hp H-28]
Length = 372
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAST++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASTLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPFYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPYFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|217032365|ref|ZP_03437860.1| hypothetical protein HPB128_25g13 [Helicobacter pylori B128]
gi|298736271|ref|YP_003728797.1| hypothetical protein HPB8_776 [Helicobacter pylori B8]
gi|216945932|gb|EEC24548.1| hypothetical protein HPB128_25g13 [Helicobacter pylori B128]
gi|298355461|emb|CBI66333.1| conserved hypothetical protein [Helicobacter pylori B8]
Length = 337
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 27 SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 86
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 87 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|420463871|ref|ZP_14962647.1| hypothetical protein HPHPH4_1004 [Helicobacter pylori Hp H-4]
gi|393079353|gb|EJB80086.1| hypothetical protein HPHPH4_1004 [Helicobacter pylori Hp H-4]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420443569|ref|ZP_14942497.1| hypothetical protein HPHPH41_0647 [Helicobacter pylori Hp H-41]
gi|393061076|gb|EJB61945.1| hypothetical protein HPHPH41_0647 [Helicobacter pylori Hp H-41]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + RK + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRK---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420470428|ref|ZP_14969137.1| hypothetical protein HPHPH11_0782 [Helicobacter pylori Hp H-11]
gi|393085861|gb|EJB86540.1| hypothetical protein HPHPH11_0782 [Helicobacter pylori Hp H-11]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 142/328 (43%), Gaps = 43/328 (13%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
+FT P + S HAS+++ + + L AYF G+ EGA DVKI F K RW
Sbjct: 51 QFTIPKPNGAL-SVHASSLISLPNDNLLSAYFSGTKEGASDVKISANLFDSKTNRWSEAF 109
Query: 71 -IIADEEPN-------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I+ EE + + NP+LF L + +LLF +G + W+ +
Sbjct: 110 TILTKEELSHYSHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSW 173
++ + L P + KNKP+ +G G + ++ + + + D +
Sbjct: 168 HFKFARKLSLSPFLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDKQNNP 223
Query: 174 RKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
R+ + PN +QP PF + ++ + M E+C W
Sbjct: 224 RE---LLRPNTLNHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPML 276
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAME 288
T L N N ++ + L + L L +N +S ++ LSK ++ +S+ L++ A E
Sbjct: 277 TNLKNLNDALNLINL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANE 333
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHV 316
SYP+ + + I YTYNR+ IKH+
Sbjct: 334 VSYPSY-SLNPHFIDIVYTYNRSHIKHI 360
>gi|420492600|ref|ZP_14991174.1| hypothetical protein HPHPP15_0915 [Helicobacter pylori Hp P-15]
gi|420526622|ref|ZP_15025023.1| hypothetical protein HPHPP15B_1184 [Helicobacter pylori Hp P-15b]
gi|393107039|gb|EJC07582.1| hypothetical protein HPHPP15_0915 [Helicobacter pylori Hp P-15]
gi|393131927|gb|EJC32350.1| hypothetical protein HPHPP15B_1184 [Helicobacter pylori Hp P-15b]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|419419543|ref|ZP_13959778.1| Hypothetical protein HP17_05720 [Helicobacter pylori NCTC 11637 =
CCUG 17874]
gi|384372253|gb|EIE27869.1| Hypothetical protein HP17_05720 [Helicobacter pylori NCTC 11637 =
CCUG 17874]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASALINLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFTILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L ++ +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|387782418|ref|YP_005793131.1| hypothetical protein KHP_0738 [Helicobacter pylori 51]
gi|261838177|gb|ACX97943.1| hypothetical protein KHP_0738 [Helicobacter pylori 51]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSFYS 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLMRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N+ + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNSSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420435864|ref|ZP_14934863.1| hypothetical protein HPHPH27_1221 [Helicobacter pylori Hp H-27]
gi|420465270|ref|ZP_14964037.1| hypothetical protein HPHPH6_0683 [Helicobacter pylori Hp H-6]
gi|420495871|ref|ZP_14994435.1| hypothetical protein HPHPP23_0961 [Helicobacter pylori Hp P-23]
gi|393051723|gb|EJB52674.1| hypothetical protein HPHPH27_1221 [Helicobacter pylori Hp H-27]
gi|393082757|gb|EJB83473.1| hypothetical protein HPHPH6_0683 [Helicobacter pylori Hp H-6]
gi|393112182|gb|EJC12703.1| hypothetical protein HPHPP23_0961 [Helicobacter pylori Hp P-23]
Length = 372
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|425789178|ref|YP_007017098.1| hypothetical protein HPAKL117_02740 [Helicobacter pylori
Aklavik117]
gi|425627493|gb|AFX90961.1| hypothetical protein HPAKL117_02740 [Helicobacter pylori
Aklavik117]
Length = 372
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/326 (25%), Positives = 137/326 (42%), Gaps = 39/326 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
+FT P S HAS ++ + + L AYF G+ EGA DVKI F K RW
Sbjct: 51 QFTIPKPDDAL-SVHASVLINLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAF 109
Query: 71 -IIADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I+ EE + + NP+LF L + +LLF +G + W+ +
Sbjct: 110 VILTKEELSFYSHEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
+ + L P + +NKP+ +G + E + ++ D + R+
Sbjct: 168 HFKFVRKLSLSPFLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE 225
Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ PN +QP PF + V+ + M E+C TW T
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTTWQKPMLTN 278
Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFS 290
L N N ++ + L + L L +N + + + LSK ++ +S+ L++ A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVS 335
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
YP+ + + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNSHFIDIVYTYNRSHIKHI 360
>gi|405971945|gb|EKC36746.1| hypothetical protein CGI_10014218 [Crassostrea gigas]
Length = 391
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 121/300 (40%), Gaps = 35/300 (11%)
Query: 44 YFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQE 103
Y G SC D + +W + + NPVLF +L + QE
Sbjct: 96 YSGASCGCQIDWEANQLKNNSDQWTKAQVVSQRKGYSNQNPVLFHDNKTDVLYLFHSQQE 155
Query: 104 VQKWSGCMKRSYN--------KGVTWSEREQL-PPGILGPSK-----NKPIL-LENGLLL 148
+K S ++ + K VT S Q PP ++ K N+ ++ L N L
Sbjct: 156 AKKASAGIQSEDSAEIWVLSAKNVTNSTAIQFSPPRVMFKHKGSFDRNRVVVSLRNTWLY 215
Query: 149 -----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL--SVIQPVPFHTANRTL 201
GSS + ++ V D SW + P N + SV++P N L
Sbjct: 216 PMYYAGGSSKDQTSNLKEC--VDHDVFSSWLDH-PFTASNYLVQPSVVRP---KKGNPRL 269
Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--TVS 259
V R V +ES D G TW+ T L N NSGI+ L G L + YN T
Sbjct: 270 IVFFRDRRAQN-VYRAESPDDGKTWTRPSKTTLPNNNSGIEASVLSSGNLAIVYNPTTKD 328
Query: 260 RGVLKVALSKDDGDSWHDALTLE---ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
R L V+LS D G +W LE + +EFSYP + Q S ++ T + + + KH+
Sbjct: 329 RNPLSVSLSDDQGITWKYTRNLEYTKSSNGVEFSYPTLFQDSSEAIGSTCVH-KMRGKHI 387
>gi|420483924|ref|ZP_14982551.1| hypothetical protein HPHPP3_0763 [Helicobacter pylori Hp P-3]
gi|420514196|ref|ZP_15012669.1| hypothetical protein HPHPP3B_0577 [Helicobacter pylori Hp P-3b]
gi|393101617|gb|EJC02187.1| hypothetical protein HPHPP3_0763 [Helicobacter pylori Hp P-3]
gi|393158659|gb|EJC58919.1| hypothetical protein HPHPP3B_0577 [Helicobacter pylori Hp P-3b]
Length = 372
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|385225478|ref|YP_005785403.1| hypothetical protein HMPREF0462_0836 [Helicobacter pylori 83]
gi|332673624|gb|AEE70441.1| conserved hypothetical protein [Helicobacter pylori 83]
Length = 358
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 134/315 (42%), Gaps = 40/315 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 48 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSYYS 107
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ L P
Sbjct: 108 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 165
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 166 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 220
Query: 188 VIQP--VPFHT-ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+QP PF A T R + + M E+C W T L N N ++ +
Sbjct: 221 QLQPSLTPFKDCALMTFRN-----HSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLI 275
Query: 245 KLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ +
Sbjct: 276 NL-NQELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHF 331
Query: 302 VHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 332 IDIVYTYNRSHIKHI 346
>gi|420410111|ref|ZP_14909254.1| hypothetical protein HPNQ4200_0646 [Helicobacter pylori NQ4200]
gi|393028424|gb|EJB29510.1| hypothetical protein HPNQ4200_0646 [Helicobacter pylori NQ4200]
Length = 347
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 37 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFTILTKEDLSHHS 96
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + + + L P
Sbjct: 97 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVRKLSLSP 154
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + + + D + R+ + PN
Sbjct: 155 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 209
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 266 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 321
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335
>gi|420480610|ref|ZP_14979253.1| hypothetical protein HPHPP1_0821 [Helicobacter pylori Hp P-1]
gi|420511109|ref|ZP_15009596.1| hypothetical protein HPHPP1B_0902 [Helicobacter pylori Hp P-1b]
gi|393096626|gb|EJB97223.1| hypothetical protein HPHPP1_0821 [Helicobacter pylori Hp P-1]
gi|393119593|gb|EJC20083.1| hypothetical protein HPHPP1B_0902 [Helicobacter pylori Hp P-1b]
Length = 372
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|421710075|ref|ZP_16149432.1| BNR repeat-like domain protein [Helicobacter pylori R018c]
gi|421723357|ref|ZP_16162611.1| BNR repeat-like domain protein [Helicobacter pylori R056a]
gi|407210266|gb|EKE80145.1| BNR repeat-like domain protein [Helicobacter pylori R018c]
gi|407224380|gb|EKE94156.1| BNR repeat-like domain protein [Helicobacter pylori R056a]
Length = 372
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKKELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|385220636|ref|YP_005782108.1| hypothetical protein HPIN_03975 [Helicobacter pylori India7]
gi|317009443|gb|ADU80023.1| hypothetical protein HPIN_03975 [Helicobacter pylori India7]
Length = 323
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 132/317 (41%), Gaps = 44/317 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 13 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSHYS 72
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGC-----------MKRSYNKGVTWSE 123
NP+LF L + +LLF +G + W+ ++ + + ++ S
Sbjct: 73 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVRKLSLSP 130
Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
L I KNKP+ +G + E + ++ D S R+ + PN
Sbjct: 131 FLNLSHLI----KNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKQNSPRE---LLRPN 181
Query: 184 ESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+QP PF + ++ + M E+C W T L N N +
Sbjct: 182 ALNHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDAL 237
Query: 242 DGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
+ + L + L L +N +S ++ LSK + + L + + A E SYP+ +
Sbjct: 238 NLINL-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNS 294
Query: 300 GSVHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 295 HFIDIVYTYNRSHIKHI 311
>gi|386746260|ref|YP_006219477.1| hypothetical protein HPB14_03790 [Helicobacter pylori HUP-B14]
gi|384552509|gb|AFI07457.1| hypothetical protein HPB14_03790 [Helicobacter pylori HUP-B14]
Length = 372
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQS--PIIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSETFTILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|385228312|ref|YP_005788245.1| hypothetical protein HPPN120_02855 [Helicobacter pylori Puno120]
gi|344334750|gb|AEN15194.1| hypothetical protein HPPN120_02855 [Helicobacter pylori Puno120]
Length = 372
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSFYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIQFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C +W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTSWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|421711574|ref|ZP_16150917.1| BNR repeat-like domain protein [Helicobacter pylori R030b]
gi|407212723|gb|EKE82585.1| BNR repeat-like domain protein [Helicobacter pylori R030b]
Length = 372
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420519358|ref|ZP_15017802.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-5b]
gi|393128450|gb|EJC28894.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-5b]
Length = 372
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|218682619|ref|ZP_03530220.1| putative glycosyl hydrolase protein [Rhizobium etli CIAT 894]
Length = 358
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 101/252 (40%), Gaps = 27/252 (10%)
Query: 22 APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
+P HA+ + + G +FGG+ EG D+ I++ G RW P ++P
Sbjct: 35 SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLAPGAERWSEPEKMSDDPKK 94
Query: 80 PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
NP++F P L Y Q G + + S + G T+ L PG
Sbjct: 95 SEQNPLIFNAPDRKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVRVLCDSPGTF-- 151
Query: 135 SKNKPILLENG-------LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
+ ++ NG + C G + W N+ + + ++ D G SW+ + IP+
Sbjct: 152 --VRQQIVVNGRGDWLLPIFRCVGLDGQRWSGNADTAAVLISRDRGASWQMHD---IPDS 206
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+V + + F + S S DGG TWS KPT+L N NS I
Sbjct: 207 IGAVHMNILPLGGGEMIAFYRNRF--AENILFSRSSDGGETWSPPKPTELPNNNSSIQAT 264
Query: 245 KLKDGRLLLAYN 256
L D + + YN
Sbjct: 265 MLNDAAIAMVYN 276
>gi|384894376|ref|YP_005768425.1| hypothetical protein HPSAT_03905 [Helicobacter pylori Sat464]
gi|308063630|gb|ADO05517.1| hypothetical protein HPSAT_03905 [Helicobacter pylori Sat464]
Length = 337
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 27 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 86
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 87 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|149174320|ref|ZP_01852947.1| probable sialidase [Planctomyces maris DSM 8797]
gi|148846865|gb|EDL61201.1| probable sialidase [Planctomyces maris DSM 8797]
Length = 382
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 106/245 (43%), Gaps = 24/245 (9%)
Query: 83 NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP-PGILGPSKNKPIL 141
+P L +L +L Y + + + ++RS + G TWSE++Q+ PG + + +
Sbjct: 119 HPNLIRLSEQEILFSYVGWDDENQRNVFLRRSTDNGKTWSEQQQISEPGWYCCNADHALR 178
Query: 142 LENGLLL-----------CGSSVESWNSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVI 189
L G +L G + S++ + D R+W R + P
Sbjct: 179 LSTGRILIPAHGPYAKQYVGGTSYKGGDLHSFVFYSDDGFRTWKRSANSMTAPGRGCH-- 236
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KD 248
+P L +R+ N R+ S S DGG WS +PT+L +P S ++
Sbjct: 237 EPAIVELKEGRLLCFLRNTNQ--RLYQSFSDDGGAHWSRPEPTKLPSPESPAIVKRIPST 294
Query: 249 GRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G LLL +N V+ R L A+S+DDG +W ++ A + +YP++ +D +
Sbjct: 295 GDLLLLWNNVASPTNWPRTPLSAAISQDDGKTWGHFKDIDARPAFDAAYPSITFVNDEAF 354
Query: 303 HITYT 307
Y+
Sbjct: 355 IAYYS 359
>gi|385229873|ref|YP_005789789.1| hypothetical protein HPPN135_02860 [Helicobacter pylori Puno135]
gi|344336311|gb|AEN18272.1| hypothetical protein HPPN135_02860 [Helicobacter pylori Puno135]
Length = 372
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|412994058|emb|CCO14569.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
Length = 692
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 133/357 (37%), Gaps = 70/357 (19%)
Query: 2 KKDCGIKG-----LVAEEFTFPANSAPFKSCHASTIVEVDKGH----FLVAYFGGSCEGA 52
K CG K ++ + F F N FK H I E K S EGA
Sbjct: 251 KNGCGSKEAKKNIVLKQSFLF-RNGESFKYGHMVMIGEAPKNSPWKWAATWQASNSFEGA 309
Query: 53 PDVKIWLQTFKD--GRWQSPI---IADEEPNVPMWNPVLFKLPSNGLL-LFYKIGQEVQK 106
KI + D G W P+ I +W PV K P+ G+ LFY +K
Sbjct: 310 FGQKIVISFADDPTGTWTPPMVIPIPVTPSTAAVWGPVWHKDPNTGVAWLFYAASTSCRK 369
Query: 107 WSGCMKRSYN-----------KGVTWSEREQLPPGIL---GPSKNKPILLENGLLL---- 148
+G K Y G+ WS P IL S + P L+ N L +
Sbjct: 370 NAGHGKVKYAPGGDIQVIKSLDGIQWS----APKTILMQAAESNHLPKLVSNQLAVHEPT 425
Query: 149 ----------------CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
C SV + + ++ D G+SW G + V
Sbjct: 426 GYWVLPMWRESPHSNTCKPSVSKGTRTSAGVLLSKDQGKSWSVVGSYRARGVRWLIEGTV 485
Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD---- 248
+ L +L RS G + S DGG +WS A+ T + NPNS ++ ++L+
Sbjct: 486 AEVGDTKDLLLLHRS--GEKYLYKQISKDGGKSWSNAERTSVPNPNSKVNLIRLESSKGA 543
Query: 249 -----GRLLLAYNTVSRGV---LKVALSKDDGDSWHDALTLEENL-AMEFSYPAVIQ 296
G L LAYN GV L VALS DG W LE++ + F+YP + Q
Sbjct: 544 PTNLGGSLALAYNDAMSGVRRKLSVALS-SDGHKWSKLQLLEDSSPGLHFNYPTIAQ 599
>gi|420453401|ref|ZP_14952240.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-8]
gi|393071009|gb|EJB71798.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-8]
Length = 372
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420455377|ref|ZP_14954206.1| hypothetical protein HPHPA14_0817 [Helicobacter pylori Hp A-14]
gi|393072349|gb|EJB73127.1| hypothetical protein HPHPA14_0817 [Helicobacter pylori Hp A-14]
Length = 372
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKINRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420474079|ref|ZP_14972755.1| hypothetical protein HPHPH19_0917 [Helicobacter pylori Hp H-19]
gi|393089215|gb|EJB89855.1| hypothetical protein HPHPH19_0917 [Helicobacter pylori Hp H-19]
Length = 372
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLID 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|384889215|ref|YP_005763517.1| hypothetical protein HPV225_0586 [Helicobacter pylori v225d]
gi|297379781|gb|ADI34668.1| Hypothetical protein HPV225_0586 [Helicobacter pylori v225d]
Length = 337
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVK+ F K RW I + + +
Sbjct: 27 SVHASALISLPNDNLLSAYFSGTKEGARDVKVSANLFDSKINRWSEAFIILTKEELSYYS 86
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-----WSEREQLPP 129
NP+LF L ++ +LLF +G + W+ + + ++ L P
Sbjct: 87 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQLESALEPIRFRFVQKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNSHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|108562984|ref|YP_627300.1| hypothetical protein HPAG1_0559 [Helicobacter pylori HPAG1]
gi|107836757|gb|ABF84626.1| hypothetical protein HPAG1_0559 [Helicobacter pylori HPAG1]
Length = 372
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYTLLLK--FDKNNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420400167|ref|ZP_14899369.1| hypothetical protein HPCPY3281_0734 [Helicobacter pylori CPY3281]
gi|393018384|gb|EJB19534.1| hypothetical protein HPCPY3281_0734 [Helicobacter pylori CPY3281]
Length = 337
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 27 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSYYS 86
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 87 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 145 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYP--LLLKFDKHNNPRE---LLRPNALNH 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NQELYLIHNPSNSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|421719657|ref|ZP_16158941.1| BNR repeat-like domain protein [Helicobacter pylori R046Wa]
gi|407220980|gb|EKE90785.1| BNR repeat-like domain protein [Helicobacter pylori R046Wa]
Length = 372
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWKKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELFLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|210134781|ref|YP_002301220.1| neuraminidase [Helicobacter pylori P12]
gi|210132749|gb|ACJ07740.1| neuraminidase [Helicobacter pylori P12]
Length = 372
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 137/326 (42%), Gaps = 39/326 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
+FT P + S HAS ++ + + L AYF G+ EGA DVKI F K RW
Sbjct: 51 QFTIPKPNGAL-SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKINRWSEAF 109
Query: 72 IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I + + + NP+LF L + +LLF +G + W+ +
Sbjct: 110 IILTKEELSHYSHEYIKKLGNPLLF-LHEDKILLFV-VGVSMGGWATSKIYQLESTLEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
+ ++ L P + KNKP+ +G + E + ++ D + R+
Sbjct: 168 HFKFVQKLSLSPFLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNSRE 225
Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ PN +QP PF + ++ + M E+C W T
Sbjct: 226 ---LLRPNTLNHQLQPSLTPF----KNCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTN 278
Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFS 290
L N N ++ + L + L L +N +S ++ LSK ++ +S+ L++ A E S
Sbjct: 279 LKNLNDALNLINLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKILKVLDK--ANEVS 335
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
YP+ + + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNPHFIDIVYTYNRSHIKHI 360
>gi|420396383|ref|ZP_14895603.1| hypothetical protein HPCPY1313_0212 [Helicobacter pylori CPY1313]
gi|393014374|gb|EJB15547.1| hypothetical protein HPCPY1313_0212 [Helicobacter pylori CPY1313]
Length = 372
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSYYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWKKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|322379165|ref|ZP_08053558.1| hypothetical protein HSUHS1_0796 [Helicobacter suis HS1]
gi|321148396|gb|EFX42903.1| hypothetical protein HSUHS1_0796 [Helicobacter suis HS1]
Length = 354
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 47/323 (14%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADEEPNV- 79
P KS H++T+ + + L AYFGGS EG DVKI+ L + K WQ P +
Sbjct: 40 PVKSVHSATLSALSNDYLLAAYFGGSAEGLSDVKIYANLYSIKTQTWQKAFALLSAPELS 99
Query: 80 --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT----WSEREQL 127
+ NPVLF + L + +G + W+ +T +S + L
Sbjct: 100 RLANAYVKALGNPVLFT--QHDRLYLFVVGVSLGGWATSRIYMLVSNITPPFHFSLLQIL 157
Query: 128 PPGILGPSKNKPILLENGLLLC---GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
P L P N L+ +L G + S++ + + R I PN
Sbjct: 158 P---LSPFLNISHLVRTAVLPTTDGGFLLPSYHELAHKFPLLLKFDTQDRLEFLIK-PNT 213
Query: 185 SLSVIQPV--PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+QP P+ + +RS+ + +++C L W+ P L N ++ +
Sbjct: 214 LKQQLQPTLAPYKACA---LMALRSYK--TQQFYTQTCKTPLQWN---PPHLSNLHNYDN 265
Query: 243 GVKLKDGR--LLLAYN------TVSRGVLKVALSKDDGDSWHDALT-LEENLAMEFSYPA 293
+ L + L L YN R L +A K D + LT L+ +L E SYPA
Sbjct: 266 SIVLFNANHTLYLLYNEPLKNSNNPRSALYLAKFK---DGYFSPLTILDSSLKGEVSYPA 322
Query: 294 VIQASDGSVHITYTYNRTQIKHV 316
V+ + +H+ YT R I H+
Sbjct: 323 VLNF-NNQIHVLYTKERKAIMHL 344
>gi|420460081|ref|ZP_14958880.1| hypothetical protein HPHPA27_0567 [Helicobacter pylori Hp A-27]
gi|393077183|gb|EJB77932.1| hypothetical protein HPHPA27_0567 [Helicobacter pylori Hp A-27]
Length = 372
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 136/326 (41%), Gaps = 39/326 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
+FT P S HAS ++ + + L AYF G+ EGA DVKI F K RW
Sbjct: 51 QFTIPRPDDAL-SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAF 109
Query: 72 IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I + + + NP+LF L + +LLF +G + W+ + +
Sbjct: 110 ILLTKEELSHYSHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
+ + L P + KNKP+ +G + E + ++ D + R+
Sbjct: 168 HFKFVRKLSLSPFLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE 225
Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ PN +QP PF + ++ + M E+C W T
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTIWQKPMLTN 278
Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFS 290
L N N ++ + L + L L +N +S ++ LSK ++ +S+ L++ A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVS 335
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
YP+ + + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNPHFIDIVYTYNRSHIKHI 360
>gi|52843133|ref|YP_096932.1| BNR/Asp box repeat-containing protein [Legionella pneumophila
subsp. pneumophila str. Philadelphia 1]
gi|52630244|gb|AAU28985.1| hypothetical BNR/Asp box repeat protein [Legionella pneumophila
subsp. pneumophila str. Philadelphia 1]
Length = 390
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 80/340 (23%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
K H++T+ +VD H L YF GS EG PDVKI+ + R W +P I +
Sbjct: 71 KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSTPKIVLTREKLRKN 130
Query: 80 ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
+ NPV++ NG L + + W+ S +KG +++ + L
Sbjct: 131 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 185
Query: 131 ILGPSKN-------KPI-LLENGLLL-CGSSVES-------WNSWGSWMEVTVDAGRSWR 174
L P N P+ L++ G L +ES ++ GS+ E ++
Sbjct: 186 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTE-SIQINHIGG 244
Query: 175 KYGPIYIPNESLSVIQPVPFHTANRT---LRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
+ P IP SL+ + F AN+T +R+ + C +GGL+W +
Sbjct: 245 QLQPSIIPLSSLNCL--AGFRNANKTDHWMRI---------QYCQ----EGGLSWLDPQK 289
Query: 232 TQLLNPNSGIDGVKLKD---------------GRLLLAYNTVSRGVLKVALSKDDGDSWH 276
T + N N+ I+ G+L+L+ S G+ + D G+ +
Sbjct: 290 TNIKNMNNSINLFNFNSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINN 349
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
++T YP+ + G V I Y N ++ H+
Sbjct: 350 KSVT----------YPSTL-VDRGYVDIIYDKNANEVIHI 378
>gi|378778819|ref|YP_005187261.1| putative BNR/Asp box repeat protein [Legionella pneumophila subsp.
pneumophila ATCC 43290]
gi|364509637|gb|AEW53161.1| putative BNR/Asp box repeat protein [Legionella pneumophila subsp.
pneumophila ATCC 43290]
Length = 395
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 80/340 (23%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
K H++T+ +VD H L YF GS EG PDVKI+ + R W +P I +
Sbjct: 76 KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSTPKIVLTREKLRKN 135
Query: 80 ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
+ NPV++ NG L + + W+ S +KG +++ + L
Sbjct: 136 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 190
Query: 131 ILGPSKN-------KPI-LLENGLLL-CGSSVES-------WNSWGSWMEVTVDAGRSWR 174
L P N P+ L++ G L +ES ++ GS+ E ++
Sbjct: 191 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTE-SIQINHIGG 249
Query: 175 KYGPIYIPNESLSVIQPVPFHTANRT---LRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
+ P IP SL+ + F AN+T +R+ + C +GGL+W +
Sbjct: 250 QLQPSIIPLSSLNCL--AGFRNANKTDHWMRI---------QYCQ----EGGLSWLDPQK 294
Query: 232 TQLLNPNSGIDGVKLKD---------------GRLLLAYNTVSRGVLKVALSKDDGDSWH 276
T + N N+ I+ G+L+L+ S G+ + D G+ +
Sbjct: 295 TNIKNMNNSINLFNFNSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINN 354
Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
++T YP+ + G V I Y N ++ H+
Sbjct: 355 KSVT----------YPSTL-VDRGYVDIIYDKNANEVIHI 383
>gi|421712896|ref|ZP_16152227.1| BNR repeat-like domain protein [Helicobacter pylori R32b]
gi|407216262|gb|EKE86099.1| BNR repeat-like domain protein [Helicobacter pylori R32b]
Length = 372
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPFYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPYFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|322379649|ref|ZP_08053968.1| hypothetical neuraminidase (sialidase) [Helicobacter suis HS5]
gi|321147911|gb|EFX42492.1| hypothetical neuraminidase (sialidase) [Helicobacter suis HS5]
Length = 354
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 47/323 (14%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADEEPNV- 79
P KS H++T+ + + L AYFGGS EG DVKI+ L + K WQ P +
Sbjct: 40 PVKSVHSATLSALSNDYLLAAYFGGSAEGLSDVKIYANLYSIKTQTWQKTFALLSAPELS 99
Query: 80 --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT----WSEREQL 127
+ NPVLF + L + +G + W+ +T +S + L
Sbjct: 100 RLANAYVKALGNPVLFT--QHDRLYLFVVGVSLGGWATSRIYMLVSNITPPFHFSLLQIL 157
Query: 128 PPGILGPSKNKPILLENGLLLC---GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
P L P N L+ +L G + S++ + + R I PN
Sbjct: 158 P---LSPFLNISHLVRTAVLPTTDGGFLLPSYHELAHKFPLLLKFDTQDRLEFLIK-PNT 213
Query: 185 SLSVIQPV--PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
+QP P+ + +RS+ + +++C L W+ P L N ++ +
Sbjct: 214 LKQQLQPTLAPYKACA---LMALRSYK--TQQFYTQTCKTPLQWN---PPHLSNLHNYDN 265
Query: 243 GVKLKDGR--LLLAYN------TVSRGVLKVALSKDDGDSWHDALT-LEENLAMEFSYPA 293
+ L + L L YN R L +A K D + LT L+ +L E SYPA
Sbjct: 266 SIVLFNANHTLYLLYNEPLKNSNNPRSALYLAKFK---DGYFSPLTILDSSLKGEVSYPA 322
Query: 294 VIQASDGSVHITYTYNRTQIKHV 316
V+ + +H+ YT R I H+
Sbjct: 323 VLNF-NNQIHVLYTKERKAIMHL 344
>gi|407459486|ref|YP_006737589.1| hypothetical protein B602_0720 [Chlamydia psittaci M56]
gi|405786183|gb|AFS24928.1| hypothetical protein B602_0720 [Chlamydia psittaci M56]
Length = 357
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 132/310 (42%), Gaps = 35/310 (11%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
S+ ++ STIVE G LV + G C D+ IW + ++ +S +
Sbjct: 31 SSDNRTSGMSTIVET-SGRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
+ +PVL ++ S + LFY+ Q + S + S+N G W E LPPGI
Sbjct: 86 S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136
Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW----GSWMEVTVDAGRSWR-KYGPIYIPNE 184
G ++N P + + N + + +V SW+E+ SW + GPI +
Sbjct: 137 EGTTRNPPYVDRIRNRVYIPSYNVYEMPGQEVIGDSWVEIYNFFSTSWEGRMGPILGGSR 196
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+ I+P + L + R+ + +CM+ S + G +WS K + ++ I
Sbjct: 197 NQVPIEPAIVKLTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAV 256
Query: 244 VKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
++G+L L N+ G L S ++G W+ ++ +++ P+ G +
Sbjct: 257 QSGRNGKLYLFANSAPAGERLTCFSSYNEGIIWNHPKLIDSGYSVD---PSAYLDRQGCM 313
Query: 303 HITYTYNRTQ 312
H+ YT Q
Sbjct: 314 HLVYTSKDDQ 323
>gi|418936222|ref|ZP_13489953.1| putative glycosyl hydrolase protein, partial [Rhizobium sp.
PDO1-076]
gi|375057060|gb|EHS53253.1| putative glycosyl hydrolase protein, partial [Rhizobium sp.
PDO1-076]
Length = 207
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 67/164 (40%), Gaps = 58/164 (35%)
Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--------------- 257
++ S S D G WS +PT L N NS I + LK+G + + YN
Sbjct: 38 QILYSWSDDDGRNWSPPQPTVLPNNNSSIQAIDLKNGSIAIVYNHSNAESSDQRRLSLYD 97
Query: 258 -------------------------------VSRGVLKVALSKDDGDSWHDALTL----- 281
V R L +A S DDG S+ + + L
Sbjct: 98 EIEGEGGGEAVDEQAEPATGGTYDGPRAVWGVPRAPLSLAFSYDDGVSFLEHVDLDTGDG 157
Query: 282 -------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+++L EFSYP++I+ DG++H+ YTY R IK+V +
Sbjct: 158 YCLSNNSQQSLNREFSYPSIIEGPDGTLHVAYTYFRRAIKYVRI 201
>gi|420446924|ref|ZP_14945819.1| hypothetical protein HPHPH43_0612 [Helicobacter pylori Hp H-43]
gi|393064634|gb|EJB65469.1| hypothetical protein HPHPH43_0612 [Helicobacter pylori Hp H-43]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHEDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420427208|ref|ZP_14926253.1| hypothetical protein HPHPA9_1075 [Helicobacter pylori Hp A-9]
gi|393042141|gb|EJB43152.1| hypothetical protein HPHPA9_1075 [Helicobacter pylori Hp A-9]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIVLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G G + ++ + + + + + PN
Sbjct: 180 FLNLSHLIKNKPLSTTDG----GFVLPLYHELATQYPLLLKFDKQ-NNPKELLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|171909756|ref|ZP_02925226.1| hypothetical protein VspiD_01250 [Verrucomicrobium spinosum DSM
4136]
Length = 391
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 103/232 (44%), Gaps = 46/232 (19%)
Query: 110 CMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV-- 167
C+ RS + G+TWS P + S + P L++G LL W ++ +
Sbjct: 174 CL-RSIDGGITWS----TPIKTIVNSPHGPTQLKDGRLLYAGK----QLWSEDKKIGICE 224
Query: 168 --DAGRSWRKYGPI------YIPN--ESLSVIQPVPFHTANRTLRVLMRSFNG--IGRVC 215
D G +W+ I +PN L I+ A+ T+ V +R+ N G
Sbjct: 225 SKDDGLTWQWLSEIPAREGDAVPNGYHELHAIE-----AADGTIIVQIRNHNKQHAGETL 279
Query: 216 MSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL-KVAL 267
+ES DGG TWS + P+ LL KL+DGRLL+ Y + +V +
Sbjct: 280 QTESSDGGKTWSTPHSIGVWGLPSHLL---------KLRDGRLLMTYGHRRKPFGNQVRV 330
Query: 268 SKDDGDSWHDALTL-EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SKD+G +W +A L E+ + + YP+ ++ DGS + + + K VL
Sbjct: 331 SKDNGKTWGEASILSEDGIGGDLGYPSTVELGDGSFLSVWYESMKEPKKAVL 382
>gi|421718196|ref|ZP_16157495.1| BNR repeat-like domain protein [Helicobacter pylori R038b]
gi|407221462|gb|EKE91266.1| BNR repeat-like domain protein [Helicobacter pylori R038b]
Length = 347
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 37 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFTILTKEDLSHHS 96
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 97 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPVHFKFARKLSLSP 154
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 155 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 209
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 266 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 321
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335
>gi|420404284|ref|ZP_14903467.1| hypothetical protein HPCPY6261_1462 [Helicobacter pylori CPY6261]
gi|393018163|gb|EJB19314.1| hypothetical protein HPCPY6261_1462 [Helicobacter pylori CPY6261]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCALMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIS 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420497543|ref|ZP_14996103.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25]
gi|420527849|ref|ZP_15026243.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25c]
gi|393113822|gb|EJC14340.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25]
gi|393134975|gb|EJC35384.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25c]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|385249100|ref|YP_005777319.1| hypothetical protein HPF57_0605 [Helicobacter pylori F57]
gi|317181895|dbj|BAJ59679.1| hypothetical protein HPF57_0605 [Helicobacter pylori F57]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L YF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASALISLPNDNLLSTYFSGTKEGARDVKISANLFDSKINRWSKAFVILTKEELSHYS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN +
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|385226812|ref|YP_005786736.1| hypothetical protein HPSNT_02980 [Helicobacter pylori SNT49]
gi|344331725|gb|AEN16755.1| hypothetical protein HPSNT_02980 [Helicobacter pylori SNT49]
Length = 372
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I ++ +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTREDLSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KN P+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N+ + + + LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNSSDSSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|397665554|ref|YP_006507092.1| hypothetical protein LPO_3264 [Legionella pneumophila subsp.
pneumophila]
gi|395128965|emb|CCD07186.1| conserved protein of unknown function [Legionella pneumophila
subsp. pneumophila]
Length = 383
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 127/316 (40%), Gaps = 32/316 (10%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
K H++T+ +VD H L YF GS EG PDVKI+ + R W +P I +
Sbjct: 64 KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSAPKIVLTREKLCKN 123
Query: 80 ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
+ NPV++ NG L + + W+ S +KG +++ + L
Sbjct: 124 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 178
Query: 131 ILGPSKNKPILLENG---LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
L P N L+ L+ G + +++ S + + + I I N
Sbjct: 179 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTESIQI-NHIGG 237
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC-DGGLTWSYAKPTQLLNPNSGIDGVKL 246
+QP +N R+ N I + C +GGL+W + T + N N+ I+
Sbjct: 238 QLQPSIIPLSNLNCLAGFRNANKIDHWMRIQYCQEGGLSWLDPQKTNIKNMNNSINLFNF 297
Query: 247 KDGRLLLAYNTVSRGVLK--VALSKDDGD----SWHDALTLEENLAMEFSYPAVIQASDG 300
+ + L +N + +K + LS+ + S+ L E +YP+ + G
Sbjct: 298 -NSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINNKSVTYPSTL-VDRG 355
Query: 301 SVHITYTYNRTQIKHV 316
V I Y N ++ H+
Sbjct: 356 YVDIIYDKNANEVIHI 371
>gi|15645205|ref|NP_207375.1| hypothetical protein HP0580 [Helicobacter pylori 26695]
gi|410023813|ref|YP_006893066.1| hypothetical protein C695_02995 [Helicobacter pylori Rif1]
gi|410501580|ref|YP_006936107.1| hypothetical protein C730_02995 [Helicobacter pylori Rif2]
gi|410682100|ref|YP_006934502.1| hypothetical protein C694_02990 [Helicobacter pylori 26695]
gi|419417467|ref|ZP_13957901.1| hypothetical protein HP79_02214 [Helicobacter pylori P79]
gi|2313698|gb|AAD07646.1| predicted coding region HP0580 [Helicobacter pylori 26695]
gi|384372982|gb|EIE28534.1| hypothetical protein HP79_02214 [Helicobacter pylori P79]
gi|409893741|gb|AFV41799.1| hypothetical protein C694_02990 [Helicobacter pylori 26695]
gi|409895470|gb|AFV43392.1| hypothetical protein C695_02995 [Helicobacter pylori Rif1]
gi|409897131|gb|AFV44985.1| hypothetical protein C730_02995 [Helicobacter pylori Rif2]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|425790554|ref|YP_007018471.1| hypothetical protein HPAKL86_01960 [Helicobacter pylori Aklavik86]
gi|425628869|gb|AFX89409.1| hypothetical protein HPAKL86_01960 [Helicobacter pylori Aklavik86]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSYYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPILTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NQELYLIHNPSDLSMRRKELLLSKLENSNSFRTLKILDK--ANEVSYPSY-SFNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|383749753|ref|YP_005424856.1| hypothetical protein HPELS_03670 [Helicobacter pylori ELS37]
gi|380874499|gb|AFF20280.1| hypothetical protein HPELS_03670 [Helicobacter pylori ELS37]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 132/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALESIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|384896119|ref|YP_005770108.1| hypothetical protein HMPREF4655_21021 [Helicobacter pylori 35A]
gi|315586735|gb|ADU41116.1| conserved hypothetical protein [Helicobacter pylori 35A]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW + + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L N +LLF +G + W+ + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNY 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCALMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420501367|ref|ZP_14999911.1| hypothetical protein HPHPP30_1200 [Helicobacter pylori Hp P-30]
gi|393150173|gb|EJC50481.1| hypothetical protein HPHPP30_1200 [Helicobacter pylori Hp P-30]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420406765|ref|ZP_14905935.1| hypothetical protein HPCPY6311_0608 [Helicobacter pylori CPY6311]
gi|393023602|gb|EJB24716.1| hypothetical protein HPCPY6311_0608 [Helicobacter pylori CPY6311]
Length = 372
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|384899112|ref|YP_005774492.1| hypothetical protein HPF30_0747 [Helicobacter pylori F30]
gi|317179056|dbj|BAJ56844.1| hypothetical protein HPF30_0747 [Helicobacter pylori F30]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420438606|ref|ZP_14937580.1| hypothetical protein HPHPH29_0669 [Helicobacter pylori Hp H-29]
gi|393056206|gb|EJB57118.1| hypothetical protein HPHPH29_0669 [Helicobacter pylori Hp H-29]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SAHASSLIGLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420494076|ref|ZP_14992645.1| hypothetical protein HPHPP16_0724 [Helicobacter pylori Hp P-16]
gi|393110877|gb|EJC11401.1| hypothetical protein HPHPP16_0724 [Helicobacter pylori Hp P-16]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420425260|ref|ZP_14924321.1| hypothetical protein HPHPA5_0789 [Helicobacter pylori Hp A-5]
gi|393041678|gb|EJB42692.1| hypothetical protein HPHPA5_0789 [Helicobacter pylori Hp A-5]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L YF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLIGLPNDNLLSTYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLID 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|373849440|ref|ZP_09592241.1| hypothetical protein Opit5DRAFT_0295 [Opitutaceae bacterium TAV5]
gi|372475605|gb|EHP35614.1| hypothetical protein Opit5DRAFT_0295 [Opitutaceae bacterium TAV5]
Length = 386
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/217 (26%), Positives = 98/217 (45%), Gaps = 30/217 (13%)
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
MKRS + G+TWS R ++P G + + P+ L +G + ++ ++W V+ D G
Sbjct: 180 MKRSTDGGLTWSPRYKVP----GYNPHGPVALRDGRMFFAAAHDAW--------VSEDDG 227
Query: 171 RSWRKYGPIYI-PNESLSVIQPVPFHTANRTLRVLMRSFNGIG-----RVCMSESCDGGL 224
+W + + P E S+ A+ L V +R+ R+ S S DGG
Sbjct: 228 LTWTLFSKLPTGPGEMHSI------EAADGALIVHVRNKVKTAKGTRQRIIQSRSEDGGK 281
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSWHDALTLEE 283
TW + P+ V+L DG L+ Y ++ LS+D G SW + + +
Sbjct: 282 TWDAPRVVAEGYPSH---LVRLADGTLITTYGWRQAPFGIRGKLSRDHGRSWSEEFVVTD 338
Query: 284 NLA-MEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
+ A + YP+ +Q DGS+ +T Y + H+ L
Sbjct: 339 DGANWDLGYPSTVQLRDGSL-LTVWYEAPEGSHLAGL 374
>gi|444374201|ref|ZP_21173508.1| hypothetical protein C528_01385 [Helicobacter pylori A45]
gi|443621425|gb|ELT81864.1| hypothetical protein C528_01385 [Helicobacter pylori A45]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 133/316 (42%), Gaps = 42/316 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS ++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASALINLPNDNLLNAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L ++ +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNES 185
+ KNKP+ +G G + ++ + + + D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNTL 232
Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+QP PF + V+ + M E+C W T L N N ++
Sbjct: 233 NHQLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWKKPTLTNLKNLNDALNL 288
Query: 244 VKLKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
+ L + L L +N + + LSK ++ +S+ L++ A E SYP+ +
Sbjct: 289 INLNE-ELYLIHNPSDLSLRRKELWLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPH 344
Query: 301 SVHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360
>gi|386755606|ref|YP_006228823.1| hypothetical protein HPPC18_02785 [Helicobacter pylori PeCan18]
gi|384561864|gb|AFI02330.1| hypothetical protein HPPC18_02785 [Helicobacter pylori PeCan18]
Length = 372
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|420466880|ref|ZP_14965637.1| hypothetical protein HPHPH9_0565 [Helicobacter pylori Hp H-9]
gi|393085278|gb|EJB85966.1| hypothetical protein HPHPH9_0565 [Helicobacter pylori Hp H-9]
Length = 372
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|308184363|ref|YP_003928496.1| hypothetical protein HPSJM_02935 [Helicobacter pylori SJM180]
gi|308060283|gb|ADO02179.1| hypothetical protein HPSJM_02935 [Helicobacter pylori SJM180]
Length = 340
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 90/345 (26%), Positives = 146/345 (42%), Gaps = 53/345 (15%)
Query: 5 CGIKGLVAE------EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW 58
C IK L+ + + T P + S HAS+++ + + L AYF G+ EGA DVKI
Sbjct: 4 CIIKPLLLKPPPYFTQLTIPKPNDAL-SVHASSLIGLPNDNLLSAYFSGTKEGARDVKIS 62
Query: 59 LQTF--KDGRWQSP--IIADEE--PNVPMW-----NPVLFKLPSNGLLLFYKIGQEVQKW 107
F K RW I+ EE NV + NP+LF L + +LLF +G + W
Sbjct: 63 ANLFDSKTNRWSEAFTILTKEELSKNVHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGW 120
Query: 108 SGC-----------MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
+ ++ + + ++ S L I KNKP+ +G + E
Sbjct: 121 ATSKIYQLESALEPIRFKFVRKLSLSPFLNLSHLI----KNKPLSTTDGGFMLPLYHELA 176
Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRV 214
+ ++ D + R+ + PN +QP PF + V+ +
Sbjct: 177 TQYPLLLK--FDQQNNPRE---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDS 227
Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA--LSK-DD 271
M E+C W T L N N ++ + L + L L +N + + LSK ++
Sbjct: 228 LMLETCKTPTAWQKPMLTNLKNLNDALNLINL-NKELYLIHNPSDLSLRRKELWLSKLEN 286
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+S+ L++ A E SYP+ + + I YTYNR+ IKH+
Sbjct: 287 PNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIVYTYNRSHIKHI 328
>gi|421716657|ref|ZP_16155966.1| BNR repeat-like domain protein [Helicobacter pylori R037c]
gi|407219878|gb|EKE89689.1| BNR repeat-like domain protein [Helicobacter pylori R037c]
Length = 372
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|387908103|ref|YP_006338437.1| hypothetical protein MWE_0937 [Helicobacter pylori XZ274]
gi|387573038|gb|AFJ81746.1| hypothetical protein MWE_0937 [Helicobacter pylori XZ274]
Length = 358
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HA+ ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 48 SVHANALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSYYS 107
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLNP 165
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 166 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 220
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 221 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 276
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 277 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 332
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346
>gi|412985115|emb|CCO20140.1| BNR/Asp-box repeat protein [Bathycoccus prasinos]
Length = 863
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/91 (38%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 202 RVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR 260
R L++ F G++ SES DGG++W+ A P +L NPNS I+ + L DG+ +AYN+ R
Sbjct: 665 RELVQFFRTATGKIYRSESDDGGISWTSAGPIELPNPNSKINAIVLSDGQTAIAYNSHDR 724
Query: 261 GV--------LKVALSKDDGDSWHDALTLEE 283
L VA+S D G +W +EE
Sbjct: 725 QYGLKQMRVNLTVAVSPDRGKNWAVLAQIEE 755
>gi|343085113|ref|YP_004774408.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
gi|342353647|gb|AEL26177.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
marinum DSM 745]
Length = 372
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 123/271 (45%), Gaps = 34/271 (12%)
Query: 60 QTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
QT+ D Q +I + + + + L +L + + LFY + M+ S ++G
Sbjct: 85 QTWTD---QDKVIVPNDGKMNVMSVSLLRLQNGDIALFYARKNSLSDCIPLMRISKDEGE 141
Query: 120 TWSE-------REQLPPGILGPSKNKPILLENGLLLC------GSSVESWNSWGS-WMEV 165
TWSE RE G + N+ + L+NG L+ S E +N+ G +
Sbjct: 142 TWSEPSPCITDRE----GYFVLNNNRVLQLDNGRLIMPVALHRTSEGEDFNNNGRLFTYF 197
Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
+ D G +W+ + P + + +P N L +++R+ +G+ S+ D G
Sbjct: 198 SDDNGATWKSGREVKNPKKVI-FQEPGLIALKNGKLMMIIRTDDGVQYAAFSK--DLGEK 254
Query: 226 WSYAKPTQLLNPNSGIDGVKL-KDGRLLLAYNT--VSRGV-------LKVALSKDDGDSW 275
W+ AKPT++ +P S ++ G L++ +N V G+ L VA+SKD+G +W
Sbjct: 255 WTPAKPTKIASPLSPATIARIPSTGDLIMVWNNNGVKEGLYKGKRTPLNVAISKDEGKTW 314
Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITY 306
LE++ + Y A+ ++ + ++Y
Sbjct: 315 EKEKVLEDDPDGWYCYTAIHFVNEDQMLLSY 345
>gi|421714835|ref|ZP_16154153.1| BNR repeat-like domain protein [Helicobacter pylori R036d]
gi|407215689|gb|EKE85527.1| BNR repeat-like domain protein [Helicobacter pylori R036d]
Length = 372
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 135/325 (41%), Gaps = 37/325 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
+FT P S HAS ++ + + L AYF G+ EGA DVKI F K RW
Sbjct: 51 QFTIPKPDDAL-SVHASALISLPNDNLLSAYFSGTKEGARDVKISANFFNNKTNRWSEAF 109
Query: 71 -IIADEEPN-------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I+ E+ + + NP+LF L + +LLF +G + W+ + +
Sbjct: 110 TILTKEDLSKNAHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKVYQFESALEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
++ + L P + KNKP+ +G + E + ++ D + R+
Sbjct: 168 RFKFARKLSLSPFLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE 225
Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ PN +QP PF + ++ M E+C W T
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAIMAFRNYSFKDSLMLETCKTPTIWQKPMLTN 278
Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
L N N ++ + L + L L +N +S ++ LSK + + L + + A E SY
Sbjct: 279 LKNLNDALNLINLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSY 336
Query: 292 PAVIQASDGSVHITYTYNRTQIKHV 316
P+ + + I YTYNR+ IKH+
Sbjct: 337 PSY-SLNPHFIDIVYTYNRSHIKHI 360
>gi|386754302|ref|YP_006227520.1| hypothetical protein HPSH112_04130 [Helicobacter pylori Shi112]
gi|384560560|gb|AFI01027.1| hypothetical protein HPSH112_04130 [Helicobacter pylori Shi112]
Length = 372
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 39/326 (11%)
Query: 14 EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
+FT P S HAS ++ + + L AYF G+ EGA DV I F K RW
Sbjct: 51 QFTIPKPDGAL-SVHASALISLPNDNLLSAYFSGTKEGARDVTISANLFDSKINRWSEAF 109
Query: 72 IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
I + + + NP+LF L + +LLF +G + W+ +
Sbjct: 110 IILTKEELSYYSHEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPI 167
Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
+ ++ L P + +NKP+ +G + E + ++ D + R+
Sbjct: 168 HFKFVQKLSLSPFLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE 225
Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ PN +QP PF + V+ + M E+C W T
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTN 278
Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFS 290
L N N ++ + L + L L +N + + + LSK ++ +S+ L++ A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVS 335
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
YP+ + + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNSHFIDIVYTYNRSHIKHI 360
>gi|254428400|ref|ZP_05042107.1| hypothetical protein ADG881_1630 [Alcanivorax sp. DG881]
gi|196194569|gb|EDX89528.1| hypothetical protein ADG881_1630 [Alcanivorax sp. DG881]
Length = 425
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/368 (21%), Positives = 132/368 (35%), Gaps = 73/368 (19%)
Query: 19 ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
A+ A HA+++VE+ G +F G+ EGA DV I F + G W ++
Sbjct: 44 ASDADTSMVHAASMVELPDGRLRAFWFAGTREGAADVSINSAIFDPESGAWSDETVSVTR 103
Query: 77 PNVPM-WNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSERE 125
+ W + KL +G + + + W+ + S + G W
Sbjct: 104 EQISRGWGRHIRKLGNAVPVLDDDGHMRLFVVAVSFGGWAASRLVVMESSDLGANWDFDT 163
Query: 126 QLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L I K PI +G + E +G + + D R G I
Sbjct: 164 ALVATPFLNISTLVKTPPIHYSDGSIGLPVYHEMIGKFGEILRLDQDN----RILGKARI 219
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFN--GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
IQP+ + +R+ N G + S S DGG WS + + NP+S
Sbjct: 220 -GHGRKAIQPLVLVDDSNHAVSFLRNENEPNNGYLYQSSSSDGGWRWSPLEYAPVDNPSS 278
Query: 240 GIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHD-------------------- 277
+ GV + G L+A N R L + ++D G W D
Sbjct: 279 AVGGVAMGPGHWLVATNCNREERDDLCIRETRDAGQHWEDRWFFHNREQWRGNALKPDAF 338
Query: 278 -----------ALTLEENLAME----------------FSYPAVIQASDGSVHITYTYNR 310
A+ ++ L +E + YP +++ +G +H+ YT+N+
Sbjct: 339 ADLIDDEFVGKAIPEDQRLFLERVKANKCQPHRGCEFQYDYPYMVRTRNGDLHLLYTWNK 398
Query: 311 TQIKHVVL 318
T I+H L
Sbjct: 399 TAIRHAWL 406
>gi|116071744|ref|ZP_01469012.1| hypothetical protein BL107_06329 [Synechococcus sp. BL107]
gi|116065367|gb|EAU71125.1| hypothetical protein BL107_06329 [Synechococcus sp. BL107]
Length = 406
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 135/360 (37%), Gaps = 75/360 (20%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
HA + + + G EG + ++L + G W P ++ NP+
Sbjct: 41 HAPQLCWIRPDLLGCVWMAGGGEGTAGMSVYLSLLSSEQGSWGVPQCISQDLERSEQNPL 100
Query: 86 LFKLPSNGLLLFYKIGQEVQ----------------KWSGCMKRSYNK------------ 117
LF S+G+L Q V+ +W+ ++ K
Sbjct: 101 LFL--SDGVLHLIHSAQRVRDPSDRTWAEAQSSFSMQWTAVLRHQTLKIDVIDLGRPESW 158
Query: 118 -GVTWSEREQLPPGILGPS--KNKPILLENG--LLLCGSSVESWNSWGSWMEVTVDAGRS 172
G WS + L I P+ ++ P L +G LL S+E S+G + +
Sbjct: 159 EGQAWSAAKDL---IETPAFCRHPPHRLADGTWLLPIYRSLEQGGSFGHDHSEVLQLDAA 215
Query: 173 WRKYG-PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
G P +P + V + TL +RS C S S D GL+WS +
Sbjct: 216 GVPMGSPFAVPESTGRVHGSIVPSADGTTLHQFLRSRLADWVYC-STSEDAGLSWSSPQS 274
Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTV----------------------SRGVLKVALSK 269
T L N NS I +L+ GRL L +N +R L +A+S
Sbjct: 275 TDLPNNNSSIQACRLESGRLALIFNRFGFLPDPNDPDASSAWGEARWPRTRWPLSIAISA 334
Query: 270 DDGDSW---HDALT-------LEENLAMEFSYPAVIQASDGSVHITYTY-NRTQIKHVVL 318
DDG SW D T + +L + +YP +++ G +H+ Y++ R I+++ L
Sbjct: 335 DDGQSWPWIRDIDTGFGFCGPMNWDLNGQLAYPTLLEGRPGELHVAYSWAERQAIRYLSL 394
>gi|420398787|ref|ZP_14897998.1| hypothetical protein HPCPY1962_0970 [Helicobacter pylori CPY1962]
gi|393013016|gb|EJB14193.1| hypothetical protein HPCPY1962_0970 [Helicobacter pylori CPY1962]
Length = 372
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 132/316 (41%), Gaps = 42/316 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-------WSEREQL 127
NP+LF L + +LLF +G + W+ + Y T + + L
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATS--KIYQLESTLEPIHFKFVRKLSL 177
Query: 128 PP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
P + +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 178 SPFLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNAL 232
Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
+QP PF + ++ + M E+C W T L N N ++
Sbjct: 233 NYQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNL 288
Query: 244 VKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
+ L + L L +N + + + LSK ++ +S+ L++ A E SYP+ +
Sbjct: 289 INL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSH 344
Query: 301 SVHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360
>gi|406834050|ref|ZP_11093644.1| hypothetical protein SpalD1_20489 [Schlesneria paludicola DSM
18645]
Length = 400
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 103/216 (47%), Gaps = 33/216 (15%)
Query: 102 QEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNS-W 159
+EV W M RS + GVTWS R Q+P S + P+ L +G LL G+++ + +
Sbjct: 178 REVGTW---MWRSTDGGVTWSARYQVPCN----SPHGPVALASGRLLYIGAALWADDRVV 230
Query: 160 GSWMEVTVDAGRSWR---KYGPIYIPN-ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
G+W + D G +W+ ++ P ++ + V + ++ +
Sbjct: 231 GTWE--STDDGVTWQLLSRFTPRPTDQTKNYHELHGVECVSGKIIAQIRNHNTQNAHETL 288
Query: 216 MSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA-L 267
SES DGG TWS + P+ LL +LKDGRLL++Y A +
Sbjct: 289 QSESTDGGKTWSIPHSIGVWGLPSHLL---------RLKDGRLLMSYGYRRAPYGNQARM 339
Query: 268 SKDDGDSWHDALTL-EENLAMEFSYPAVIQASDGSV 302
S+D+G +W D +T+ ++ + + YP+ ++ DG++
Sbjct: 340 SQDEGATWSDPITISDDGIRSDLGYPSTVECDDGTL 375
>gi|385215823|ref|YP_005775779.1| hypothetical protein HPF32_0555 [Helicobacter pylori F32]
gi|317180351|dbj|BAJ58137.1| hypothetical protein HPF32_0555 [Helicobacter pylori F32]
Length = 372
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KN P+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNSHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|384887761|ref|YP_005762272.1| hypothetical protein HPKB_0764 [Helicobacter pylori 52]
gi|261839591|gb|ACX99356.1| hypothetical protein HPKB_0764 [Helicobacter pylori 52]
Length = 372
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI + F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISVNLFDSKINRWSEAFIILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRD---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + + + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAAMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420458413|ref|ZP_14957223.1| hypothetical protein HPHPA26_0579 [Helicobacter pylori Hp A-26]
gi|393075934|gb|EJB76688.1| hypothetical protein HPHPA26_0579 [Helicobacter pylori Hp A-26]
Length = 372
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW + ++ +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFTILTKEDLSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KN P+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNSHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|384897290|ref|YP_005772718.1| hypothetical protein HPLT_02885 [Helicobacter pylori Lithuania75]
gi|317012395|gb|ADU83003.1| hypothetical protein HPLT_02885 [Helicobacter pylori Lithuania75]
Length = 310
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 129/310 (41%), Gaps = 34/310 (10%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------ADEE 76
HAS ++ + + L AYF G+ EGA DVKI F K RW I E
Sbjct: 2 HASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFILLTKEELSHHSHE 61
Query: 77 PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP-- 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 62 YIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSPFL 119
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
+ KNKP+ +G + E + ++ D + R+ + PN +
Sbjct: 120 NLSHLIKNKPLSTTDGGFVLPLYHELATQYTLLLK--FDKNNNPRE---LLRPNALNHQL 174
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
QP T + ++ + M E+C W T L N N ++ + L +
Sbjct: 175 QPSL--TPLKDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLINLNE- 231
Query: 250 RLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
L L +N +S ++ LSK ++ +S+ L++ A E SYP+ + + I Y
Sbjct: 232 ELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFIDIVY 288
Query: 307 TYNRTQIKHV 316
TYNR+ IKH+
Sbjct: 289 TYNRSHIKHI 298
>gi|384892614|ref|YP_005766707.1| hypothetical protein HPCU_03165 [Helicobacter pylori Cuz20]
gi|308061911|gb|ADO03799.1| hypothetical protein HPCU_03165 [Helicobacter pylori Cuz20]
Length = 337
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 130/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DV I F K RW I + + +
Sbjct: 27 SVHASALISLPNDNLLSAYFSGTKEGARDVTISANLFDSKINRWSEAFIILTKEELSYYS 86
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 87 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 145 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N N ++ +
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325
>gi|308182736|ref|YP_003926863.1| hypothetical protein HPPC_02875 [Helicobacter pylori PeCan4]
gi|308064921|gb|ADO06813.1| hypothetical protein HPPC_02875 [Helicobacter pylori PeCan4]
Length = 310
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 132/313 (42%), Gaps = 40/313 (12%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV---- 79
HAS ++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 2 HASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSCYSHE 61
Query: 80 ---PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP-- 129
+ NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 62 YIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSPFL 119
Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G G + ++ + + + D + R+ + PN
Sbjct: 120 NLSHLIRNKPLNTTDG----GFMLPLYHELATQYPLLLKFDKHNNPRE---LLRPNALNH 172
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 173 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 228
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + + LSK + + L + + A E SYP+ + +
Sbjct: 229 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKILKILDK-ADEVSYPSY-SLNSHFID 285
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 286 IVYTYNRSHIKHI 298
>gi|237751695|ref|ZP_04582175.1| neuraminidase [Helicobacter bilis ATCC 43879]
gi|229373061|gb|EEO23452.1| neuraminidase [Helicobacter bilis ATCC 43879]
Length = 398
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 127/329 (38%), Gaps = 45/329 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD-----GRWQSPIIADEEPNV- 79
S HAS IV D + ++ G+ EGA DVKI+ Q+F D +W P++ +
Sbjct: 53 SAHASAIVAKD-SKLMTLFYAGTREGAFDVKIY-QSFLDTKTLDSKWSEPLVLLTPERLT 110
Query: 80 --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
+ NP++FK + + LF +G + W+ + + +
Sbjct: 111 SLSGRYVKKLGNPIVFKDSRDRVHLFV-VGVSLGGWATSRIYQFYFSDDLATIHYVRELH 169
Query: 132 LGPSKNKPIL-------LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
L P N L LENG + S E W+ + + D + I N
Sbjct: 170 LNPFGNYSHLVRTPALPLENGGFILAFSHEMWHKFP--LIAFFDENGNMLTTKAI---NH 224
Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG-GLTWSYAKPTQLLNPNSGIDG 243
+ +QP + R+ N ++C+ TW + + + N +S I
Sbjct: 225 LKAQLQPSIIALNEKECAAFYRTNNRYNTNTYYQTCENYAKTWHDIETSNMKNYDSSILL 284
Query: 244 VKLKDGRLLLAYNTVSRGVLKVALSKDDGDS---WHDALTLEENLAM-----------EF 289
VKL D +L+ + + + K D S ++ A ++ E
Sbjct: 285 VKLGDEVVLITNDGKHNPIWQNDTKKGDRQSLSLYYLADKIKGQFVYIDTIDTAKKGGEV 344
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
SYPA I + +HITYTYNR IKH L
Sbjct: 345 SYPAAI-LDNNLLHITYTYNRKTIKHTAL 372
>gi|207091785|ref|ZP_03239572.1| hypothetical protein HpylHP_01328 [Helicobacter pylori
HPKX_438_AG0C1]
Length = 372
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 62 SVHASALISLPNDNLLNAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLEKSNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420404615|ref|ZP_14903795.1| hypothetical protein HPCPY6271_0106 [Helicobacter pylori CPY6271]
gi|393024485|gb|EJB25595.1| hypothetical protein HPCPY6271_0106 [Helicobacter pylori CPY6271]
Length = 372
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 40/315 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS ++ + + L AYF G+ EGA DVKI F K RW + + + +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+QP PF LM N M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPFKDC-----ALMAFRNYSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLI 289
Query: 245 KLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ +
Sbjct: 290 NL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHF 345
Query: 302 VHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 346 IDIVYTYNRSHIKHI 360
>gi|412988631|emb|CCO17967.1| sialidase [Bathycoccus prasinos]
Length = 516
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 119/293 (40%), Gaps = 65/293 (22%)
Query: 60 QTFKD-GRWQSPIIADEEPNVPMWNPVLFKLP--SNGLLLFYKIGQEVQKWSGCMKR--- 113
+ F D G + P A E N +W PVLF P ++ + LFY + +GC R
Sbjct: 192 KIFADVGSYSVPTDAGE--NGAVWGPVLFANPKKNHEIWLFYS----QSRTNGCGSRVPG 245
Query: 114 -SYNKGV------------TWSEREQL------PPGILGPSKNKPILLENG--------- 145
Y G W+ E++ GI + NKPI+L
Sbjct: 246 MDYAPGGDILAKKFNVVSGAWTHTEKIVLSQDFDDGIPKVTANKPIVLRRSGRWVLPYWN 305
Query: 146 --LLLCGSSVESWN------SWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTA 197
+L SV G+ + ++ D G SW I N L V T
Sbjct: 306 ERAMLESPSVNPEKCSKLNGKEGAGVLLSDDHGESWYPSETIRGRNTWLIENAVVELETD 365
Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL-NPNSGIDGVKLK-DGRLLLAY 255
+ L V G V SES DGG TW+ A + + NPNS ID ++L+ G L LA+
Sbjct: 366 DHLLMVFR---TKAGYVFRSESLDGGQTWTDASAMKDVPNPNSKIDMIRLQPSGLLALAF 422
Query: 256 N----------TVSRGVLKVALSKDDGDSWHDALTLEENL--AMEFSYPAVIQ 296
N + R LKVA+S +DG++W LE ++ ++ YP ++Q
Sbjct: 423 NDHAKHPAKGCSRCRTFLKVAVSSNDGNTWTRIRELEMDVDGSLRSHYPTLLQ 475
>gi|260220195|emb|CBA27491.1| hypothetical protein Csp_A02540 [Curvibacter putative symbiont of
Hydra magnipapillata]
Length = 348
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 55/113 (48%), Gaps = 3/113 (2%)
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV 262
LMR G+V + DGG W L NP+S + L ++LLA+N G
Sbjct: 226 ALMRDERHNGKVTAVSTEDGGAHWQDLPDLALGNPDSAVAAFGLGPQQMLLAHNPTPEGR 285
Query: 263 LKVALSK-DDGDSWHDALTLEE-NLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
K+ LS+ DG W TL + + EFSYPA+ A D ++ I YT +R +
Sbjct: 286 AKLELSRSQDGLQWTPQQTLRQGSEEAEFSYPAMAWADD-ALWIAYTVDRATL 337
>gi|149175559|ref|ZP_01854179.1| probable sialidase [Planctomyces maris DSM 8797]
gi|148845544|gb|EDL59887.1| probable sialidase [Planctomyces maris DSM 8797]
Length = 371
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 135/304 (44%), Gaps = 37/304 (12%)
Query: 32 IVEVDKGHFLVAY---FGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV- 85
++ +D G LV Y +G + ++ DG W P E N+ N +
Sbjct: 53 LLTLDNGRLLVVYTQWYGATGNDHDPARLVEIHSDDGGKTWSEPQTVQE--NIGKMNVMS 110
Query: 86 --LFKLPSNGLLLFYKIGQEVQKWSGC-MKRSYNKGVTWSEREQLP---PGILGPSKNKP 139
L K S +LL Y I + +++ K S ++G TWSE +QL G++ +
Sbjct: 111 ASLVKTTSGKILLTY-IRIDSNRFANLWFKESTDEGNTWSEPKQLSHGKKGLIFTVNSAA 169
Query: 140 ILLENGLLLCGS--SVESWNS---WGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPVP 193
I L++G +L + S +W + S+ T D G++W R + P ++P
Sbjct: 170 IRLKSGRILLAAFGSPSAWQKDEHFVSFSFYTDDEGKTWHRSANEVDCPLRG--AMEPEI 227
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRL 251
+ + +L+R+ R+ + S DGG TWS A+ T +++P + + K+ L
Sbjct: 228 EQLNDGRILMLIRT--QTTRMYRAYSSDGGETWSPAEKTDIVHPEAPMLLQRAPGKNAPL 285
Query: 252 LLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+L +N R L + LS D+G +W + +E++ +SY A + + S+
Sbjct: 286 ILIWNNAVVPGADHQGPRTPLTLGLSYDEGKTWENLTKIEDDPKGSYSY-ASMDFRNDSL 344
Query: 303 HITY 306
H+ Y
Sbjct: 345 HLVY 348
>gi|384890956|ref|YP_005765089.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 908]
gi|385223630|ref|YP_005783556.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 2017]
gi|385231481|ref|YP_005791400.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 2018]
gi|307637265|gb|ADN79715.1| BNR/Asp-box repeat protein [Helicobacter pylori 908]
gi|325995858|gb|ADZ51263.1| BNR/Asp-box repeat protein [Helicobacter pylori 2018]
gi|325997452|gb|ADZ49660.1| BNR/Asp-box repeat protein [Helicobacter pylori 2017]
Length = 372
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K W I + + +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNCWSEAFILLTKEELSHYS 121
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L N +LLF +G + W+ + ++++ L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFAQKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + + E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N + + LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420450232|ref|ZP_14949096.1| hypothetical protein HPHPH45_0633 [Helicobacter pylori Hp H-45]
gi|393068091|gb|EJB68896.1| hypothetical protein HPHPH45_0633 [Helicobacter pylori Hp H-45]
Length = 310
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 132/315 (41%), Gaps = 42/315 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW-- 82
HAS+++ + + L YF G+ EGA DVKI F K RW I + + +
Sbjct: 1 MHASSLISLPNDNLLSTYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHYSH 60
Query: 83 -------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP- 129
NP+LF L + +LLF +G + W+ + + + L P
Sbjct: 61 EYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSPF 118
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNESL 186
+ KNKP+ +G G + ++ + + + D + R+ + PN
Sbjct: 119 LNLSHLIKNKPLSTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNALN 171
Query: 187 SVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 172 HQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLI 227
Query: 245 KLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
L + L L +N +S ++ LSK ++ +S+ L++ A E SYP+ +
Sbjct: 228 NLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPHF 283
Query: 302 VHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 284 IDIVYTYNRSHIKHI 298
>gi|336453827|ref|YP_004608293.1| BNR/Asp-box repeat protein [Helicobacter bizzozeronii CIII-1]
gi|335333854|emb|CCB80581.1| BNR/Asp-box repeat protein [Helicobacter bizzozeronii CIII-1]
Length = 381
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 135/343 (39%), Gaps = 72/343 (20%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPII---------- 72
KS HA+T+ +D + + AYFGGS EG DVKI+ L K W +
Sbjct: 53 KSMHAATLSVLDTDYLMGAYFGGSAEGVGDVKIYAHLYNLKTQEWGQALPLLSAPLLSQK 112
Query: 73 ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWS--------GCMKR-----SYNKGV 119
A E + + NPVL+ ++ L + +G + W+ +KR +N+
Sbjct: 113 AHEFVKI-LGNPVLYA--THNTLYLFVVGASLGGWATSKIYLLCAPLKRVLEQLQHNQSP 169
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS---WRKY 176
+ LP L P N L+ L S + G + + + + K+
Sbjct: 170 QLDFIQTLP---LSPFLNISHLVRTSPL-------STDDGGFVLPIYQELAKKTPLLLKF 219
Query: 177 GP------IYIPNESLSVIQP--VPF-HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
P + PN +QP P+ H A R R F+ +++C L W
Sbjct: 220 DPHARLESVIQPNLLKQQLQPSLAPYQHCALMAFRSYKR-FD-----LYTQTCQTPLKWD 273
Query: 228 YAKPTQLLNPNSGIDGVKL--KDGRLLLAYNTV------SRGVLKVA--LSKDDGDSWHD 277
P QL N + D + L DG L L YN SR L +A D S+
Sbjct: 274 ---PPQLSNLKNYDDSLNLFNVDGTLYLIYNAPLQEGYNSRSTLMLAQFAPTPDKPSYFK 330
Query: 278 ALTLEENLAM--EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L + + E SYPA + D VH+ YT +R I+H++
Sbjct: 331 PLCILDFTQQGGEVSYPATLIFKD-QVHVLYTKDRQHIQHIIF 372
>gi|217033525|ref|ZP_03438954.1| hypothetical protein HP9810_905g44 [Helicobacter pylori 98-10]
gi|216944050|gb|EEC23481.1| hypothetical protein HP9810_905g44 [Helicobacter pylori 98-10]
Length = 347
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
S HAS ++ + L AYF G+ EGA DVKI F K W I+ EE +
Sbjct: 37 SVHASALISLPNDKLLSAYFSGTKEGARDVKISANLFDSKINCWSEAFVILTKEELSYYS 96
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ L P
Sbjct: 97 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 154
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 155 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLKF--DKHNNPRE---LLRPNALNH 209
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + + M E+C W T L N N ++ +
Sbjct: 210 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265
Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 266 L-NQELYLIHNPSDSSLHRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 321
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335
>gi|403378213|ref|ZP_10920270.1| hypothetical protein PJC66_00095 [Paenibacillus sp. JC66]
Length = 670
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 129/300 (43%), Gaps = 33/300 (11%)
Query: 34 EVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPS 91
++ G LV Y+ + D +I + D W P I + P +P + L S
Sbjct: 53 QLPNGKLLVVYYNATAHQGTDGRIAMVESTDHGLTWSEPRIIMDTPEDDR-DPSITVL-S 110
Query: 92 NGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-LENGLLLCG 150
+G LL +E K + RS ++G+TWS ++ + + + I+ L+ G LL
Sbjct: 111 DGTLLLNWFMRENGKNQVFVSRSTDEGLTWSPPSKVQTNLEEAAVSSKIVELDTGELLLP 170
Query: 151 SSVESWNSWGSWMEVTV---DAGRSWRKYG--------PIYIPNE--SLSVIQPVPFHTA 197
+ G + V V D G +W P+ PN ++ ++PV
Sbjct: 171 IYGFYASEQGVYRSVVVRSSDGGMTWDPSQESTMAEPVPLPQPNPFPAIGFVEPVIADLG 230
Query: 198 NRTLRVLMRSF----NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
+ L + RS+ NGIG MSES D G TW+ A+ T + S D + L +G +L
Sbjct: 231 SGHLISMHRSYSGSDNGIG--WMSESFDYGHTWTPAERTTMYAHCS--DLLVLSNGTVLH 286
Query: 254 AYN------TVSRGVL-KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
+ + R V+ +V D + W DAL + + + +YP+ ++ DG + I +
Sbjct: 287 TWGDRKFEFSSGRAVVGRVMEPGTDWEDWEDALIYKSPGSTDMAYPSAVELPDGRIFIVF 346
>gi|403382038|ref|ZP_10924095.1| hypothetical protein PJC66_19676 [Paenibacillus sp. JC66]
Length = 359
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 96/208 (46%), Gaps = 16/208 (7%)
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS 172
RS N G W E Q + + P+ L++G LL + E+ N + + D G S
Sbjct: 138 RSTNGGSDW-EAPQPAANVGQAVRGVPVQLKDGSLLVSTYGEADNRGRVIISKSEDLGAS 196
Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSF-----NGIGR---VCMSESCDGGL 224
W + + NE+ +P + T + + L+RS + GR + SES DGGL
Sbjct: 197 WTRVAQL--ENENYHFYEPFLYVTPSGKIVALLRSNSLHGPDASGRSHPLFTSESLDGGL 254
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSWHDALTL-- 281
+WS ++ +P S ++L+ G++L+ Y + ++ L + + W D +
Sbjct: 255 SWSDPVMRRVFSP-SPFHALRLQSGKVLINYGYRYQPAGVRAFLLDSECEGWDDIEEIVL 313
Query: 282 -EENLAMEFSYPAVIQASDGSVHITYTY 308
++ +A + Y + +Q ++G + I Y +
Sbjct: 314 RDDGIAGDLGYTSAVQLANGDILIAYYF 341
>gi|420412153|ref|ZP_14911282.1| hypothetical protein HPNQ4228_1044 [Helicobacter pylori NQ4228]
gi|393027811|gb|EJB28899.1| hypothetical protein HPNQ4228_1044 [Helicobacter pylori NQ4228]
Length = 372
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSKAFTILTKEDLSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNILNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
L + L L +N + + LSK ++ +S+ L++ A E SY + + +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYSSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|373852611|ref|ZP_09595411.1| hypothetical protein Opit5DRAFT_3465 [Opitutaceae bacterium TAV5]
gi|372474840|gb|EHP34850.1| hypothetical protein Opit5DRAFT_3465 [Opitutaceae bacterium TAV5]
Length = 468
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 23/168 (13%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
V+ D GR+W + G + PN + + + ++ + R+ G+ S S DGG
Sbjct: 238 VSSDQGRTWERRGGVAFPNPRFD--EHMFIERRDGSIWMTARTRTGL---WESTSTDGGR 292
Query: 225 TWS--YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------RGVLKVALSKD 270
TWS A P + + I +L+ G ++L + V R L LS D
Sbjct: 293 TWSKPTASPVRTASARHFI--RRLQSGNIVLVKHGVEIDKAPMSKGEPFRSHLTAFLSDD 350
Query: 271 DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
DG +W L L+E SYP QA DG++++++ +R +++
Sbjct: 351 DGHTWKGGLLLDER--RRVSYPDGFQAPDGTIYVSWDRDRNGKGEILM 396
>gi|412985815|emb|CCO17015.1| probable sialidase [Bathycoccus prasinos]
Length = 635
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 76/163 (46%), Gaps = 24/163 (14%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
V+ + G++W+ G I N SL + + R ++V +G + S S D G
Sbjct: 449 VSENRGKTWQPRGQITQSNTSL-IEGSIAELNDGRIIQVFR---TRVGCLYKSYSKDEGK 504
Query: 225 TWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYNTVS---------------RGVLKVALS 268
+W+ +P + NPNS + ++L+ +G LLLA+N + R L VA+S
Sbjct: 505 SWTEPEPMPVPNPNSKVFLMRLEPNGELLLAFNNMKNQYRSRRGATKCRACRTHLHVAIS 564
Query: 269 KD-DGDSWHDALTLEENLAME---FSYPAVIQASDGSVHITYT 307
+D DG+ W LE+ + E YP + Q SV + YT
Sbjct: 565 RDGDGNEWETIARLEDEIGYEAPRIHYPYMAQKDASSVLVAYT 607
>gi|225155980|ref|ZP_03724464.1| hypothetical protein ObacDRAFT_8593 [Diplosphaera colitermitum
TAV2]
gi|224803333|gb|EEG21572.1| hypothetical protein ObacDRAFT_8593 [Diplosphaera colitermitum
TAV2]
Length = 475
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 78/166 (46%), Gaps = 19/166 (11%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
V+ D G++W + G + P + + + ++ + R+ NG+ S S DGG
Sbjct: 262 VSSDQGKTWTRRGGVKFPGPRFD--EHMFIERRDGSIWMTARTRNGL---WESTSADGGR 316
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------------TVSRGVLKVALSKDDG 272
TW+ K + + +S +L+ G +LL + +SR L LS+DDG
Sbjct: 317 TWARPKESPIKTASSRHFIRRLQSGNILLIKHGGKIHEAPIWQGRLSRSRLAAFLSEDDG 376
Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+W L L++ + SYP QA DG+++I++ + R +++
Sbjct: 377 LTWTGGLMLDKRGGV--SYPDGFQAPDGTLYISWDHQRNTEGEILM 420
>gi|420509188|ref|ZP_15007690.1| hypothetical protein HPHPH24C_0661 [Helicobacter pylori Hp H-24c]
gi|420538012|ref|ZP_15036342.1| hypothetical protein HPHPM4_0783 [Helicobacter pylori Hp M4]
gi|420539835|ref|ZP_15038152.1| hypothetical protein HPHPM5_0913 [Helicobacter pylori Hp M5]
gi|393120614|gb|EJC21103.1| hypothetical protein HPHPH24C_0661 [Helicobacter pylori Hp H-24c]
gi|393145567|gb|EJC45898.1| hypothetical protein HPHPM4_0783 [Helicobacter pylori Hp M4]
gi|393146052|gb|EJC46381.1| hypothetical protein HPHPM5_0913 [Helicobacter pylori Hp M5]
Length = 302
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 125/300 (41%), Gaps = 38/300 (12%)
Query: 40 FLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW---------NPVLFK 88
L AYF G+ EGA DVKI F K RW I + + + NP+LF
Sbjct: 6 LLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHYSHEYIKKLGNPLLF- 64
Query: 89 LPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP--GILGPSKNKPIL 141
L N +LLF +G + W+ + + ++ + L P + KNKP+
Sbjct: 65 LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSPFLNLSHLIKNKPLN 123
Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANR 199
+G + E + + + D + R+ + PN +QP PF +
Sbjct: 124 TTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNHQLQPSLTPF----K 174
Query: 200 TLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--T 257
V+ + M E+C W T L N N ++ + L + L L +N
Sbjct: 175 DCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLINLNE-ELYLIHNPSD 233
Query: 258 VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
+S ++ LSK ++ +S+ L++ A E SYP+ + + I YTYNR+ IKH+
Sbjct: 234 LSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIVYTYNRSHIKHI 290
>gi|196229021|ref|ZP_03127887.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
flavus Ellin428]
gi|196227302|gb|EDY21806.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
flavus Ellin428]
Length = 441
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 145/349 (41%), Gaps = 45/349 (12%)
Query: 12 AEEFTFPANSAPFKSC---HASTIVEVDKGHFLVAY---FGGSCEGAPDVKIWLQTFKDG 65
AEE T N P K + V + G L Y +GG+ +G+ + + G
Sbjct: 90 AEENTVVLNIEPTKEHPRNSEGSFVRLKSGRILFYYSQFYGGAEDGSAARIAAIHSDDSG 149
Query: 66 R-WQSP-IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
R W +P ++ + + + L +L S L FY + ++ S ++ TWSE
Sbjct: 150 RTWSAPEVVLENTAGNNVMSVTLLRLASGRLAFFYILKNSWLDARPYVRFSTDEAATWSE 209
Query: 124 REQL--PPGILGPSKNKPILLENGLLLC-----GSSVESWNSWGSWMEVTVD-------A 169
+ + PG + ++ + L +G L+ S ++ +S S+ +D
Sbjct: 210 PKLVVAAPGYFVLNNDRVVQLSSGRLVAPVAFHRSRLDDPHSAKSFDGRAIDLWYYSDDE 269
Query: 170 GRSWRKYGPIY-IPNESLSVIQ-PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
G++W++ + IP S S +Q P A+ L R+ G+ S D G W+
Sbjct: 270 GQTWQQSWSWWGIPVNSNSGLQEPGVVELADGQLFSWARTDQGVQYGF--SSADHGKDWT 327
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLA-YNTVS---------RGVLKVALSKDDGDSWHD 277
PT+L +P S +L D LLA YN S R L A+S D+G +W
Sbjct: 328 APAPTELKSPTSPASIKRLPDSSNLLAIYNDHSGRFPFPPKRRAPLVAAISTDNGLTWPK 387
Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITYT--------YNRTQIKHVVL 318
A +EE+L + Y A+ D +V + Y NR +I+ + L
Sbjct: 388 AKLIEEDLNGWYCYTAMYFVDD-AVLLAYCAGDDKVGGLNRLRIRRITL 435
>gi|171909757|ref|ZP_02925227.1| probable sialidase [Verrucomicrobium spinosum DSM 4136]
Length = 385
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 125/315 (39%), Gaps = 39/315 (12%)
Query: 40 FLVAYFGGSCEGAPDVKIWLQTFKDGR-WQSPI-IADEEPNVPMWNPVLFKLPSNGLLLF 97
+ ++GG+ + +P + + + DGR W P+ I + + + L +L S L F
Sbjct: 68 YYTQFYGGAADESPARIVGIHSDDDGRTWSQPVTIVENNAGKNIMSVSLLRLASGKLAFF 127
Query: 98 YKIGQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLC------ 149
Y + Q M S + G TWS + PG + ++ I + G L+
Sbjct: 128 YCLKNSWQDCRPQMCLSSDDGATWSSPRNILEAPGYFVLNNDRVIQTKAGRLIAPLAFHR 187
Query: 150 --GSSVESWNSWGS-----WMEVTVDAGRSWRKYGPIY-IPNESLSVIQ-PVPFHTANRT 200
S S S+ + W+ + D G SW + + IP S S +Q P A+ +
Sbjct: 188 SRNSDPHSSKSFDARAITMWL-YSDDEGASWHEAATWWAIPVRSGSGLQEPGVVELADGS 246
Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA-YNTVS 259
L R+ G S+ DGG W+ +PT L +P S L LLA +N S
Sbjct: 247 LFSWSRTDQGAQYGFTSK--DGGTNWTAPEPTALKSPTSPASIKTLPGTTHLLAIFNDHS 304
Query: 260 RGV---------LKVALSKDDGDSWHDALTLEENL-------AMEFSYPAVIQASDGSVH 303
G L A+S D G +W LE+N A+ F+ AV+
Sbjct: 305 GGFPFPQGKRTPLVAAVSSDGGKTWPKVKLLEDNPDGWYCYNAIHFTKDAVLLGYCAGDK 364
Query: 304 ITYTYNRTQIKHVVL 318
NR +I+ + +
Sbjct: 365 TVGGLNRLRIRRITM 379
>gi|257484992|ref|ZP_05639033.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
tabaci str. ATCC 11528]
Length = 56
Score = 54.3 bits (129), Expect = 8e-05, Method: Composition-based stats.
Identities = 22/37 (59%), Positives = 30/37 (81%)
Query: 282 EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+E L EFSYP++IQA+DGS+H+ +TY R +IKHV L
Sbjct: 14 QEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVHL 50
>gi|196232313|ref|ZP_03131167.1| hypothetical protein CfE428DRAFT_4333 [Chthoniobacter flavus
Ellin428]
gi|196223681|gb|EDY18197.1| hypothetical protein CfE428DRAFT_4333 [Chthoniobacter flavus
Ellin428]
Length = 387
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 89/209 (42%), Gaps = 20/209 (9%)
Query: 112 KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
+RS N G TW + P +G + + PI L++G LL +E + D GR
Sbjct: 164 RRSVNDGKTWEK----PVKTIGSAPHGPIQLQDGRLLYVGLSTHEKVLKLGVEESRDDGR 219
Query: 172 SWRKYGPIYIPNESLS--------VIQPVPFHTANRTLRVLMRSF---NGIGRVCMSESC 220
SW+ I + E +P + L ++R+ + + +ES
Sbjct: 220 SWQYLAEIPVAEEDQRDPDWRVRMQDEPHVVELPDGKLVAMIRAEPKDHSQTYLRQTEST 279
Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA-LSKDDGDSW--HD 277
DGG TW+ T + + + LK+G LL+ Y + A +SKD G +W
Sbjct: 280 DGGKTWTTTHKTPIYGYPPHL--LMLKNGWLLVVYGVRREPFGEHACISKDGGKTWDVEH 337
Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITY 306
+TL + YP+ +Q +DGS++ Y
Sbjct: 338 EITLRTAPNSDLGYPSSVQLADGSIYTVY 366
>gi|149176442|ref|ZP_01855056.1| probable sialidase [Planctomyces maris DSM 8797]
gi|148844794|gb|EDL59143.1| probable sialidase [Planctomyces maris DSM 8797]
Length = 394
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 106/241 (43%), Gaps = 28/241 (11%)
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP----GILGPSKNKPIL 141
L +L + LFY + M+ S ++G WSE ++ P G + ++ I
Sbjct: 124 LLRLQDGRIALFYARKNSLHDLLPVMRLSVDEGAHWSEPVEIIPQKEVGYYVLNNDRVIQ 183
Query: 142 LENGLLLCGSSV-------ESWNSWGSWM-EVTVDAGRSWRKYGPIYIPNE----SLSVI 189
L+NG ++ ++ + +N ++ + D G++W++ + + + +
Sbjct: 184 LKNGRIVVPLALHQNLPGSDRFNPNAKFLCYYSDDQGKTWQRGTEVVVETQPGGKQPYMQ 243
Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KD 248
+P + TL R+ G V S+ DGG T+S KP+Q+++P S ++
Sbjct: 244 EPGLVELKDGTLMGFCRTNGGSQYVAFSQ--DGGQTFSELKPSQIISPVSPASIERIPST 301
Query: 249 GRLLLAYN---------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
G LLL +N + R L +A+SKD+G +W +E N + Y A+ D
Sbjct: 302 GDLLLVWNNHQGIDAKLSGKRTPLTLAISKDEGKTWQQIQNIETNPNGWYCYTAIEFTKD 361
Query: 300 G 300
G
Sbjct: 362 G 362
>gi|288925246|ref|ZP_06419181.1| putative alpha-rhamnosidase [Prevotella buccae D17]
gi|288338011|gb|EFC76362.1| putative alpha-rhamnosidase [Prevotella buccae D17]
Length = 1043
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 41/69 (59%), Gaps = 4/69 (5%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
+V +EF + SA F HASTI ++ G + AYFGG+ E P+V IW+ G R
Sbjct: 918 AIVEDEFLY--ESAGFPQAHASTITQLKNGDLVAAYFGGTWERNPNVCIWVSRKPKGSDR 975
Query: 67 WQSPIIADE 75
W++P++A +
Sbjct: 976 WEAPVLAGD 984
>gi|373851343|ref|ZP_09594143.1| hypothetical protein Opit5DRAFT_2197 [Opitutaceae bacterium TAV5]
gi|372473572|gb|EHP33582.1| hypothetical protein Opit5DRAFT_2197 [Opitutaceae bacterium TAV5]
Length = 431
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/212 (26%), Positives = 85/212 (40%), Gaps = 32/212 (15%)
Query: 133 GPSKNKPILLENGLLLCGSSV-------------------ESWNSWGSWMEVTVDAGRSW 173
G S NKPI+L +G L +S+ E N G+ + + D G++W
Sbjct: 168 GCSLNKPIVLSSGDWLLPASLWDRNKLRHPTSAPFIEAFHELDNERGAGVLASTDQGKTW 227
Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
+ G + IP + + V R V + + + G + S S D G TW T
Sbjct: 228 ARRGGVRIPRPNFDEVNLV----ERRDGSVWLTARSASGDIWQSTSSDRGATWVPPSETG 283
Query: 234 LLNPNS-------GIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLA 286
+LN ++ + L + T R L LS DDG +W L L+E
Sbjct: 284 ILNVSARHFVRRLRSGRLLLVRHGEQIGVRTEKRSHLAAFLSDDDGLTWRGGLLLDEREG 343
Query: 287 MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+ SYP Q DG + I+Y NR V+L
Sbjct: 344 V--SYPDGGQFDDGVIFISYDRNRDTDGEVLL 373
>gi|421721593|ref|ZP_16160868.1| BNR repeat-like domain protein [Helicobacter pylori R055a]
gi|407224928|gb|EKE94703.1| BNR repeat-like domain protein [Helicobacter pylori R055a]
Length = 372
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SVHASSLINLPNNNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N N ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMALRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDSLNLIN 290
Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L + L L +N +S ++ LSK + + L + + A E SYP+ + +
Sbjct: 291 LNE-ELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|67458778|ref|YP_246402.1| hypothetical protein RF_0386 [Rickettsia felis URRWXCal2]
gi|67004311|gb|AAY61237.1| unknown [Rickettsia felis URRWXCal2]
Length = 149
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 67/133 (50%), Gaps = 23/133 (17%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG---RWQ 68
F N++ K HA+TI D + ++FG S + + IW+ FK +W
Sbjct: 11 FILNNSDLKDVHAATIFAFDDKKLITSFFGKTRSESGKSEENNSIWISLGFKSNNHYQWT 70
Query: 69 SP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRS 114
+P I++ + + ++P+ NPV+ +N LL+F KIG + WSG + RS
Sbjct: 71 TPERIVSPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGILSRS 129
Query: 115 YNKGVTWSEREQL 127
Y++G+TW + E L
Sbjct: 130 YDQGLTWQKPELL 142
>gi|296124013|ref|YP_003631791.1| hypothetical protein Plim_3780 [Planctomyces limnophilus DSM 3776]
gi|296016353|gb|ADG69592.1| hypothetical protein Plim_3780 [Planctomyces limnophilus DSM 3776]
Length = 823
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 78/175 (44%), Gaps = 22/175 (12%)
Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
NG+L+ E+W +G+ T D W + + + + T++
Sbjct: 644 RNGVLISSDQGETWTEYGNIRLTTDDRYHGWAENNIVELDDG---------------TIK 688
Query: 203 VLMRSFNGIGRVCMSESCDGGLTW-SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-- 259
+++R G + M+ES DGG TW ++A+ T++ NP S KL+D + + +N S
Sbjct: 689 MIIRGDRLGGVLYMAESIDGGKTWPNFAQKTEIPNPGSKATIYKLRDNGVAMLHNPNSSH 748
Query: 260 RGVLKVALSKDDGDSW--HDALTLEENLAME--FSYPAVIQASDGSVHITYTYNR 310
R L + +S D +W L E + + +YP + DG +H Y NR
Sbjct: 749 RSPLSLWVSYDGMKTWPYRRVLVKESSDGSQGRLNYPDGFVSEDGYLHFAYDDNR 803
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 69/160 (43%), Gaps = 14/160 (8%)
Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
V+ D G++W G + P + + + TL +L R+ G+ S S D G
Sbjct: 238 VSQDQGKTWNYRGGVAFPGPQFD--EHMIIERKDGTLWMLARTTEGM---FESLSQDQGK 292
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDA 278
TWS + P +L G LLL + T R L LS D+G +W
Sbjct: 293 TWSQPEMAMPHIPTR-FHIRRLASGSLLLVRHGQVDERTKHRTRLMAFLSHDEGQTWGGG 351
Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L L++ + SYP QA DG ++I+Y R + + V L
Sbjct: 352 LMLDDRGGV--SYPDGFQAPDGRIYISYDRERAKEREVQL 389
>gi|255533465|ref|YP_003093837.1| hypothetical protein Phep_3584 [Pedobacter heparinus DSM 2366]
gi|255346449|gb|ACU05775.1| hypothetical protein Phep_3584 [Pedobacter heparinus DSM 2366]
Length = 583
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 98/227 (43%), Gaps = 40/227 (17%)
Query: 114 SYNKGVTWSEREQLPPGILGPS------KNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
+ N G WSE ++ G +G KNK LL + S + N+ + +
Sbjct: 163 TLNNGKDWSEPTEIYHGYVGSIRGFIQLKNKRFLLVFAKAIPARSQKPSNNSTDYGLHDI 222
Query: 168 DA------GRSWRKY-GPIYIPNESLSV-----IQPVPFHTANRTLRVLMRSFNGIGRVC 215
A G+SW+ + IP ES ++P ++ + +L+R+ G+
Sbjct: 223 GALYSDNMGKSWQSSENDLKIPIESTQTTRYGGVEPNIIELSDGKIWMLIRTNKGV--TY 280
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------RGVL 263
S S D G +W +PT+ ++ +S ++L D R+++ +N+ R L
Sbjct: 281 QSFSTDSGTSWHQPEPTKFISSDSPAATLRLADNRIVMFWNSNQRWDNKRTYALGGRETL 340
Query: 264 KVALSKDDGDSW---HDALTLEENLAME-----FSYPAVIQASDGSV 302
A+S D+G +W + LT ME +YP+ IQ++DG +
Sbjct: 341 HAAISSDEGKTWKGFREVLTSPSTKQMEKGDRGTAYPSAIQSADGKI 387
>gi|412985201|emb|CCO20226.1| BNR/Asp-box repeat protein [Bathycoccus prasinos]
Length = 602
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 72/156 (46%), Gaps = 17/156 (10%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ YG + P L I+ + ++ +L R+ IG + S S D G+TW+
Sbjct: 423 DNGKTWKAYGKLDHPRTWL--IENTLSERDDGSVLMLFRT--KIGVLYASASYDEGITWT 478
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------VSRGVLKVALSKDDGDSWHDA 278
+ T + NP+S I +L G L + N +R L VA+S D G +W
Sbjct: 479 PPESTAIPNPDSKIVLTRLHSGALAMVLNAHGKLKGQKRHARTFLDVAVSGDGGATWRRI 538
Query: 279 LTLEENLA---MEFSYPAVIQA-SDGSVHITYTYNR 310
LE+ + YP I+A +G + TY++
Sbjct: 539 ARLEDESQSPLVRSHYPTAIEAFREGGTRLLITYSK 574
>gi|343085110|ref|YP_004774405.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
gi|342353644|gb|AEL26174.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
marinum DSM 745]
Length = 371
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 124/291 (42%), Gaps = 32/291 (10%)
Query: 31 TIVEVDKGHFLVAYF---GGSCEGAPDVKIWLQTFKDG--RW--QSPIIADEEPNVPMWN 83
+ +++ G L AY GGS + A V I + +DG W + +I E + + +
Sbjct: 49 SFIQLKDGRILFAYSKFEGGSGDHASGV-IAGRYSEDGGNTWTGKDEVIISNEGGMNVMS 107
Query: 84 PVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKPI 140
+L + LFY + M+ S ++ TWS+ + G + ++ I
Sbjct: 108 VSFLRLIDGSIALFYARKNSLDDCIPYMRFSTDEAKTWSDPFPVITDKKGYFVLNNDRVI 167
Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTV------DAGRSWRKYGPIYIPNESLSVIQ-PVP 193
L G LL S+ M ++ D G++W + + +PN++ V+Q P
Sbjct: 168 QLPGGRLLAPVSLHKTAGVDFRMRGSIFCYYSDDNGKTW--HSGMEMPNKNRVVVQEPGV 225
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD-GRLL 252
N + + +R+ G+ + SE D GLTW P+ + +P + ++ + G L+
Sbjct: 226 VLLENGKILMWLRTDRGVQYLSKSE--DQGLTWEEVYPSDIASPRAPASLKRIPETGDLM 283
Query: 253 LAYN---------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
L +N R L A+SKD+G SW LE++ F Y A+
Sbjct: 284 LIWNNNKSLVKETAAYRTPLTAAISKDEGKSWTHLKVLEDDPDGFFCYTAI 334
>gi|83951271|ref|ZP_00960004.1| Glycosyl hydrolase, BNR repeat protein [Roseovarius nubinhibens
ISM]
gi|83839170|gb|EAP78466.1| Glycosyl hydrolase, BNR repeat protein [Roseovarius nubinhibens
ISM]
Length = 379
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 6/128 (4%)
Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCM--SESCDGGLTWSYAKPTQLLNPNSGIDG 243
L+ IQP+ R +R F+ + R C+ E+ DGG +WS + T + NP++ +
Sbjct: 228 LNPIQPMIVPLDGRRAVAFLRDFD-VKRGCLLRCETEDGGQSWSPVETTVIANPSAPVAA 286
Query: 244 VKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
+ L DGR+L+A N + L +++S D+G +W E + YP + ++G
Sbjct: 287 LPLGDGRILMAGNDDPDAPADLCLSISADEGRTWQRCRRFEGK-SGGLRYPMLRLLAEGE 345
Query: 302 VHITYTYN 309
+ + Y++
Sbjct: 346 IALCYSHG 353
>gi|385222020|ref|YP_005771153.1| neuraminidase (sialidase) [Helicobacter pylori SouthAfrica7]
gi|317010799|gb|ADU84546.1| neuraminidase (sialidase) [Helicobacter pylori SouthAfrica7]
Length = 332
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 134/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAST++ + G+ L AYF G+ EGA DVKI F K RW I + +
Sbjct: 22 SVHASTLISLPNGNLLSAYFSGTKEGARDVKISANLFDGKSNRWGEAFIILTKEKLSKNA 81
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGP 134
+ NP+LF L ++ +LLF +G + W+ + + + + + L P
Sbjct: 82 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIDFKFVQKLSLNP 139
Query: 135 SKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
N KP+ +G + E + ++ D + R+ + PN
Sbjct: 140 FLNLSHLVRTKPLNTTDGGFVLPIYHELATQYPLLLK--FDKHHNPRE---LLRPNALNH 194
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C TW PT L N + ++ +
Sbjct: 195 QLQPSLTPF----KDCAIMAFRNHSFKDHLMLETCKNPTTWQKVMPTNLKNLDDSLNLLN 250
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 251 LNGSLYLIHNPSDLSLRRKELLLSKLENINSFQTLKILDK--ANEVSYPSS-SLNPYFID 307
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 308 IVYTYNRSNIKHI 320
>gi|374850878|dbj|BAL53855.1| glycosyl hydrolase [uncultured Acidobacteria bacterium]
Length = 365
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 129/324 (39%), Gaps = 40/324 (12%)
Query: 20 NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIA---- 73
+A + HA ++ G L+ + G D +I +DG W +P+
Sbjct: 28 EAASPRERHAYPVIARTAGGQLLTVWTVMEAGERDGRIVGSFSRDGGRTWSAPVTLIDVP 87
Query: 74 ---DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE--QLP 128
D +PN+ + P + + ++ + + M S ++G +WS ++P
Sbjct: 88 GKLDADPNIIIDGPRILVYSTT-----VEVPNRIDRSEVWMTVSEDEGKSWSPPVLLRMP 142
Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNSWG------------SWMEVTVDAGRSWRKY 176
+ ++ + L +G L+ S + W G S + ++ D G +W
Sbjct: 143 FRYVVGKRHLGLKLRDGTLVMPVSWDLWAEAGTPARTEGEMDLKSGVLLSRDGGVTWTPS 202
Query: 177 GPIYI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
G ++I P V +P AN L +L+R+ G + + S DGG TW+
Sbjct: 203 GELHIFERKVHPGAIWGVDEPAIVELANGELYMLLRT--GTEWLYEARSPDGGRTWTPPA 260
Query: 231 PTQLLNPNSGIDGVKLKD--GRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLA 286
+ LL NS +L +++ +N R L VALS D G SW + +
Sbjct: 261 RSPLLGHNSPAALWRLDQHPQEIIVIWNNSPRHRYPLTVALSADGGRSWSRPKVVATSDG 320
Query: 287 MEFSYPAVIQASDGSVHITYTYNR 310
+ SYP + QA DG+ + R
Sbjct: 321 PQVSYPGITQAPDGTFVAVWQQQR 344
>gi|196232221|ref|ZP_03131075.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
flavus Ellin428]
gi|196223589|gb|EDY18105.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
flavus Ellin428]
Length = 397
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 94/225 (41%), Gaps = 34/225 (15%)
Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGS--------------SVESW 156
M RS + G+TWS R P I+ S + PI L +G LL S +
Sbjct: 181 MIRSNDGGITWSARY---PTIVN-SPHGPIQLSDGRLLYAGKQLWTPEKRVGVCVSNDDG 236
Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
+W E+ AG S +Y ++ V + + + NG G
Sbjct: 237 TAWQWLAEIPARAGDSVAQYHELHA----------VEAGEGRLLVHIRNENKNGHGETLQ 286
Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL--KVALSKDDGDS 274
+ES DGG TWS + S ++ +L DGRLL+ Y R L + +S D G +
Sbjct: 287 TESTDGGKTWSEPHSIGVWGLPSFLN--RLHDGRLLMTYGH-RRAPLGNQARVSTDAGRT 343
Query: 275 WHDALTLE-ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
W + L + + + + YP+ ++ DGS+ + + + + VL
Sbjct: 344 WSEPLIIYGDGPSGDLGYPSTVEFDDGSLLTVWYEHLKESPYAVL 388
>gi|408907236|emb|CCM11068.1| BNR/Asp-box repeat protein [Helicobacter heilmannii ASB1.4]
Length = 365
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 130/328 (39%), Gaps = 47/328 (14%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRW-QSPIIAD--- 74
+ P +S H++++ + K L AYFGGS E DV IW L + G+W Q+ + D
Sbjct: 47 TPPVQSVHSASLSALGKHTLLAAYFGGSKEAKSDVCIWGNLYNTQTGQWSQAFKLLDPRT 106
Query: 75 -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG----CMKRSYNK--GVTWSE 123
E + NPVL+ N L LF +G + W+ + NK + + +
Sbjct: 107 LSHQAHEFVKILGNPVLYT-HQNRLYLFV-VGASLGGWATSKIYVLSAPANKPQALHYEQ 164
Query: 124 REQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
L P I +N P +G + E +++ + +
Sbjct: 165 TLHLSPLLNISHLVRNPPTPTTDGGFVLPIYHELAKKSPVFLKFDSHTHLQF-----LLR 219
Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
PN +QP VP+ V RS+ ++SC W + KP LLN S
Sbjct: 220 PNSLKQQLQPSLVPY---QHCAFVAYRSYK--TERFYTQSCLD--FWHWQKP--LLNNLS 270
Query: 240 GIDGVKL---KDGRLLLAYNTV------SRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
D +G L L YN +RG L++A + ++ L+ + E S
Sbjct: 271 NHDSANTLFNANGVLYLIYNQALPHHKNARGALRLARLEPKTGAFMPLKILDRSQKGEVS 330
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
YP V+ D VH+ YT R I H+V
Sbjct: 331 YPFVM-LFDNQVHVLYTKERKAIAHMVF 357
>gi|83747403|ref|ZP_00944443.1| Hypothetical Protein RRSL_02925 [Ralstonia solanacearum UW551]
gi|83725990|gb|EAP73128.1| Hypothetical Protein RRSL_02925 [Ralstonia solanacearum UW551]
Length = 81
Score = 51.2 bits (121), Expect = 7e-04, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 40/69 (57%), Gaps = 12/69 (17%)
Query: 260 RGVLKVALSKDDGDSWH--------DALTLEENLA----MEFSYPAVIQASDGSVHITYT 307
R L +A+S+D G +W D + N A E+SYP+++Q +DG++HI +T
Sbjct: 4 RAPLSLAISEDGGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFT 63
Query: 308 YNRTQIKHV 316
Y R +IK+V
Sbjct: 64 YFRQRIKYV 72
>gi|430741382|ref|YP_007200511.1| BNR/Asp-box repeat-containing protein [Singulisphaera acidiphila
DSM 18658]
gi|430013102|gb|AGA24816.1| BNR/Asp-box repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 642
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 137/328 (41%), Gaps = 36/328 (10%)
Query: 17 FPANSAPFKSCHASTIVEVDKGHFLVA---YFGGSCEGAPDVKIWLQTFKDGR-WQSPII 72
FPA + + S +V+ D G + A + G + A I + + GR W +
Sbjct: 304 FPATAQNPRYSEGSVLVQRD-GSLIYATTEFQGSGSDFAKARIIAVDSTDQGRTWSERRV 362
Query: 73 ADEEPNVPMWNPV---LFKLPSNG-----LLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
E NV N + L +L +N + LFY + ++ S ++ T+ E
Sbjct: 363 LQE--NVGQQNVMSVTLRRLKANNPFDGPIGLFYLVKNSPTDLDVYLRISNDEAKTFGEP 420
Query: 125 EQLP--PGILGPSKNKPILLENGLLLC--GSSVESWNS--WGSWMEVTVDAGRSW-RKYG 177
+ PG + ++ +L G +L ++ ++ S + S + D G +W R G
Sbjct: 421 RLVTTDPGYHVLNNDRVTVLSTGRILVPIATTEDAVKSPKYASSCYFSDDQGETWKRSRG 480
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
I+ P ++P A+ R+LM+ + + +SES DGG +WS AK ++ P
Sbjct: 481 LIHYPGPG--AMEPEVLEQADG--RLLMQIRTKVDHIAVSESTDGGESWSEAKSWEIAGP 536
Query: 238 NSGIDGVKL-KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAME 288
S ++ G LLL +N R L ALS D+G +W LE +
Sbjct: 537 ESPATLRRIPSTGNLLLIWNNTLRDGKVSSVRTPLTSALSADEGRTWSFHRNLENDPTKV 596
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHV 316
++Y +V+ G +TY T+ +
Sbjct: 597 YAYTSVV-FDRGRALLTYYVGDTKTGRI 623
>gi|412992711|emb|CCO18691.1| Putative Glycosyl hydrolase, BNR repeat [Bathycoccus prasinos]
Length = 583
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 60/117 (51%), Gaps = 16/117 (13%)
Query: 212 GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------VSRG 261
G + +S S D G +W T L NPNS + V L +G +++ +N V+RG
Sbjct: 442 GAIYVSHSADNGESWDTPHATPLQNPNSKVSAVLLPNGVIVVCHNAHAFMKHPRIGVTRG 501
Query: 262 VLKVALSKDDGDSWHDA-LTLEENL--AMEFSYPAVIQASDG-SVHITYTYNRTQIK 314
L ++ S DDG +W +A L +E++L YP ++ DG + I+Y+ + +K
Sbjct: 502 KLALSTSSDDGRTWKEAVLFIEDSLEPGKHVHYPTMVL--DGCDLRISYSVSGEGVK 556
>gi|315452521|ref|YP_004072791.1| neuraminidase [Helicobacter felis ATCC 49179]
gi|315131573|emb|CBY82201.1| neuraminidase [Helicobacter felis ATCC 49179]
Length = 358
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 127/338 (37%), Gaps = 54/338 (15%)
Query: 18 PANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADE 75
P P +S H++ + + GH LV YFGGS EG PDV I+ L K W SP
Sbjct: 30 PIPPPPVQSVHSANLGALPSGHLLVVYFGGSEEGLPDVCIYGNLYDPKTKVW-SPAFKLL 88
Query: 76 EPNV----------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
P + NPVL L + + + +G + W+ R Y
Sbjct: 89 SPTQLSTLSKSYIETLGNPVLAIL--DHKVYLFVVGVSLGGWAT--SRIYTLQADLPSLL 144
Query: 126 QLPPGI-------LGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
P + L P +N P+ E+G G + ++ + + +
Sbjct: 145 TQKPHLHFISTLSLSPLLNISHLVRNAPLPTEDG----GFILPIYHELATKFPLLLKFDS 200
Query: 172 SWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
R I PN S +QP VP+ LM + +++C L W+
Sbjct: 201 KARLQRVIK-PNHLTSQLQPTLVPYQQC-----ALMAFRSSKDTHLYTQTCKDLLHWNPP 254
Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALT- 280
P+ L N + + + L YN S L S+ D L
Sbjct: 255 IPSNLHNYDDALALFN-ANHETYLVYNAPLPNASSSRSSLRLARFAPTSKRSSYFDPLAI 313
Query: 281 LEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
L+++L E SYP+V+ D ++H+ YT +R I H+ L
Sbjct: 314 LDQSLEGEVSYPSVLVLQD-TIHLVYTRHRRTIMHLTL 350
>gi|301385787|ref|ZP_07234205.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato Max13]
Length = 289
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 101/284 (35%), Gaps = 69/284 (24%)
Query: 94 LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
L LFY + V W+G S + G +WS QL + P N P+
Sbjct: 2 LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 57
Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
+G + E + ++ ++ D G K+ N + P+ A LR
Sbjct: 58 DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 116
Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
+F+ RV S + DGG TWS P NPNS + V + LL+A N + G
Sbjct: 117 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVGVPGKGLLVALNDLREGRF 173
Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
K++L D D D L L + + EF
Sbjct: 174 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 233
Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
YP I++ DG H+ Y++N T IKHV
Sbjct: 234 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 277
>gi|302061461|ref|ZP_07253002.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato K40]
Length = 290
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 101/284 (35%), Gaps = 69/284 (24%)
Query: 94 LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
L LFY + V W+G S + G +WS QL + P N P+
Sbjct: 3 LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 58
Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
+G + E + ++ ++ D G K+ N + P+ A LR
Sbjct: 59 DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 117
Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
+F+ RV S + DGG TWS P NPNS + V + LL+A N + G
Sbjct: 118 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVGVPGKGLLVALNDLREGRF 174
Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
K++L D D D L L + + EF
Sbjct: 175 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 234
Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
YP I++ DG H+ Y++N T IKHV
Sbjct: 235 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 278
>gi|149177156|ref|ZP_01855763.1| hypothetical protein PM8797T_27150 [Planctomyces maris DSM 8797]
gi|148844048|gb|EDL58404.1| hypothetical protein PM8797T_27150 [Planctomyces maris DSM 8797]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 89/223 (39%), Gaps = 33/223 (14%)
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKP-------------ILLENGLLLCGSSVESWNSW 159
RS + G +W++ E P G S+ P + G S W
Sbjct: 148 RSSDGGRSWTKSEDFPAAEAGWSEYIPFGDIWAGSDGALHVSCYQGQFKDPSQSTKTKGW 207
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE- 218
S + D GR+W +SVI P H + +++ R+ E
Sbjct: 208 KSSHLKSEDDGRTW----------SVVSVIGPA--HNETDLFYLGEKNWLAAARIDKMEL 255
Query: 219 --SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA--LSKDDGDS 274
S D G+TW +P N +G +LKDGRLLL+Y G V+ LS D+G +
Sbjct: 256 IRSNDNGVTWQKPQPVTGRNEING-HLTRLKDGRLLLSYGIRVNGRRGVSAKLSSDEGQT 314
Query: 275 WHDALTLEENL-AMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
W + + + + YP+ IQ +G + +T Y+ +HV
Sbjct: 315 WSEPIRISHTADGGDCGYPSSIQKENGEI-VTAWYSSESPQHV 356
>gi|422591606|ref|ZP_16666246.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
morsprunorum str. M302280]
gi|330879173|gb|EGH13322.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
morsprunorum str. M302280]
Length = 288
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 100/282 (35%), Gaps = 69/282 (24%)
Query: 96 LFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLENG 145
LFY + V W+G S + G +WS QL + P N P+ +G
Sbjct: 3 LFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHADG 58
Query: 146 LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
+ E + ++ ++ D G K+ N + P+ A LR
Sbjct: 59 SIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRYAG 117
Query: 206 RSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKV 265
+F+ RV S + DGG TWS P NPNS + V + LL+A N + G K+
Sbjct: 118 ETFH---RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAVGVPGKGLLVALNDLREGRFKL 174
Query: 266 ALSKDDG-----------DSWHDAL----------------------TLEENLAMEF--- 289
+L D D D L L + + EF
Sbjct: 175 SLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFLSN 234
Query: 290 ---------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
YP I++ DG H+ Y++N T IKHV
Sbjct: 235 LDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 276
>gi|343085111|ref|YP_004774406.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
gi|342353645|gb|AEL26175.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
marinum DSM 745]
Length = 388
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/230 (24%), Positives = 101/230 (43%), Gaps = 29/230 (12%)
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKPILL 142
L +L + + +FY M+ S ++ TWS+ G + ++ + L
Sbjct: 126 LLRLQNGDIAMFYLRKNNTDDCIPYMRISKDEAQTWSDPIACITDDEGYFVLNNDRVVQL 185
Query: 143 ENGLLL------CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
E+G L+ G +E + + D G++W + + P E + + +P
Sbjct: 186 EDGTLMLAVAKHAGPGMEWRGRADLYAYSSTDNGQTWDRSEMVPNPKE-IVLQEPGLVEL 244
Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY- 255
A+ ++ +++R+ G+ C S S D G TWS + + L++P S ++ LLA
Sbjct: 245 ADGSILMVIRTDAGVQ--CYSYSKDNGKTWSAVEESTLVSPVSPATIERIPGTGDLLAVW 302
Query: 256 -NTVS--------RGVLKVALSKDDGDSWHDALTLEEN-------LAMEF 289
N VS R L A+SKD+G +W + TLE + +AMEF
Sbjct: 303 NNNVSEDPDIAKLRTPLSSAISKDEGKTWTNYKTLENDPDGWYCYIAMEF 352
>gi|109947931|ref|YP_665159.1| neuraminidase (sialidase) [Helicobacter acinonychis str. Sheeba]
gi|109715152|emb|CAK00160.1| predicted neuraminidase (sialidase) [Helicobacter acinonychis str.
Sheeba]
Length = 347
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 136/315 (43%), Gaps = 40/315 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAST++ + + L AYF G+ EGA DVKI F K RW ++ EE +
Sbjct: 37 SVHASTLISLSNNNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFTLLTKEELSKNA 96
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L ++ +LLF +G + W+ + + + ++ L P
Sbjct: 97 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIDFKFVQKLSLSP 154
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ + KP+ +G + E + ++ D + R+ + PN
Sbjct: 155 FLNLSHLVRTKPLNTTDGGFVLPLYHELATQYPLLLK--FDKHHNPRE---LLRPNALNH 209
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C TW PT L N N ++ +
Sbjct: 210 QLQPSLTPF----KDCAIMTFRNHFFKDYLMLETCKHPTTWQKPMPTNLKNLNDALNLLN 265
Query: 246 LKDGRLLLAYNTVSRGVLK---VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
L L+ + S L+ + LSK ++ +S+ L++ A E SYP+ +
Sbjct: 266 LNGSLYLI--HNPSDLFLRRKELLLSKLENANSFQTLKILDK--ANEVSYPSS-SLNSHF 320
Query: 302 VHITYTYNRTQIKHV 316
+ I YTYNR+ IKH+
Sbjct: 321 IDIVYTYNRSNIKHI 335
>gi|420502896|ref|ZP_15001432.1| hypothetical protein HPHPP41_1082 [Helicobacter pylori Hp P-41]
gi|393150768|gb|EJC51073.1| hypothetical protein HPHPP41_1082 [Helicobacter pylori Hp P-41]
Length = 372
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L+AYF G+ EGA DVKI F K RW I +
Sbjct: 62 SAHASSLISLPNDNLLIAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSQSS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|386747305|ref|YP_006220513.1| hypothetical protein HCD_01355 [Helicobacter cetorum MIT 99-5656]
gi|384553547|gb|AFI05303.1| hypothetical protein HCD_01355 [Helicobacter cetorum MIT 99-5656]
Length = 372
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 48/94 (51%), Gaps = 13/94 (13%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + G+ L AYF GS EGA DVKI F K RW I + N+
Sbjct: 62 SVHASSLISLPNGNLLSAYFSGSKEGARDVKISANLFDSKTNRWSEAFILLTKENLFKNS 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWS 108
+ NP+LF L + +LLF +G + W+
Sbjct: 122 HEFIKKLGNPLLF-LHKDKILLFV-VGVSMGGWA 153
>gi|420461945|ref|ZP_14960732.1| hypothetical protein HPHPH3_0770 [Helicobacter pylori Hp H-3]
gi|393080108|gb|EJB80837.1| hypothetical protein HPHPH3_0770 [Helicobacter pylori Hp H-3]
Length = 372
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 132/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSQSS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|373849242|ref|ZP_09592043.1| hypothetical protein Opit5DRAFT_0097 [Opitutaceae bacterium TAV5]
gi|372475407|gb|EHP35416.1| hypothetical protein Opit5DRAFT_0097 [Opitutaceae bacterium TAV5]
Length = 443
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 88/215 (40%), Gaps = 36/215 (16%)
Query: 133 GPSKNKPILLENGLLLCGSSV-------------ESWNSWGSWMEVTV----DAGRSWRK 175
G + NKPI+L NG L S+ +++ S V D GR+W +
Sbjct: 183 GSTLNKPIVLANGDWLLPISLWHRGLIHKPFAERDAFRELDSLRGANVFASGDEGRTWER 242
Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
G + P + + + + T+ + R+ G+ S S DGG TWS + +
Sbjct: 243 RGRVAFPRPNFD--EHMFIERRDGTIWMTARTRRGL---YESISADGGRTWSAPVESAIK 297
Query: 236 NPNSGIDGVKLKDGRLLLAY------------NTVSRGVLKVALSKDDGDSWHDALTLEE 283
++ +L G +LL R L LS DDG +W L L++
Sbjct: 298 TASARHFIRRLASGNILLVKYGEKIDEVPTRDGKTVRSHLTAFLSDDDGHTWKGGLLLDD 357
Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
+ +YP QA DG++++++ +R V++
Sbjct: 358 RTPV--TYPDGFQAPDGTLYVSWDKSRNAEGEVLM 390
>gi|255533458|ref|YP_003093830.1| glycosyl hydrolase family protein [Pedobacter heparinus DSM 2366]
gi|255346442|gb|ACU05768.1| glycosyl hydrolase BNR repeat-containing protein [Pedobacter
heparinus DSM 2366]
Length = 383
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 109/245 (44%), Gaps = 25/245 (10%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL--- 127
+I ++E + + + L +L + + LFY ++ S ++ TW++
Sbjct: 105 VIVEKEGTMNVMSVSLLRLKNGKIALFYLKKNSTSDCIPMVRFSSDEAKTWTDPIACITD 164
Query: 128 PPGILGPSKNKPILLENGLLLCGSSVES------WNSWGS-WMEVTVDAGRSWRKYGPIY 180
G + ++ I L++G +L ++ W++ G+ W + D G++W+ +
Sbjct: 165 KKGYFVLNNSRVIQLKSGRILLAVALHQTPEQAKWSNSGTLWSYYSDDNGKTWKPGQQVA 224
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS- 239
P+ V+ P + VLM G +S S + G +WS A+P+ + +P S
Sbjct: 225 NPD---GVVTQEPGVVELKNGNVLMFIRTDAGVQYLSYSKNKGQSWSPAQPSTIKSPVSP 281
Query: 240 -GIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
I + L G L+L +N R L VA+SKD+G +W + T+E++ +
Sbjct: 282 ASIKRIPLT-GDLILVWNNNGGDNPLIKGKRTPLNVAISKDEGKTWQNIKTVEDDPNGWY 340
Query: 290 SYPAV 294
Y A+
Sbjct: 341 CYIAI 345
>gi|420472054|ref|ZP_14970750.1| hypothetical protein HPHPH18_0671 [Helicobacter pylori Hp H-18]
gi|393091409|gb|EJB92041.1| hypothetical protein HPHPH18_0671 [Helicobacter pylori Hp H-18]
Length = 372
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 134/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|26988313|ref|NP_743738.1| BNR repeat-containing protein [Pseudomonas putida KT2440]
gi|24983060|gb|AAN67202.1|AE016347_3 BNR domain protein [Pseudomonas putida KT2440]
Length = 663
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 3/110 (2%)
Query: 210 GIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRLLLAYNT-VSRGVLKVA 266
G V MS S D TW+ L P+S I + GRLL+ +N + R L +A
Sbjct: 526 GDSGVLMSVSTDMMKTWTPWVNYLALAPSSSSRIWLGRSPSGRLLICWNNDIERKTLTLA 585
Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
LS D+G +W + LE N + +YP V DG + I Y RT + +
Sbjct: 586 LSDDEGVTWPHKVVLEPNSVGQVTYPIVTFGDDGLILIAYDNERTAKRQI 635
>gi|225158831|ref|ZP_03725147.1| hypothetical protein ObacDRAFT_7801 [Diplosphaera colitermitum
TAV2]
gi|224802584|gb|EEG20840.1| hypothetical protein ObacDRAFT_7801 [Diplosphaera colitermitum
TAV2]
Length = 386
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 104/254 (40%), Gaps = 45/254 (17%)
Query: 83 NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK------ 136
+P L +LP L + Y K S ++G TW+ PP ++ K
Sbjct: 88 SPALRRLPDGRLGMLYSFRLGTTKAMRHFVTSKDEGKTWT-----PPVVVASGKYVTGCH 142
Query: 137 NKPILLENGLLLCG-SSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPN---------- 183
++ +L++G + E+W + V D GRSWR P+ +P
Sbjct: 143 DRFTILQSGRFVAPLHCSENWEDHYLYTRVATSDDNGRSWRVSEPLELPYVGGLGGLGDH 202
Query: 184 ------ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
ES +P + +L + MR+ +G SES D GLTWS + ++++P
Sbjct: 203 WKGAFYES-GCYEPGVVQRVDGSLLMTMRT--AMGTQFCSESFDEGLTWSPLRSLEVVSP 259
Query: 238 NSGIDGVKLKDGRLLLA-----YNTV-----SRGVLKVALSKDDGDSW--HDALTLEENL 285
+ + V++ +LL Y +R + LS DDG +W H + ++
Sbjct: 260 QAPANLVRIPGTPVLLLLWSSDYGATIGTGGNRRCISCGLSMDDGRTWPHHLRRHVIQDT 319
Query: 286 AMEFSYPAVIQASD 299
+ YP+V+ +D
Sbjct: 320 VSSWDYPSVLFHND 333
>gi|374373452|ref|ZP_09631112.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
19437]
gi|373234425|gb|EHP54218.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
19437]
Length = 387
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 103/267 (38%), Gaps = 24/267 (8%)
Query: 68 QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
Q I D E + + + L +L + + LFY ++ S ++ TWS
Sbjct: 107 QDVKIIDREGGMNVMSVSLLRLRNGTIALFYLRKNSTVDCIPVVRFSNDEAKTWSAPVDC 166
Query: 128 ---PPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYG 177
G + ++ I L +G LL + S + W E + D G++W
Sbjct: 167 ITDQQGYFVLNNDRVIQLSDGRLLLAVAKHSSPADTKWQEKGALFAYYSDDNGKTWTAGK 226
Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
+ P ++I P A + R+LM G S S D G TWS A P L +P
Sbjct: 227 QVPTP---ATIITQEPGLVALKDGRILMYIRASGGSQYASYSKDRGATWSDALPFTLKSP 283
Query: 238 NSGIDGVKL-KDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAM 287
+ ++ G LL +N R L VA+SKDDG SW LE +
Sbjct: 284 IAPATIERIPSTGDLLAVWNNNDGSIATIKGKRTPLTVAISKDDGISWIHIKNLEADPDG 343
Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIK 314
+ Y A I SV ++Y Q K
Sbjct: 344 WYCYTA-IHFYKNSVLLSYCAGSQQAK 369
>gi|15611594|ref|NP_223245.1| hypothetical protein jhp0527 [Helicobacter pylori J99]
gi|4155072|gb|AAD06108.1| putative [Helicobacter pylori J99]
Length = 372
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|412988849|emb|CCO15440.1| hypothetical protein CHLNCDRAFT_134209 [Bathycoccus prasinos]
Length = 676
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 79/190 (41%), Gaps = 37/190 (19%)
Query: 166 TVDAGRSWRKYGPIYIPNESLS--VIQPVPFHTA-NRTLRVLMRSFNGIGRVCMSESCDG 222
+ D G +W+ G I + VI+ F + N L +LMRS G+ S S D
Sbjct: 447 STDKGETWKANGAIQFQTGAKGRWVIEGSIFESGENHHLTMLMRSTEGV--AFKSISSDL 504
Query: 223 GLTWSYAKPTQLLNPNSGIDG-VKLKDGRLLLAYNTVS---------------------- 259
G+TWS P+ L+NP+S ++ V K RL +A+N +
Sbjct: 505 GVTWSAPLPSALVNPDSKLNTIVNRKTKRLYVAFNDETIAMTASEGGADAIKGTSEDKEQ 564
Query: 260 ------RGVLKVALSKDDGDSWHDALTLEENL---AMEFSYPAVIQASDGSVHITYTYNR 310
R L +A S DDG++ + ++E + + YP + D + +TY+ +
Sbjct: 565 TAGGKKRDELVIAHSDDDGENLYIFARIDEGVTPPGVMIHYPTMAFYDDQELFVTYSISY 624
Query: 311 TQIKHVVLLP 320
+ V P
Sbjct: 625 NNATNFVEKP 634
>gi|302133069|ref|ZP_07259059.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato NCPPB
1108]
Length = 291
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 100/284 (35%), Gaps = 69/284 (24%)
Query: 94 LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
L LFY + V W+G S + G +WS QL + P N P+
Sbjct: 4 LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 59
Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
+G + E + ++ ++ D G K+ N + P+ A LR
Sbjct: 60 DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 118
Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
+F+ RV S + DGG TWS P NPNS + + LL+A N + G
Sbjct: 119 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAGGVPGKGLLVALNDLREGRF 175
Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
K++L D D D L L + + EF
Sbjct: 176 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 235
Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
YP I++ DG H+ Y++N T IKHV
Sbjct: 236 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 279
>gi|420413529|ref|ZP_14912653.1| hypothetical protein HPNQ4099_0770 [Helicobacter pylori NQ4099]
gi|420418517|ref|ZP_14917609.1| hypothetical protein HPNQ4076_0654 [Helicobacter pylori NQ4076]
gi|393029516|gb|EJB30597.1| hypothetical protein HPNQ4099_0770 [Helicobacter pylori NQ4099]
gi|393033343|gb|EJB34406.1| hypothetical protein HPNQ4076_0654 [Helicobacter pylori NQ4076]
Length = 372
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALESIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ KNKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + ++ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420531064|ref|ZP_15029439.1| hypothetical protein HPHPP28B_0548 [Helicobacter pylori Hp P-28b]
gi|393139090|gb|EJC39471.1| hypothetical protein HPHPP28B_0548 [Helicobacter pylori Hp P-28b]
Length = 372
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSQSS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420432150|ref|ZP_14931169.1| hypothetical protein HPHPH16_0883 [Helicobacter pylori Hp H-16]
gi|393049743|gb|EJB50709.1| hypothetical protein HPHPH16_0883 [Helicobacter pylori Hp H-16]
Length = 372
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|424663009|ref|ZP_18100046.1| hypothetical protein HMPREF1205_03395 [Bacteroides fragilis HMW
616]
gi|404576699|gb|EKA81437.1| hypothetical protein HMPREF1205_03395 [Bacteroides fragilis HMW
616]
Length = 360
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 8/144 (5%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G SW K + I ++P ++ + + MR+ G + +G L
Sbjct: 197 DEGNSWDKSDQVVIDESKYGGLEPTIVELSSDKVLMSMRTDLGFQYFAYFDH-NGKLVKG 255
Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---RGVLKVALSKDDGDSWHDALTLEE- 283
+ + L + +S + LK G LL +N R L ++LS D+GDSW++ LEE
Sbjct: 256 PVR-SSLPSSDSPAHLISLKHGELLAVHNNCDVDMRNRLSISLSLDEGDSWNEEFILEES 314
Query: 284 -NLAMEFSYPAVIQASDGSVHITY 306
N FSYP++ + D + ITY
Sbjct: 315 ANEKNSFSYPSLKKVGD-RLLITY 337
>gi|420477675|ref|ZP_14976330.1| hypothetical protein HPHPH23_1232 [Helicobacter pylori Hp H-23]
gi|393092354|gb|EJB92975.1| hypothetical protein HPHPH23_1232 [Helicobacter pylori Hp H-23]
Length = 372
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 128/312 (41%), Gaps = 34/312 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
L L+ + +S ++ LSK + + L + + A E SYP+ + + I
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 348
Query: 305 TYTYNRTQIKHV 316
YTYNR+ IKH+
Sbjct: 349 VYTYNRSHIKHI 360
>gi|373957578|ref|ZP_09617538.1| hypothetical protein Mucpa_6016 [Mucilaginibacter paludis DSM
18603]
gi|373894178|gb|EHQ30075.1| hypothetical protein Mucpa_6016 [Mucilaginibacter paludis DSM
18603]
Length = 418
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPN--SGIDGVKLKDGRLLLAYNTV-SRGVLKV 265
N G + S S D G +W + + + PN S + KLK G +LL N R L V
Sbjct: 281 NNAGNIQYSNSYDNGNSWDTIRAFKKIFPNAASRVFIRKLKSGNILLIINNSWYRTNLTV 340
Query: 266 ALSKDDGDSW-HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
LS ++G SW + + + ++ SYP +Q +DG +H+ +R
Sbjct: 341 FLSVNNGVSWKYKTVLVANDIVASPSYPDAVQTTDGVIHMVNDADR 386
>gi|313148318|ref|ZP_07810511.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423278118|ref|ZP_17257032.1| hypothetical protein HMPREF1203_01249 [Bacteroides fragilis HMW
610]
gi|313137085|gb|EFR54445.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404586128|gb|EKA90701.1| hypothetical protein HMPREF1203_01249 [Bacteroides fragilis HMW
610]
Length = 568
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ ++NG + ++ G + V+ D G +W+ + P+ +
Sbjct: 144 PVFIKNGKRIVVAAHGGVTPKGCYTYVSDDDGLTWKCSNTVTSPDHRGGGFHKGIRWNHG 203
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ R + S DGGLTWS ++P+ + +L
Sbjct: 204 AVEPTVVELKDGTLWMLMRTSQD--RHYQAFSQDGGLTWSESEPSPFYGTITMPTLGRLA 261
Query: 248 DGRLLLAY-NT-----------------VSRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 262 DGRLLLFWCNTTPLPEKEGTNGVWDDVFTNRDVTHVAVSDDDGKTW 307
>gi|420442053|ref|ZP_14940994.1| hypothetical protein HPHPH36_0841 [Helicobacter pylori Hp H-36]
gi|393058853|gb|EJB59739.1| hypothetical protein HPHPH36_0841 [Helicobacter pylori Hp H-36]
Length = 372
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFVLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK + +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELLLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|385218843|ref|YP_005780318.1| hypothetical protein HPGAM_02985 [Helicobacter pylori Gambia94/24]
gi|317014001|gb|ADU81437.1| hypothetical protein HPGAM_02985 [Helicobacter pylori Gambia94/24]
Length = 340
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 129/312 (41%), Gaps = 34/312 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + + +
Sbjct: 30 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHYS 89
Query: 83 --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 90 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESTLEPIHFKFARKLSLSP 147
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 148 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 202
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 203 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPISTNLKNLDDSLNLLN 258
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
L L+ + +S ++ LSK + + L + + A E SYP+ + + I
Sbjct: 259 LNKELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 316
Query: 305 TYTYNRTQIKHV 316
YTYNR+ IKH+
Sbjct: 317 VYTYNRSHIKHI 328
>gi|424664116|ref|ZP_18101153.1| hypothetical protein HMPREF1205_04502 [Bacteroides fragilis HMW
616]
gi|404576152|gb|EKA80892.1| hypothetical protein HMPREF1205_04502 [Bacteroides fragilis HMW
616]
Length = 568
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 70/166 (42%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + V+ D G +W+ + P+ +
Sbjct: 144 PVFIRNGKRIVVAAHGGVTPKGCYTYVSDDDGLTWKCSNTVTSPDHRGGGFHKGIRWNHG 203
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ R + S DGGLTWS ++P+ + +L
Sbjct: 204 AVEPTVVELKDGTLWMLMRTSQD--RHYQAFSQDGGLTWSESEPSPFYGTITMPTLGRLA 261
Query: 248 DGRLLLAY-NT-----------------VSRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 262 DGRLLLFWCNTTPLPEKEGTNGVWDDVFTNRDVTHVAVSDDDGKTW 307
>gi|420475418|ref|ZP_14974089.1| hypothetical protein HPHPH21_0602 [Helicobacter pylori Hp H-21]
gi|393093525|gb|EJB94142.1| hypothetical protein HPHPH21_0602 [Helicobacter pylori Hp H-21]
Length = 372
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 134/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|420485606|ref|ZP_14984224.1| hypothetical protein HPHPP4_0781 [Helicobacter pylori Hp P-4]
gi|420516100|ref|ZP_15014563.1| hypothetical protein HPHPP4C_0821 [Helicobacter pylori Hp P-4c]
gi|420517803|ref|ZP_15016257.1| hypothetical protein HPHPP4D_0819 [Helicobacter pylori Hp P-4d]
gi|393103741|gb|EJC04304.1| hypothetical protein HPHPP4_0781 [Helicobacter pylori Hp P-4]
gi|393123302|gb|EJC23771.1| hypothetical protein HPHPP4D_0819 [Helicobacter pylori Hp P-4d]
gi|393124399|gb|EJC24867.1| hypothetical protein HPHPP4C_0821 [Helicobacter pylori Hp P-4c]
Length = 372
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 129/314 (41%), Gaps = 38/314 (12%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L N +LLF +G + W+ + + + + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFVRKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFILPLYHELVTQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTN-LKNLDDSLNLL 289
Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+G L L +N + + LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 290 NLNGILYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346
Query: 303 HITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360
>gi|425432777|ref|ZP_18813325.1| hypothetical protein HMPREF1391_00919 [Helicobacter pylori
GAM100Ai]
gi|410714701|gb|EKQ72155.1| hypothetical protein HMPREF1391_00919 [Helicobacter pylori
GAM100Ai]
Length = 347
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS ++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 37 SVHASALIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 96
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + NP+LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 97 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 154
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 155 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDKQNNPRE---LLRPNTLNH 209
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 265
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 266 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-NLNPHFID 322
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 323 IVYTYNRSHIKHI 335
>gi|417511435|ref|ZP_12176057.1| hypothetical protein LTSESEN_3055, partial [Salmonella enterica
subsp. enterica serovar Senftenberg str. A4-543]
gi|353643055|gb|EHC87336.1| hypothetical protein LTSESEN_3055, partial [Salmonella enterica
subsp. enterica serovar Senftenberg str. A4-543]
Length = 61
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 37/59 (62%), Gaps = 2/59 (3%)
Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
R L + S+D+G+SW L LE + E+SYPA+I + G VHITYT+NR I + L
Sbjct: 2 RYPLSLIASQDNGESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 58
>gi|160890988|ref|ZP_02071991.1| hypothetical protein BACUNI_03435 [Bacteroides uniformis ATCC 8492]
gi|270294310|ref|ZP_06200512.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|156859209|gb|EDO52640.1| BNR/Asp-box repeat protein [Bacteroides uniformis ATCC 8492]
gi|270275777|gb|EFA21637.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 398
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 74/162 (45%), Gaps = 18/162 (11%)
Query: 109 GCMKRSYNKGVTWSE--------REQLPPGILG--PSKNKPILLE--NGLL---LCGSSV 153
G + RSY+ G TW++ E G+ G P+ N+ L E +G + + S
Sbjct: 161 GYLLRSYDDGRTWNDLIIPCSVPSEINYSGLGGKLPAYNRGALYEGKDGRIFWAVVASDT 220
Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
E+ + ++ ++ D G++W +YG ++ +S + + T N L MRS +
Sbjct: 221 EALRNTSVYLLISKDKGKTW-EYGTKIAKDKKVSFNETSMYETPNGDLIAFMRSEAFGDQ 279
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
C++ S DGG T+ K + ++L D R+LL Y
Sbjct: 280 ACIARSIDGGKTFGEWKSMGF--KGHPLQAMRLPDNRVLLTY 319
>gi|395798866|ref|ZP_10478149.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. Ag1]
gi|395337100|gb|EJF68958.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. Ag1]
Length = 267
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 66/180 (36%), Gaps = 52/180 (28%)
Query: 189 IQPVPFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+QP R ++R + + +V S + D G TWS P NPNS + V
Sbjct: 77 LQPTVVPMDERNAVAMLRYAGDTFHKVLASRTEDAGHTWSEPYPLDPANPNSSLAAVATP 136
Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE---------------------- 283
+ LL+A N + G ++ L + D W + L+E
Sbjct: 137 NNGLLVALNDLKDGRFRLRLMGTNSRLDFWKPLINLDESPDPEGNPFSAEAYREIIGDKF 196
Query: 284 -----------------NLA----------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
NL E+ YP I++++G H+ Y++N T IKHV
Sbjct: 197 RLSSGPRREMLVDKFLVNLDQRVCEETGCNFEYEYPYFIKSAEGLYHLVYSWNNTFIKHV 256
>gi|317480936|ref|ZP_07940016.1| hypothetical protein HMPREF1007_03135 [Bacteroides sp. 4_1_36]
gi|316902829|gb|EFV24703.1| hypothetical protein HMPREF1007_03135 [Bacteroides sp. 4_1_36]
Length = 370
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 74/162 (45%), Gaps = 18/162 (11%)
Query: 109 GCMKRSYNKGVTWSE--------REQLPPGILG--PSKNKPILLE--NGLL---LCGSSV 153
G + RSY+ G TW++ E G+ G P+ N+ L E +G + + S
Sbjct: 133 GYLLRSYDDGRTWNDLIIPCSVPSEINYSGLGGKLPAYNRGALYEGKDGRIFWAVVASDT 192
Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
E+ + ++ ++ D G++W +YG ++ +S + + T N L MRS +
Sbjct: 193 EALRNTSVYLLISKDKGKTW-EYGTKIAKDKKVSFNETSMYETPNGDLIAFMRSEAFGDQ 251
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
C++ S DGG T+ K + ++L D R+LL Y
Sbjct: 252 ACIARSIDGGKTFGEWKSMGF--KGHPLQAMRLPDNRVLLTY 291
>gi|289626890|ref|ZP_06459844.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. aesculi str. NCPPB 3681]
Length = 279
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 84/218 (38%), Gaps = 23/218 (10%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185
Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
I + P+ +G + E + ++ ++ D G K+ N
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GELIDKFRISRGKNSLQPT 244
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
+ P+ A LR +F+ RV S + DGG TW
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTW 279
>gi|386749930|ref|YP_006223137.1| hypothetical protein HCW_06235 [Helicobacter cetorum MIT 00-7128]
gi|384556173|gb|AFI04507.1| hypothetical protein HCW_06235 [Helicobacter cetorum MIT 00-7128]
Length = 377
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 127/312 (40%), Gaps = 36/312 (11%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPII---------ADE 75
HAS+++ + LVAYF G+ EGA DV+I + +W I +
Sbjct: 65 VHASSLISLPNKTLLVAYFSGTREGAKDVQINANIYNPNTKKWSEAFILLTRETLSKSAH 124
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM-----KRSYNKGVTWSEREQLPP- 129
E + NP+LF N +LLF +G + W+ S +K L P
Sbjct: 125 EYIKKLGNPLLF-WHQNKILLFV-VGVSMGGWATSKIYQLESPSIHKPFKLVRSLHLSPF 182
Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
+ + KP+ ++G + E + + ++ A + ++ PN +
Sbjct: 183 LNLSHLVRTKPLNTDDGGFILPLYHELASQYPLLLKFDFQANPT-----ELFRPNLLHAQ 237
Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
+QP T + ++ + M ++C W PT L + +
Sbjct: 238 LQPSI--TPYKKCAIMAFRNHHFSNALMLQTCLSPTKWQSLLPTNL-KNLDDSLNLLNLN 294
Query: 249 GRLLLAYN----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
L L +N ++ R L ++ K+D +++H L++ A E SYP+ + + + I
Sbjct: 295 NTLYLIHNPDNLSLRRQKLLISQLKND-NTFHTLSVLDK--ADEVSYPSSL-IDNNFIDI 350
Query: 305 TYTYNRTQIKHV 316
TYTYNR I+H+
Sbjct: 351 TYTYNRQTIRHI 362
>gi|26990557|ref|NP_745982.1| hypothetical protein PP_3852 [Pseudomonas putida KT2440]
gi|24985537|gb|AAN69446.1|AE016580_5 hypothetical protein PP_3852 [Pseudomonas putida KT2440]
Length = 518
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 3/119 (2%)
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG--VKLKDGRLLLAYNT-VS 259
L+RS ++ S S D TW+ + + P++ + G LLL +N +
Sbjct: 374 ALLRSLGSNSQMQYSVSRDLMRTWTPWQDYTVTAPSASSRAWLGRTPSGLLLLCWNNDLV 433
Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
RG L V LS DDG ++ +E + +++ SYP V DG + + Y R K + +
Sbjct: 434 RGTLTVGLSDDDGATYRYKKIIEPDSSIQVSYPVVSFGDDGEILVIYDNGRDSHKQIRI 492
>gi|32472278|ref|NP_865272.1| sialidase [Rhodopirellula baltica SH 1]
gi|32443514|emb|CAD72956.1| probable sialidase [Rhodopirellula baltica SH 1]
Length = 1153
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)
Query: 32 IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
V++ G L+ Y F G+ + AP + + +G W + +I + + + +
Sbjct: 57 FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
L +L + LFY ++K+ M+ S ++G TWSE ++ P
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170
Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
PS ++ I L +G L+ +V W + + ++ D G +W++ +
Sbjct: 171 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 229
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
+ESL+ +P ++ + + RS N +S S D G TWS KP+ P
Sbjct: 230 TSESLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVSP 285
Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
++ G LL+ +N V R A+SKDDG +W + + + +
Sbjct: 286 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 345
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
Y A+ D + Y R + +P
Sbjct: 346 CYTAIEFVDDHVLLAHCEYPRLNSLQLTRIP 376
>gi|420487269|ref|ZP_14985874.1| hypothetical protein HPHPP8_0748 [Helicobacter pylori Hp P-8]
gi|420521053|ref|ZP_15019484.1| hypothetical protein HPHPP8B_0642 [Helicobacter pylori Hp P-8b]
gi|393102390|gb|EJC02955.1| hypothetical protein HPHPP8_0748 [Helicobacter pylori Hp P-8]
gi|393127640|gb|EJC28085.1| hypothetical protein HPHPP8B_0642 [Helicobacter pylori Hp P-8b]
Length = 372
Score = 45.1 bits (105), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 129/312 (41%), Gaps = 34/312 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
S HAS+++ + + L AYF G+ EGA DVKI F K RW I + +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSESFILLTKEELSKNA 121
Query: 80 -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
L L+ + +S ++ LSK + + L + + A E SYP+ + + I
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 348
Query: 305 TYTYNRTQIKHV 316
YTYNR+ IKH+
Sbjct: 349 VYTYNRSHIKHI 360
>gi|373953332|ref|ZP_09613292.1| hypothetical protein Mucpa_1672 [Mucilaginibacter paludis DSM
18603]
gi|373889932|gb|EHQ25829.1| hypothetical protein Mucpa_1672 [Mucilaginibacter paludis DSM
18603]
Length = 418
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRLLLAYN-TVSRGVLKV 265
N G + S S + G +W + + PN+ + KLK G +LL N + R L V
Sbjct: 281 NNAGNIQYSNSYNNGSSWDTISGFKTIFPNAASRVFIGKLKSGNILLIINNSFYRTNLTV 340
Query: 266 ALSKDDGDSW-HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
LS ++G SW + ++ +E ++ SYP +Q +D +H+ +R
Sbjct: 341 FLSVNNGVSWKYKSVLVENDIVAAPSYPDAVQTTDNVIHMVNDADR 386
>gi|420445202|ref|ZP_14944116.1| hypothetical protein HPHPH42_0609 [Helicobacter pylori Hp H-42]
gi|393063396|gb|EJB64243.1| hypothetical protein HPHPH42_0609 [Helicobacter pylori Hp H-42]
Length = 372
Score = 44.7 bits (104), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
E + N +LF L N +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNSLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360
>gi|449134458|ref|ZP_21769959.1| sialidase [Rhodopirellula europaea 6C]
gi|448887088|gb|EMB17476.1| sialidase [Rhodopirellula europaea 6C]
Length = 1153
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 134/332 (40%), Gaps = 55/332 (16%)
Query: 32 IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RW---QSPIIADEEPNVPMWNPV 85
V++ G L+ Y F G+ + AP + + +G W + +I + + + +
Sbjct: 57 FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSSDNGITWTTEDASVIERGDDDANLMSVS 116
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
L +L + LFY ++K+ M+ S ++G TWSE ++ P
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPEAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170
Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
PS ++ I L +G L+ +V W + + ++ D G++W++ +
Sbjct: 171 PSYSVLNNDRVIQLRSGRLIVPLAVHYRVGWPGYRKSAEIVCYLSDDQGKTWKRSQSA-L 229
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN--- 238
+ESL+ +P ++ L + RS N +S S D G WS P+ P
Sbjct: 230 TSESLAQ-EPGVVELSDDRLMMFCRSSNA---QLLSYSDDQGDFWSDFTPSSFTQPTVSP 285
Query: 239 SGIDGVKLKDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAME 288
+ I+ ++ G LL+ +N V R A+SKDDG +W + + +
Sbjct: 286 ASIERIQ-STGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGW 344
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
+ Y A+ D + Y R + +P
Sbjct: 345 YCYTAIEFVGDHVLLAHCEYPRLNSLQITRIP 376
>gi|422406818|ref|ZP_16483837.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. glycinea str. race 4]
gi|330882049|gb|EGH16198.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 235
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 68/181 (37%), Gaps = 54/181 (29%)
Query: 189 IQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
+QP VP R + +L + RV S + DGG TWS +P NPNS + V +
Sbjct: 44 LQPTVVPL-DGQRAVALLRYAGETFHRVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGV 102
Query: 247 KDGRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME--------- 288
LL+A N + G K++L D W L+++ A+E
Sbjct: 103 PGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEG 162
Query: 289 -------------------------------FSY--PAVIQASDGSVHITYTYNRTQIKH 315
F Y P I++ DG H+ Y++N T IKH
Sbjct: 163 FRASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKH 222
Query: 316 V 316
V
Sbjct: 223 V 223
>gi|421611766|ref|ZP_16052898.1| sialidase [Rhodopirellula baltica SH28]
gi|408497479|gb|EKK02006.1| sialidase [Rhodopirellula baltica SH28]
Length = 1153
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 32 IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
V++ G L+ Y F G+ + AP + + +G W + +I + + + +
Sbjct: 57 FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPP---- 129
L +L + LFY ++K+ M+ S ++G TWSE ++ P
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPEDTP 171
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIY 180
+L + ++ I L +G L+ +V W + + ++ D G +W++
Sbjct: 172 SYSVL--NNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA- 228
Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
+ +ESL+ +P ++ + + RS N +S S D G TWS KP+ P
Sbjct: 229 LTSESLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVS 284
Query: 241 IDGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAME 288
++ G LL+ +N V R A+SKDDG +W + + +
Sbjct: 285 PASIERFPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGW 344
Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
+ Y A+ D + Y R + +P
Sbjct: 345 YCYTAIEFVDDHVLLAHCEYPRLNSLQLTRVP 376
>gi|87310349|ref|ZP_01092479.1| probable sialidase [Blastopirellula marina DSM 3645]
gi|87286848|gb|EAQ78752.1| probable sialidase [Blastopirellula marina DSM 3645]
Length = 373
Score = 44.3 bits (103), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 119/314 (37%), Gaps = 44/314 (14%)
Query: 32 IVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD-GR-WQSPIIADEEPNVP----MWNPV 85
+V+ G L Y E P+ I ++ D GR W I +P P +P
Sbjct: 52 MVQRKDGSLLFCY----TEYGPEGGIVAKSSTDQGRTWTEAKILVPQPVSPDPGRFNHPS 107
Query: 86 LFKLPSNGLLLFYKIGQEVQK---WSGCMKRSYNKGVTWSEREQLPP--GILGPSKNKPI 140
L +L + LLL Y K +RS + G TW E+ L P G +K +
Sbjct: 108 LLRLTNGDLLLSYNYTTHPTKPYYAVTLYRRSTDDGATWGEQFPLTPYSGYTLIHNDKLL 167
Query: 141 LLENGLLLCGSSVESW-------NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
LE+G ++ + + + N + + + D G SW + ES +
Sbjct: 168 TLEDGRIIAVAETKKYLPSSQDHNGYVGLVFYSDDQGYSWYPSKNVVDLYESRKIDVQEA 227
Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN-PNSGIDGVKL--KDGR 250
R R++M + + G + S D G TWS K L P +G+ V+ G
Sbjct: 228 DAVELRDGRLMMFARSYSGHPVKAFSTDRGETWSQGKMIPALKMPYAGLPTVRRIPSTGD 287
Query: 251 LLLAY-----------NTVSRGVLKVALSKDDGDSWHDALTLEENLAME----FSYPAVI 295
LL + R L A+SKD+G+++ L+ N+ E Y V
Sbjct: 288 LLFVWISERSALKSDPKVPVRSTLTSAISKDEGETF----ILQRNIVKEPGNDLGYQCVE 343
Query: 296 QASDGSVHITYTYN 309
DG+ I Y N
Sbjct: 344 FLEDGTALIAYHAN 357
>gi|420420528|ref|ZP_14919614.1| hypothetical protein HPNQ4161_1038 [Helicobacter pylori NQ4161]
gi|393036219|gb|EJB37259.1| hypothetical protein HPNQ4161_1038 [Helicobacter pylori NQ4161]
Length = 372
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 131/311 (42%), Gaps = 32/311 (10%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS+++ + + L AYF G+ EGA DVKI F K RW I+ EE +
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF + +L + +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLFS--HDDKILLFVVGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + + + D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
+QP T+ + V+ + M E+C W T L N + ++ + L
Sbjct: 235 QLQPSL--TSFKDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLNLN 292
Query: 248 DGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
L+ + +S ++ LSK + +S+ L++ A E SYP+ + + I
Sbjct: 293 GILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIV 349
Query: 306 YTYNRTQIKHV 316
YTYNR+ IKH+
Sbjct: 350 YTYNRSHIKHI 360
>gi|430743658|ref|YP_007202787.1| BNR/Asp-box repeat-containing protein [Singulisphaera acidiphila
DSM 18658]
gi|430015378|gb|AGA27092.1| BNR/Asp-box repeat protein [Singulisphaera acidiphila DSM 18658]
Length = 410
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 110/247 (44%), Gaps = 44/247 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNG---LLLFYKIGQ--EVQKWSGCMK--------RSYNK 117
+IAD+ P+ M NP G +L+ GQ E Q +G + +S ++
Sbjct: 126 VIADDGPHT-MGNPCPVVDRETGTIWMLMTRNHGQDKESQILNGTSQETRSVWVMKSTDE 184
Query: 118 GVTWSEREQLPPGILGPS-------KNKPILLENGLLL--CGSSVESWNSWGSWMEVTVD 168
G+TWS R ++ P + P I LE+G ++ C + S + + D
Sbjct: 185 GLTWSPRVEITPQVKLPDWTWYATGPGVSIQLESGRMVVPCDHFLAKSKVSRSHVIYSDD 244
Query: 169 AGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRS---FNGIGRVCMSESCDGGLT 225
G +W+ G + E ++ Q +NRTL + MR G GR +S S DGGLT
Sbjct: 245 HGATWKIGG---VGGERVNECQVAEL--SNRTLLLNMRHQPPRRGEGRA-ISTSRDGGLT 298
Query: 226 WSYAKPT---QLLNP--NSGIDGVKLKDGRL---LLAYNTVS--RGVLKVALSKDDGDSW 275
W+ PT L+ P + + G+K + GR LL N S R ++ V LS+D G +W
Sbjct: 299 WT--PPTLDPSLVEPACQASLIGLKDEQGRASNRLLFSNPASSKRELMTVRLSEDGGKTW 356
Query: 276 HDALTLE 282
+L
Sbjct: 357 SQHKSLH 363
>gi|440717535|ref|ZP_20898020.1| sialidase [Rhodopirellula baltica SWK14]
gi|436437339|gb|ELP30987.1| sialidase [Rhodopirellula baltica SWK14]
Length = 1121
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 71/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)
Query: 32 IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
V++ G L+ Y F G+ + AP + + +G W + +I + + + +
Sbjct: 25 FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 84
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
L +L + LFY ++K+ M+ S ++G TWSE ++ P
Sbjct: 85 LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 138
Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
PS ++ I L +G L+ +V W + + ++ D G +W++ +
Sbjct: 139 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 197
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
++SL+ +P ++ + + RS N +S S D G TWS KP+ P
Sbjct: 198 TSKSLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSELKPSSFTQPTVSP 253
Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
++ G LL+ +N V R A+SKDDG +W + + + +
Sbjct: 254 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 313
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
Y A+ D + Y R + +P
Sbjct: 314 CYTAIEFVDDHVLLAHCEYPRLNSLQLTRVP 344
>gi|78485668|ref|YP_391593.1| BNR repeat-containing glycosyl hydrolase [Thiomicrospira crunogena
XCL-2]
gi|78363954|gb|ABB41919.1| Glycosyl hydrolase [Thiomicrospira crunogena XCL-2]
Length = 354
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 51/232 (21%), Positives = 92/232 (39%), Gaps = 38/232 (16%)
Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGR 171
RS + G TW ++ G G + L LL+ +++ S ++ D G+
Sbjct: 124 RSLDGGQTWEAPVRVQKGYCGATTTMISLSSGALLMTAQNLDYEAGRHYSLTYLSEDEGQ 183
Query: 172 SWRKYGPIYIPNESL--SVIQPVPFHTANRTLRVLMRS-----FNGIGRVCMSESCDGGL 224
SW I I + + + + L+R+ +NG S + G
Sbjct: 184 SWHASNKIDIGGQGHHGGCYEGTLLELKDGRVWYLIRTNLDWFWNGY-------SNNQGK 236
Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----------------------RGV 262
TW++ +P + G+ +L+ GRLLLAYN V+ R
Sbjct: 237 TWTHLEPGLSAGSSPGML-CRLQSGRLLLAYNQVTKEGETLAPRIAGLFSEVPASWQREE 295
Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
L + LS+DDG +W D + + SY ++ + G + +T ++ ++K
Sbjct: 296 LSIRLSEDDGKTWSDPIVIAACKGAWLSYCSLFEPHAGEIWLTTMQSQLKLK 347
>gi|406832273|ref|ZP_11091867.1| hypothetical protein SpalD1_11554 [Schlesneria paludicola DSM
18645]
Length = 397
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 3/95 (3%)
Query: 218 ESCDGGLTWSY-AKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSW 275
S D G TW + P + ++L DGR+ L Y + ++ LS D G SW
Sbjct: 282 RSADDGKTWKLDSTPVPDAGEGNPPAMIRLADGRVCLTYGVRAEPYEMRARLSSDGGRSW 341
Query: 276 HDALTLE-ENLAMEFSYPAVIQASDGSVHITYTYN 309
H + L + + YPA +Q DG + Y ++
Sbjct: 342 HKEIVLRGKGGGRDIGYPATVQRPDGKIVTIYYFH 376
>gi|417303213|ref|ZP_12090274.1| sialidase [Rhodopirellula baltica WH47]
gi|327540483|gb|EGF27066.1| sialidase [Rhodopirellula baltica WH47]
Length = 1153
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 71/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)
Query: 32 IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQS---PIIADEEPNVPMWNPV 85
V++ G L+ Y F G+ + AP + + +G W + +I + + + +
Sbjct: 57 FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116
Query: 86 LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
L +L + LFY ++K+ M+ S ++G TWSE ++ P
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTLDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170
Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
PS ++ I L +G L+ +V W + + ++ D G +W++ +
Sbjct: 171 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 229
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
++SL+ +P ++ + + RS N +S S D G TWS KP+ P
Sbjct: 230 TSKSLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVSP 285
Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
++ G LL+ +N V R A+SKDDG +W + + + +
Sbjct: 286 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 345
Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
Y A+ D + Y R + +P
Sbjct: 346 CYTAIQFVDDHVLLAHCEYPRLNSLQLTRVP 376
>gi|412988324|emb|CCO17660.1| hypothetical protein PTSG_00864 [Bathycoccus prasinos]
Length = 554
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 65/303 (21%), Positives = 111/303 (36%), Gaps = 62/303 (20%)
Query: 20 NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK---IWLQTFKDG-RWQSPIIADE 75
+ + FK H + E+ G VA+ E ++ W + +G + P + +
Sbjct: 139 SESEFKYMHMAMPTELPDGSIAVAFQASPTEYEGSIRQALYWAISRDNGVTFSRPEMIAK 198
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--------------CMKRSYNKGVTW 121
+ +P+W PVL K+ + L Y + ++ K S + G TW
Sbjct: 199 DDQLPVWTPVL-KVSGQRVFLLYTVSSRKCRYYDQSRNALRHSPGGDVVYKVSDDNGKTW 257
Query: 122 SEREQL-----PPGILGPSKNKPILLENG-------------------------LLLCGS 151
S + + G+ N + LE+G LL GS
Sbjct: 258 SSGQVILSYGSEDGMPKVIANTLVELEDGSWVLPFWREPGKTCPQMKSEVKDASLLRKGS 317
Query: 152 -----SVESWNSWGSWMEVTVDAGRSWRKYGPIY---IPNESLSVIQPVPFHTANRTLRV 203
S++ +W + +T G SW + + + + RT V
Sbjct: 318 AGVIRSMDQGLTWEVFGNLTATKGESWLIENTVVERKSERRKTTKKKENNNNKGGRTNNV 377
Query: 204 LMRSF---NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR 260
L++ F +G+ V +S D G TW+ A T L NPNS + +++ DG AY+ V
Sbjct: 378 LVQHFRTKSGVAFVSLSR--DNGKTWTEASETTLPNPNSKMHTIRIDDGDTYDAYDNVDE 435
Query: 261 GVL 263
L
Sbjct: 436 KTL 438
>gi|420482145|ref|ZP_14980782.1| hypothetical protein HPHPP2_0672 [Helicobacter pylori Hp P-2]
gi|420512568|ref|ZP_15011051.1| hypothetical protein HPHPP2B_0676 [Helicobacter pylori Hp P-2b]
gi|393099379|gb|EJB99960.1| hypothetical protein HPHPP2_0672 [Helicobacter pylori Hp P-2]
gi|393157631|gb|EJC57892.1| hypothetical protein HPHPP2B_0676 [Helicobacter pylori Hp P-2b]
Length = 372
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 12/83 (14%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
S HAS+++ + + L AYF G+ EGA DVKI F K RW I
Sbjct: 62 SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSHHS 121
Query: 75 EEPNVPMWNPVLFKLPSNGLLLF 97
E + NP+LF L + +LLF
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLF 143
>gi|320537217|ref|ZP_08037181.1| hypothetical protein HMPREF9554_01919 [Treponema phagedenis F0421]
gi|320145930|gb|EFW37582.1| hypothetical protein HMPREF9554_01919 [Treponema phagedenis F0421]
Length = 525
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 76/191 (39%), Gaps = 35/191 (18%)
Query: 146 LLLCGSSVESWNSWGSWMEVTV------DAGRSWRKYGPIYIPNESLSVIQPVPFHTANR 199
LL + NSWG+ V D ++W+ I P +S + +
Sbjct: 318 LLFSAYGGSTANSWGAHFAERVAVFWTDDEAKTWQHSRYIKTPVHKMS--ESAIIEAPDG 375
Query: 200 TLRVLMRS-FNGIGRVCMSESCDGGLTWSYAKPTQLLNP--NSGIDGVKLK-----DGRL 251
L + RS F G V MS+S DGG TWS A+P N+ I + L+ +
Sbjct: 376 ALIIFSRSDFQG---VMMSQSNDGGATWSEAQPLAGFTATVNNLISVINLERLSELQNKT 432
Query: 252 LLAYN-------TVSRGVLKVALSKDDGDSWHDALTLEENLAME---------FSYPAVI 295
L+A++ T S G +K+ +GD W E +E F+Y +
Sbjct: 433 LVAFSYANGLDRTRSAGTVKIGALVKNGDKWQLDFLWNEGSPLELAVTAADKFFAYSCLA 492
Query: 296 QASDGSVHITY 306
+ DGS+ + Y
Sbjct: 493 ELPDGSLALAY 503
>gi|400293104|ref|ZP_10794989.1| BNR/Asp-box repeat protein [Actinomyces naeslundii str. Howell 279]
gi|399901810|gb|EJN84680.1| BNR/Asp-box repeat protein [Actinomyces naeslundii str. Howell 279]
Length = 771
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 85/211 (40%), Gaps = 44/211 (20%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
+ S + GVTWS+ L P + LG I L NG + L ++ ++W
Sbjct: 507 IAHSDDDGVTWSKPRDLNPQLKQPWMRFLGTCPGNGIALRNGPHAGRLVVPLYFNNDQNW 566
Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
+ + + + D G +W+ + P +E+ S+ + + L
Sbjct: 567 LAMCATVAYSDDHGETWQLGRSPNEGRQTPEGELDPQTFVDETWSLHEAAVVERRDGVLL 626
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAK-----PTQLLNPNS-------GIDGVKLKDGR 250
+ MR+ + GRV +SES D G TW + P PN+ G D + +
Sbjct: 627 LFMRNQHPRGRVAVSESHDAGQTWGAIRFDKELPEIWCQPNAISLPSTEGEDRIVFANAS 686
Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTL 281
L+L + RG + LS D G +W + TL
Sbjct: 687 LMLPF----RGCGVIRLSLDGGRTWKYSRTL 713
>gi|255035678|ref|YP_003086299.1| hypothetical protein Dfer_1900 [Dyadobacter fermentans DSM 18053]
gi|254948434|gb|ACT93134.1| hypothetical protein Dfer_1900 [Dyadobacter fermentans DSM 18053]
Length = 384
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 38/160 (23%), Positives = 72/160 (45%), Gaps = 22/160 (13%)
Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT-----ANRTLRVLMRSFNGI 211
+WG M ++ D G++W GP+ E ++ P+ T + TL +R+ +
Sbjct: 202 KAWGGVM-LSRDKGKTW---GPVITIGEKANL--PLAAETDVILLKDGTLYAALRAQQEV 255
Query: 212 GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVA 266
+ ++S D G TWS + + +LK G +LL + +G +
Sbjct: 256 -HMHFAKSKDMGKTWSDVEDIGFRGHSPSF--TRLKSGEILLTTRAFTDAPSKKGYTGLR 312
Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
+S+D+ +W ++E L +YP+ ++ DGSV + Y
Sbjct: 313 ISRDEAKTWEGPYMVDETLG---AYPSTVELKDGSVLVVY 349
>gi|406835791|ref|ZP_11095385.1| hypothetical protein SpalD1_29249 [Schlesneria paludicola DSM
18645]
Length = 405
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 3/106 (2%)
Query: 167 VDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
DA +W+ + P+ + L+ +P+ + + L L R G + S S D G TW
Sbjct: 240 TDAIDAWKSF-PVLGSSSELAAEEPLWWVLPDNRLLALFRDNRHSGFLYRSFSRDNGQTW 298
Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--VSRGVLKVALSKD 270
T + S I G++L+DGR +L N R L ++LS D
Sbjct: 299 DTPVRTNFPDATSKIHGLRLRDGRYVLVSNANPKKRDPLVISLSND 344
>gi|223935468|ref|ZP_03627385.1| sialidase [bacterium Ellin514]
gi|223895878|gb|EEF62322.1| sialidase [bacterium Ellin514]
Length = 402
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 124/302 (41%), Gaps = 33/302 (10%)
Query: 40 FLVAYF-GGSCEGAPDVKIWLQTFKDG-RW-QSP-IIADEEPNVPMWNPVLFKLPSNGLL 95
FL + F GG+ + +P + + + G W Q P ++ D N + + L +L S +
Sbjct: 79 FLYSQFRGGAADESPSRIVSVYSDDAGLTWNQKPELVVDTGTNQNVMSVSLLRLKSGKIA 138
Query: 96 LFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLC---- 149
LFY I Q + ++ S ++ TWS + + PG + ++ I L NG L+
Sbjct: 139 LFYLIKQNLHDCHPWIQLSSDEAKTWSSPKLVVDAPGYFVLNNDRVIQLNNGRLIVPVAF 198
Query: 150 -GSSVESWNSWGSWMEVTV-------DAGRSWRKYGPIY-IPNESLSVIQ-PVPFHTANR 199
S N + ++ + D G+ W + + +P S + +Q P A+
Sbjct: 199 HRSRNSKLNDYSTFDPRAIALWYLSDDEGQIWHEAETWWAMPLASKTGLQEPGVVERADG 258
Query: 200 TLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYNTV 258
T VL + G S + G TWS T+L +P S +L LL +N
Sbjct: 259 T--VLSWARTDAGAQYGFASANSGATWSAPIATELKSPVSPASIKRLPGSAELLAVFNDH 316
Query: 259 S---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
S R L A+S D G +W LE + A + Y A I ++ +V Y
Sbjct: 317 SGQFPFPKGKRTPLVAAISPDGGRTWKKRKILENDPAGSYCYTA-IHFTENAVLFAYLDF 375
Query: 310 RT 311
RT
Sbjct: 376 RT 377
>gi|410099125|ref|ZP_11294098.1| hypothetical protein HMPREF1076_03276 [Parabacteroides goldsteinii
CL02T12C30]
gi|409219606|gb|EKN12567.1| hypothetical protein HMPREF1076_03276 [Parabacteroides goldsteinii
CL02T12C30]
Length = 706
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 50/236 (21%), Positives = 99/236 (41%), Gaps = 26/236 (11%)
Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG-----------PSKNKPILLE--NGL- 146
G E+ G + RS++KG TW E +PP + P+ N+ + E NG
Sbjct: 466 GGEIAFLGGYILRSFDKGKTW-EDPIIPPTLDSEIYISATGEPLPTYNRGAMCEGKNGKI 524
Query: 147 --LLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVL 204
++ G + + V+ D G +W +Y + ++S++ + + T +
Sbjct: 525 YWIVAGHDPAPLGKTSNHLLVSEDKGETW-QYSGLVATDDSVAFNEASVYETPKGDIVGF 583
Query: 205 MRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR--GV 262
+R+ C++ S DGG T+ + +P + ++L D R+LL Y + G+
Sbjct: 584 LRT-TLYDHACIARSTDGGKTFEWKSMQFQGHP---LAALRLPDNRVLLTYGYRHKPFGI 639
Query: 263 LKVALSKDDGDSW--HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
L+ + D + + ++ + YP I D V + Y YN+ +++
Sbjct: 640 RARILNAECTDYAISEEFILRDDGDGPDLGYPWPIMLEDNRVLVVYYYNKNGHRNI 695
>gi|420414952|ref|ZP_14914068.1| hypothetical protein HPNQ4053_0541 [Helicobacter pylori NQ4053]
gi|393033919|gb|EJB34980.1| hypothetical protein HPNQ4053_0541 [Helicobacter pylori NQ4053]
Length = 372
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 76/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)
Query: 26 SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
S HAS ++ + + L AYF G+ EGA DVKI F K RW I+ E+ +
Sbjct: 62 SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121
Query: 79 ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
+ NP+LF L + +LLF +G + W+ + + ++ + L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALESVRFKFARKLSLSP 179
Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
+ +NKP+ +G + E + ++ D + R+ + PN
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234
Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
+QP PF + V+ + M E+C W T L N + ++ +
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290
Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
L L+ + +S ++ LSK ++ +S+ L++ A E SYP+ + +
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347
Query: 304 ITYTYNRTQIKHV 316
I YT+NR+ IKH+
Sbjct: 348 IVYTHNRSHIKHI 360
>gi|421882034|ref|ZP_16313327.1| predicted neuraminidase (sialidase) [Helicobacter bizzozeronii
CCUG 35545]
gi|375315749|emb|CCF81323.1| predicted neuraminidase (sialidase) [Helicobacter bizzozeronii
CCUG 35545]
Length = 352
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRW 67
KS HA+T+ +D + + AYFGGS EG DVKI+ L K W
Sbjct: 53 KSMHAATLSVLDTDYLMGAYFGGSAEGVGDVKIYAHLYNLKTQEW 97
>gi|326773741|ref|ZP_08233024.1| sialidase A [Actinomyces viscosus C505]
gi|326636971|gb|EGE37874.1| sialidase A [Actinomyces viscosus C505]
Length = 771
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 95/247 (38%), Gaps = 48/247 (19%)
Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
S + G+TWS L P + LG I L NG + L ++ ++W +
Sbjct: 510 SDDDGLTWSRPRDLNPELKQQWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNDQNWLAM 569
Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
+ + + D G +W+ + P +E+ S+ + + L + M
Sbjct: 570 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERQDGVLVLFM 629
Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
R+ + GRV +SES D G TW A P PN+ G D + + L+L
Sbjct: 630 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 689
Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
+ RG + LS D G SW + T + Y + DG + I + N Q
Sbjct: 690 PF----RGCGVIRLSLDGGRSWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 741
Query: 314 KHVVLLP 320
+ LP
Sbjct: 742 LYFSRLP 748
>gi|257484989|ref|ZP_05639030.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. tabaci str. ATCC 11528]
Length = 113
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 4/79 (5%)
Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD---GRWQSPIIADEEPNV 79
P+ HA+ + E+ G L +F G+ EG D+ + L + +D G W P E+
Sbjct: 15 PYAQNHAANMHELADGTLLCTWFAGTQEGMADIFVLL-SRRDPATGVWSEPQKMSEDATR 73
Query: 80 PMWNPVLFKLPSNGLLLFY 98
NP+LF+ P L L +
Sbjct: 74 SEQNPILFQAPGGPLWLIW 92
>gi|329947419|ref|ZP_08294623.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 170 str.
F0386]
gi|328525169|gb|EGF52220.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 170 str.
F0386]
Length = 724
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 83/211 (39%), Gaps = 44/211 (20%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
+ S + GVTWS+ L P + LG I L +G + L ++ +W
Sbjct: 460 IAHSDDDGVTWSKPRDLNPQLKQPWMRFLGTCPGNGIALRHGPHAGRLVVPLYFNNDRNW 519
Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
+ + + + D G +WR + P +E+ S+ + + TL
Sbjct: 520 LAMCATVAYSDDHGETWRLSHGPNEGRETSDGQLDPQTFVDETWSLHEATVVERQDGTLL 579
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAK-----PTQLLNPNS-------GIDGVKLKDGR 250
+ MR+ + GRV +SES D G TW + P PN+ D + +
Sbjct: 580 LFMRNQHPHGRVAVSESHDAGQTWGTVRFDEDLPEIWCQPNAIRLPSPEDEDRIVFANAS 639
Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTL 281
L+L + RG + LS D G +W + T
Sbjct: 640 LMLPF----RGCGVIRLSLDGGRTWKRSRTF 666
>gi|346970813|gb|EGY14265.1| oxidoreductase [Verticillium dahliae VdLs.17]
Length = 337
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 57/134 (42%), Gaps = 15/134 (11%)
Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
TVD G +W GP P ++ + V +A + + +L R+ ++E DGG +
Sbjct: 51 TVDGGATWTSVGPRLAPEDAALQFRDVQAFSATKAV-ILSIGEGTYSRIYVTE--DGGTS 107
Query: 226 WSYA----KPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWH--DAL 279
W +PT N V +D LA + G ++ +KD G SW DA
Sbjct: 108 WDLVFTNEEPTAFYNC------VDFEDQERGLAVSDPVDGKFRLVETKDGGRSWGIVDAS 161
Query: 280 TLEENLAMEFSYPA 293
+ LA EF + A
Sbjct: 162 GMAPALAGEFGFSA 175
>gi|420243216|ref|ZP_14747166.1| BNR/Asp-box repeat protein [Rhizobium sp. CF080]
gi|398063066|gb|EJL54823.1| BNR/Asp-box repeat protein [Rhizobium sp. CF080]
Length = 370
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 46/100 (46%), Gaps = 14/100 (14%)
Query: 219 SCDGGLTWSY-AKPTQLLNPNSGIDGV-----KLKDGRLLL--AYNTVSRGVLKVALSKD 270
S D G W +P PN+G G +L DGRL L Y G L+V +S D
Sbjct: 258 SSDEGKNWELIGRPV----PNTGYAGNPPTINRLNDGRLALVFGYRDAPYG-LRVRVSSD 312
Query: 271 DGDSWHDALTLEENLAM-EFSYPAVIQASDGSVHITYTYN 309
+G +W D + + ++ M + YP + DG + Y YN
Sbjct: 313 EGKTWSDDIIIRDDGGMSDLGYPRTVVRPDGKLLSVYYYN 352
>gi|149179051|ref|ZP_01857624.1| probable sialidase [Planctomyces maris DSM 8797]
gi|148842091|gb|EDL56481.1| probable sialidase [Planctomyces maris DSM 8797]
Length = 358
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 100/229 (43%), Gaps = 31/229 (13%)
Query: 91 SNGLLLFYKIGQ------EVQKWSGCM-KRSYNKGVTWSEREQLPPGILGPSKNK---PI 140
+NG LL + G E G M K+S ++G TWSE L P P+K + P
Sbjct: 28 NNGDLLILQDGSILFCYTEYGDHGGIMAKKSVDQGKTWSEPWTLVPQPKLPAKGRYVHPS 87
Query: 141 LL--ENGLLLCGSSVESWNSWGSWMEV----TVDAGRSWRKYGPI-------YIPNESLS 187
LL +NG +L + + + + V + D G++W + P+ + N+ L
Sbjct: 88 LLRIKNGEILLAYNYSTHPAKPYYATVLYRRSADEGQTWTEQLPVSAYPGYNLVHNDKLL 147
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMS-ESCDGGLTWSYAKPTQLLNPNSGI----- 241
++ + + S + G V ++ S D G +W +K T L I
Sbjct: 148 SLRDGRIIAVGENKKYMPDSNDHNGYVGLTFYSDDQGHSWHSSKNTVDLYGTRKIEVQEA 207
Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
D V+LKDGRL++ + S G A S D G++W + L +L M ++
Sbjct: 208 DAVELKDGRLMMFARSYS-GYPVRAYSTDRGETWSEG-KLIRDLKMPYA 254
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 64/283 (22%), Positives = 110/283 (38%), Gaps = 49/283 (17%)
Query: 67 WQSPIIADEEPNVP----MWNPVLFKLPSNGLLLFYKIGQEVQK---WSGCMKRSYNKGV 119
W P +P +P +P L ++ + +LL Y K + +RS ++G
Sbjct: 65 WSEPWTLVPQPKLPAKGRYVHPSLLRIKNGEILLAYNYSTHPAKPYYATVLYRRSADEGQ 124
Query: 120 TWSEREQLP----PGILGPSKNKPILLENGLLLCGSSVESW----NSWGSWMEVTV---D 168
TW+E QLP PG +K + L +G ++ + + N ++ +T D
Sbjct: 125 TWTE--QLPVSAYPGYNLVHNDKLLSLRDGRIIAVGENKKYMPDSNDHNGYVGLTFYSDD 182
Query: 169 AGRSWRK-------YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
G SW YG + V + + L + RS++G + S D
Sbjct: 183 QGHSWHSSKNTVDLYG-----TRKIEVQEADAVELKDGRLMMFARSYSGY--PVRAYSTD 235
Query: 222 GGLTWSYAKPTQLLN-PNSGIDGVKL--KDGRLLLAY----NTVS--------RGVLKVA 266
G TWS K + L P +G+ V+ G LL + +T+ R L A
Sbjct: 236 RGETWSEGKLIRDLKMPYAGLPTVRRIPSTGDLLFIWVSEKSTIEENGRKLPMRSALTTA 295
Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
+S+D+G ++ + ++ +F Y + D I Y N
Sbjct: 296 ISRDEGKTFLHQRNIVQDPTNDFGYQCIEFIDDDLALIAYHTN 338
>gi|320534602|ref|ZP_08035050.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
F0337]
gi|320133178|gb|EFW25678.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
F0337]
Length = 760
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 95/250 (38%), Gaps = 48/250 (19%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
+ S + GVTWS L P + LG I L NG + L ++ +W
Sbjct: 499 IAHSDDDGVTWSTPRDLNPQVKQPWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNERNW 558
Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
+ + + + D G +W+ P +E+ S+ + + L
Sbjct: 559 LAMCATVAYSDDHGETWQLGHGPNEGRQTPEGALDPQTFVDETWSLHEAAVVERQDGVLL 618
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-----------YAKPTQLLNPN-SGIDGVKLKDGR 250
+ MR+ + GRV +SES D G TW + +P + P+ G D + +
Sbjct: 619 LFMRNQHPRGRVAVSESHDAGQTWGPIRFDEELPEIWCQPNAISLPSPEGEDRIVFANAS 678
Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
L+L + RG + LS D G +W + T + Y + DG + I + N
Sbjct: 679 LMLPF----RGCGVIRLSLDGGRTWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NE 730
Query: 311 TQIKHVVLLP 320
Q + LP
Sbjct: 731 CQGLYFSRLP 740
>gi|325068374|ref|ZP_08127047.1| exo-alpha-sialidase [Actinomyces oris K20]
Length = 716
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 95/247 (38%), Gaps = 48/247 (19%)
Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
S + G+TWS L P + LG I L NG + L ++ ++W +
Sbjct: 455 SDDDGLTWSRPRDLNPELKQPWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNDQNWLAM 514
Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
+ + + D G +W+ + P +E+ S+ + + L + M
Sbjct: 515 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERHDGVLVLFM 574
Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
R+ + GRV +SES D G TW A P PN+ G D + + L+L
Sbjct: 575 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 634
Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
+ RG + LS D G +W + T + Y + DG + I + N Q
Sbjct: 635 PF----RGCGVIRLSLDGGRTWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 686
Query: 314 KHVVLLP 320
+ LP
Sbjct: 687 LYFSRLP 693
>gi|149178198|ref|ZP_01856792.1| Sialidase (Precursor) [Planctomyces maris DSM 8797]
gi|148842980|gb|EDL57349.1| Sialidase (Precursor) [Planctomyces maris DSM 8797]
Length = 385
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 68/145 (46%), Gaps = 16/145 (11%)
Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
G++W PI +V++ + L + MRS++G R ++ S DGG+TWS
Sbjct: 227 GKTWELSEPIGEKTNECAVVE-----LTDGRLLMNMRSYHGENRRAIAFSSDGGMTWSDV 281
Query: 230 KPTQ-LLNP--NSGIDGVKLKD----GRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTL 281
Q L+ P + + V+ D G++L + + R + V LS+DDG +W + +
Sbjct: 282 SLDQALIEPVCQASLIRVRFPDVEKPGQILFSNPASKKREKMVVRLSEDDGRTWTASRLV 341
Query: 282 EENLAMEFSYPAVIQASDGSVHITY 306
A +Y ++ DG V + Y
Sbjct: 342 TPGSA---AYSSLAGIGDGKVGLLY 363
>gi|343082891|ref|YP_004772186.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351425|gb|AEL23955.1| hypothetical protein Cycma_0172 [Cyclobacterium marinum DSM 745]
Length = 395
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 42/221 (19%), Positives = 87/221 (39%), Gaps = 22/221 (9%)
Query: 109 GCMKRSYNKGVTWS----------EREQLPPGILGPSKNKPILLENGL-----LLCGSSV 153
G + +S + G TW E P G P+ N+ + E ++ S
Sbjct: 163 GYLVKSVDGGETWKGPIYPPSIAPEVNHTPMGNKVPAYNRGAMYEGKSGRIFWIVAASDT 222
Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
E +++ V+ D G +W + + +E + + + T + +R+ +
Sbjct: 223 EKRGKTSNYLMVSDDKGNTWEHLSTVAVDDEVI-FNEASAYETPKGDIVAFIRTAKYDDQ 281
Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR--GVLKVALSKD- 270
++ S DGG T+ K + + ++L D R+ L Y + G+ L+ +
Sbjct: 282 AVIARSKDGGKTFD--KWESMGFQGHPLQALRLPDNRVFLVYGYRHKPYGIRARILNAEC 339
Query: 271 -DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
D + + + E+ + + YP + D V +TY +N+
Sbjct: 340 TDFATAEEIIIREDGGSTDIGYPWAVNMGDDKVLVTYYFNK 380
>gi|422406819|ref|ZP_16483838.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. glycinea str. race 4]
gi|330882050|gb|EGH16199.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
glycinea str. race 4]
Length = 203
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180
>gi|422605480|ref|ZP_16677493.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. mori str. 301020]
gi|330889135|gb|EGH21796.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
mori str. 301020]
Length = 216
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180
>gi|257482882|ref|ZP_05636923.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
syringae pv. tabaci str. ATCC 11528]
Length = 271
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
H+S + + G + +F GS EGA DV+I F +G W + +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126
Query: 80 ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180
>gi|87310347|ref|ZP_01092477.1| probable sialidase [Blastopirellula marina DSM 3645]
gi|87286846|gb|EAQ78750.1| probable sialidase [Blastopirellula marina DSM 3645]
Length = 649
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 68/159 (42%), Gaps = 13/159 (8%)
Query: 147 LLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR 206
L + V+ N + ++ D G++WR G + ++P RV+M
Sbjct: 458 LASTADVQRVNHFVCQCCLSDDGGKTWR-LGKESVDYPKRGAMEPEVVEIDGN--RVMMI 514
Query: 207 SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDGRLLLAYNTV------- 258
N +G + + S DGG TWS + + P + ++ G LLL +N
Sbjct: 515 FRNQLGHIAHAMSEDGGETWSEPQSLGVPGPEAPATIRRIPSTGDLLLIWNDNVEKGRHH 574
Query: 259 --SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
SR L +A+S D+G +W +E E++Y +V+
Sbjct: 575 GGSRNPLSLAISSDEGKTWRLRRNVEAEKGHEYAYTSVL 613
>gi|432893410|ref|ZP_20105422.1| hypothetical protein A31K_02555 [Escherichia coli KTE165]
gi|431424390|gb|ELH06486.1| hypothetical protein A31K_02555 [Escherichia coli KTE165]
Length = 129
Score = 41.2 bits (95), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 2/73 (2%)
Query: 28 HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
HA+ I+ + G + +F G+ EG D+ +W G +W + + NPV
Sbjct: 33 HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92
Query: 86 LFKLPSNGLLLFY 98
LF P N L L +
Sbjct: 93 LFLAPDNVLWLLW 105
>gi|336411775|ref|ZP_08592235.1| hypothetical protein HMPREF1018_04253 [Bacteroides sp. 2_1_56FAA]
gi|335940119|gb|EGN01987.1| hypothetical protein HMPREF1018_04253 [Bacteroides sp. 2_1_56FAA]
Length = 567
Score = 40.8 bits (94), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|423283708|ref|ZP_17262592.1| hypothetical protein HMPREF1204_02130 [Bacteroides fragilis HMW
615]
gi|404580746|gb|EKA85454.1| hypothetical protein HMPREF1204_02130 [Bacteroides fragilis HMW
615]
Length = 567
Score = 40.8 bits (94), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|412992355|emb|CCO20068.1| hypothetical protein PTSG_00864 [Bathycoccus prasinos]
Length = 518
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 69/173 (39%), Gaps = 42/173 (24%)
Query: 165 VTVDAGRSWRKYGPIYIPNESL---SVIQPVPFHTANRTL---------------RVLMR 206
++ D G++W+ G I +PN L SV++ T N R+ M
Sbjct: 286 ISKDKGKTWKAQGSITLPNTWLIEQSVVELTSKATNNGGGGGGDENDHNNNISVNRIAMF 345
Query: 207 SFNGIGRVCMSESCDGGLTWSY-AKPTQLL-NPNSGI-------------DG-------V 244
G V +S D G TW + AKP L NPN+ + DG
Sbjct: 346 CRTAAGHVFRIDSNDLGATWEHKAKPVLALPNPNAKLTMKTFGPSGRYRADGGGKNWIVA 405
Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENL--AMEFSYPAVI 295
+ D + L SR L++ +S D+G SW ++EE + + YP +I
Sbjct: 406 VMNDHKELPQPYRRSRSKLRLIVSHDNGKSWTRLASIEEGIKPGLRHHYPTLI 458
>gi|378766118|ref|YP_005194579.1| Signal peptidase I [Pantoea ananatis LMG 5342]
gi|386016767|ref|YP_005935061.1| signal peptidase I LepB [Pantoea ananatis AJ13355]
gi|386078352|ref|YP_005991877.1| signal peptidase I LepB [Pantoea ananatis PA13]
gi|327394843|dbj|BAK12265.1| signal peptidase I LepB [Pantoea ananatis AJ13355]
gi|354987533|gb|AER31657.1| signal peptidase I LepB [Pantoea ananatis PA13]
gi|365185592|emb|CCF08542.1| Signal peptidase I [Pantoea ananatis LMG 5342]
Length = 324
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 12/97 (12%)
Query: 214 VCMSESCDGG--------LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKV 265
+ ++ +CD G +T+S +P++ + SG DG + +G L N RG L++
Sbjct: 165 LTINPACDNGKPCSKALPVTYSNIEPSKFVQTFSGFDGNEAGNGFYQLPLNDTMRGGLRL 224
Query: 266 ALSKDD-GDSWHDALTLEE---NLAMEFSYPAVIQAS 298
A K+ GD HD L + E +M + P QAS
Sbjct: 225 ATRKETLGDVTHDILLVTEAQSQASMYYQQPGQPQAS 261
>gi|289674394|ref|ZP_06495284.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
syringae pv. syringae FF5]
Length = 178
Score = 40.8 bits (94), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 47/110 (42%), Gaps = 15/110 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A E
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWS 122
+ NPV+ P N L LFY + V W+G S + G +WS
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWS 175
>gi|60682407|ref|YP_212551.1| hypothetical protein BF2937 [Bacteroides fragilis NCTC 9343]
gi|60493841|emb|CAH08632.1| putative exported protein [Bacteroides fragilis NCTC 9343]
Length = 567
Score = 40.8 bits (94), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|53714386|ref|YP_100378.1| hypothetical protein BF3099 [Bacteroides fragilis YCH46]
gi|265766163|ref|ZP_06094204.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|423260884|ref|ZP_17241786.1| hypothetical protein HMPREF1055_04063 [Bacteroides fragilis
CL07T00C01]
gi|423267019|ref|ZP_17246001.1| hypothetical protein HMPREF1056_03688 [Bacteroides fragilis
CL07T12C05]
gi|52217251|dbj|BAD49844.1| hypothetical protein [Bacteroides fragilis YCH46]
gi|263253831|gb|EEZ25296.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|387774645|gb|EIK36755.1| hypothetical protein HMPREF1055_04063 [Bacteroides fragilis
CL07T00C01]
gi|392697722|gb|EIY90905.1| hypothetical protein HMPREF1056_03688 [Bacteroides fragilis
CL07T12C05]
Length = 567
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|237804114|ref|ZP_04591699.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
syringae pv. oryzae str. 1_6]
gi|331026095|gb|EGI06151.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
oryzae str. 1_6]
Length = 115
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%)
Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDD 271
RV S + DGG TWS P NPNS + V + LL+A N + G K++L D
Sbjct: 3 RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAVGVPGKGLLVALNDLREGRFKLSLYGTD 61
>gi|66361505|pdb|2BER|A Chain A, Y370g Active Site Mutant Of The Sialidase From
Micromonospora Viridifaciens In Complex With Beta-Neu5ac
(Sialic Acid)
Length = 601
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 86/209 (41%), Gaps = 36/209 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313
Query: 229 AKPTQLLNPN--SGIDGVKLKDGRLLLAY 255
+K + P SG L DG L Y
Sbjct: 314 SK---VFQPGSMSGSTLTALPDGTYGLLY 339
>gi|255532110|ref|YP_003092482.1| hypothetical protein Phep_2216 [Pedobacter heparinus DSM 2366]
gi|255345094|gb|ACU04420.1| hypothetical protein Phep_2216 [Pedobacter heparinus DSM 2366]
Length = 652
Score = 40.4 bits (93), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 115/283 (40%), Gaps = 58/283 (20%)
Query: 84 PVLFKLPSNGLLLFY-----------KIGQE---VQKWSGCMKRSYNKGVTWSEREQLPP 129
P L K P+ LLLF ++ Q V MKRS + G TWS + +
Sbjct: 305 PALAKTPNGELLLFAEGRSTPTSAKERVAQNTMPVGNTDIIMKRSADGGNTWSNQLTVLA 364
Query: 130 G----------ILGPSKNKPILLE----NGLLLCGSSVESW--NSWGSWMEVTVDAGRSW 173
G + + N I+L+ G S+ ++ NS + + D G SW
Sbjct: 365 GNGSNLTYCFPQVVTTDNGKIILQYSTIGGAFAANSNAFTYDVNSQRIYQIESNDNGLSW 424
Query: 174 RKYGPIYIP---NESLSVIQPVPFH-----TANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
P+ I N + IQ H +A R++M V M+ S DGG+T
Sbjct: 425 TI--PVEITAFKNTAAGYIQGGAVHGIALKSATYNKRLVMPLTYSTNVVRMALSDDGGIT 482
Query: 226 WSYAKP---TQLLNPNSGIDGVKLKDGRLLLA---YNTVSRGVLKVALSKDDGDSWHDAL 279
W K ++L V+L DGRL++ NT + L V+ S D GD+W A
Sbjct: 483 WRLGKAISGSRLTKATV----VELSDGRLMMVAGHTNTSPKNKL-VSYSSDGGDNWTAAA 537
Query: 280 TLEENLAM-EFSY---PAVIQASDGSVHITYTYNR---TQIKH 315
+ ++ ++ + +++ DG + + NR T++K+
Sbjct: 538 NISADVKTGDYGHLFGGIILKGKDGEILLVNPINRETDTEVKN 580
>gi|423271118|ref|ZP_17250089.1| hypothetical protein HMPREF1079_03171 [Bacteroides fragilis
CL05T00C42]
gi|423274942|ref|ZP_17253888.1| hypothetical protein HMPREF1080_02541 [Bacteroides fragilis
CL05T12C13]
gi|392699042|gb|EIY92224.1| hypothetical protein HMPREF1079_03171 [Bacteroides fragilis
CL05T00C42]
gi|392704221|gb|EIY97360.1| hypothetical protein HMPREF1080_02541 [Bacteroides fragilis
CL05T12C13]
Length = 567
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGQLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|171911563|ref|ZP_02927033.1| glycoside hydrolase family 33, candidate sialidase
[Verrucomicrobium spinosum DSM 4136]
Length = 391
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 58/237 (24%), Positives = 95/237 (40%), Gaps = 43/237 (18%)
Query: 76 EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQ----KWSGCMKRSYNKGVTWSEREQLPPGI 131
+PN + + L +LP +LF G + + ++G RS ++G TW+ E G
Sbjct: 55 KPNSMVCDTTLRELPDGSWILFILAGGDTEPSPLNYTGVT-RSKDQGRTWTPLEAFDVGF 113
Query: 132 LGPSKNKPI-------LLENG--LLLCGSSVESW-NSWGSWMEVTVDAGRSWRKYGPIYI 181
P + K I ++ +G L + + W N W SW + D+ ++W K P +
Sbjct: 114 --PREGKTIGQGPTELMIRDGRATLYFSTHSKHWANDWRSWFLTSDDSFKTWSK--PQEV 169
Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIG-----------------RVCMSESCDGGL 224
P L + H + R+L+ + IG R + S DGG
Sbjct: 170 PGR-LKERTFIRNHIITKDGRILLPFQHYIGPEAEQDKAPLDRAFTNPRNGVIMSSDGGK 228
Query: 225 TWSYAKPTQLLNPNSGI-----DGVKLKDGRLLLAYNTVSR-GVLKVALSKDDGDSW 275
TW+ +L + + +L DGR+ + GVL A SKD G +W
Sbjct: 229 TWTEHGNIRLTTDDKYFGWAENNLAELSDGRIAMVIRGDKLGGVLYYAESKDGGRTW 285
>gi|86141536|ref|ZP_01060082.1| hypothetical protein MED217_05942 [Leeuwenhoekiella blandensis
MED217]
gi|85832095|gb|EAQ50550.1| hypothetical protein MED217_05942 [Leeuwenhoekiella blandensis
MED217]
Length = 595
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 28/126 (22%)
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
L+ ++ G++ + S D G TWS Y P D ++LK+G +LLAY
Sbjct: 90 ALVVTYESHGKILVKRSTDLGQTWSSLIVVADYKNGVSNTTP----DLIELKNGTILLAY 145
Query: 256 -----NTVS---RGVLKVALSKDDGDSWHD------ALTLEENLAMEFSYPAVIQASDGS 301
NT S +KV S D G SWH+ A TL EN A E P +Q G
Sbjct: 146 NPRPGNTASVDEHFSIKVIQSSDGGRSWHNDQTVYTADTLFENGAWE---PTFLQLPSGE 202
Query: 302 VHITYT 307
+ + ++
Sbjct: 203 IQLYFS 208
>gi|422641291|ref|ZP_16704715.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
syringae Cit 7]
gi|330953679|gb|EGH53939.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae Cit
7]
Length = 206
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 49/115 (42%), Gaps = 15/115 (13%)
Query: 27 CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNVP--- 80
H+S + + G + +F GS EGA DV+I F G W ++A +
Sbjct: 67 VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRDSTQSGTG 126
Query: 81 -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
+ NPV+ P N L LFY + V W+G S + G +WS QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNGMVSSDMGSSWSPPWQL 180
>gi|343086452|ref|YP_004775747.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
gi|342354986|gb|AEL27516.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
marinum DSM 745]
Length = 385
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 74/175 (42%), Gaps = 22/175 (12%)
Query: 159 WGSWMEVTVDAGRSWRKYGPIYIP-NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
+G+ + + D G +W G I NES Q + TL + MRS+ G R +S
Sbjct: 214 YGAHVIFSDDHGVTWELGGSIRPKMNES----QLIEMADGKGTLLMNMRSYYGKNRRALS 269
Query: 218 ESCDGGLTWSY-AKPTQLLNPNSGIDGVKL-------KDGRLLLAYNTVSRGVLKVALSK 269
S DGG TWS + T L+ P ++ KD L ++ R + + LSK
Sbjct: 270 VSKDGGETWSDPVEATSLIEPVCQAAFIRYSWPDEKGKDLVLFSNPSSTKRENMTIKLSK 329
Query: 270 DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT------YNRTQIKHVVL 318
D G +W + L + + +Y A+ +G + Y Y + + H+ L
Sbjct: 330 DSGKTWIASKVLNQGPS---AYSALAALPNGDILCLYEKGEKNPYEKLVLSHLSL 381
>gi|440717064|ref|ZP_20897563.1| Exo-alpha-sialidase [Rhodopirellula baltica SWK14]
gi|436437890|gb|ELP31484.1| Exo-alpha-sialidase [Rhodopirellula baltica SWK14]
Length = 409
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 107/259 (41%), Gaps = 35/259 (13%)
Query: 91 SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
+NG LL + G++ S MKRS + G TWS+ L G L + + +LL++G
Sbjct: 74 NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSFNNPQAVLLQSGR 133
Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
+L S NS+ ++++ + D G++W R ++ +
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKRERKVTSVAAG 193
Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
P R++M G RV + S D G TW+ P + +
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253
Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVI---Q 296
V+L DGR++L T +G KVA+S + G +W L +++ L +++ +
Sbjct: 254 MVELSDGRVMLNSRTQGKGSTKHRKVAVSANGGQTWS-QLQIDQTLIEPTCQASILRYSR 312
Query: 297 ASDGSVHITYTYNRTQIKH 315
S+G I + +Q K
Sbjct: 313 PSEGKSRIIFANPASQTKR 331
>gi|365828125|ref|ZP_09369956.1| hypothetical protein HMPREF0975_01739 [Actinomyces sp. oral taxon
849 str. F0330]
gi|365264307|gb|EHM94117.1| hypothetical protein HMPREF0975_01739 [Actinomyces sp. oral taxon
849 str. F0330]
Length = 770
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/234 (22%), Positives = 90/234 (38%), Gaps = 47/234 (20%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
+ S + G TWS+ L P + LG I L +G + L ++ +W
Sbjct: 506 IAHSDDDGATWSKPRDLNPQLKQPWMRFLGTCPGNGIALRHGPRPGRLVIPLYFNNDGNW 565
Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
+ + + + D G +W P +E+ S+ + + TL
Sbjct: 566 LAMCATVAYSDDHGETWNLGHGPNDGRRTPDGALDPRTFVDETWSLHEAAVVERRDGTLV 625
Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-----------YAKPTQLLNPNS-GIDGVKLKDGR 250
+ MR+ + GRV +SES D G TW + +P + P++ G D + +
Sbjct: 626 LYMRNQHPCGRVAVSESHDAGQTWGPIDFDEDLPEIWCQPNAICLPSAEGEDRIVFANAS 685
Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
L+L + RG + LS D G +W + T + Y + DG + I
Sbjct: 686 LMLPF----RGCGVIRLSLDGGRTWRRSRTFNPG---HYVYQCMCALPDGRIGI 732
>gi|289807542|ref|ZP_06538171.1| hypothetical protein Salmonellaentericaenterica_25287 [Salmonella
enterica subsp. enterica serovar Typhi str. AG3]
Length = 55
Score = 39.7 bits (91), Expect = 1.7, Method: Composition-based stats.
Identities = 18/49 (36%), Positives = 25/49 (51%)
Query: 9 GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKI 57
LV + P + CHAST+V + G + A+F G CEG+ D I
Sbjct: 4 ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAI 52
>gi|585539|sp|Q02834.1|NANH_MICVI RecName: Full=Sialidase; AltName: Full=Neuraminidase; Flags:
Precursor
gi|216784|dbj|BAA00852.1| neuraminidase [Micromonospora viridifaciens]
Length = 647
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 189 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 246
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 247 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 299
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 300 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 359
Query: 229 AK 230
+K
Sbjct: 360 SK 361
>gi|317501714|ref|ZP_07959904.1| hypothetical protein HMPREF1026_01848 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336439652|ref|ZP_08619260.1| hypothetical protein HMPREF0990_01654 [Lachnospiraceae bacterium
1_1_57FAA]
gi|316896882|gb|EFV18963.1| hypothetical protein HMPREF1026_01848 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336015616|gb|EGN45427.1| hypothetical protein HMPREF0990_01654 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1310
Score = 39.7 bits (91), Expect = 1.9, Method: Composition-based stats.
Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 13/97 (13%)
Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGI------DGVKLKDGRLLLAYNTVSRGVLKVA--L 267
+++S D G TWS + N + + G+ GRL+ + GV + +
Sbjct: 267 ITKSTDNGATWSAPQLVDAKNESEQVFLIGPGRGITTSTGRLIFPCYQYTSGVQRTSTIY 326
Query: 268 SKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
S DDG +WH T+ N S AVI +DG +++
Sbjct: 327 SDDDGTTWHRGATVSGN-----SSEAVITEADGRLYL 358
>gi|157831017|pdb|1EUT|A Chain A, Sialidase, Large 68kd Form, Complexed With Galactose
gi|157831018|pdb|1EUU|A Chain A, Sialidase Or Neuraminidase, Large 68kd Form
Length = 605
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 147 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 204
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 205 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 257
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 258 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 317
Query: 229 AK 230
+K
Sbjct: 318 SK 319
>gi|82407379|pdb|1WCQ|A Chain A, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
Sialidase To Phenylalanine.
gi|82407380|pdb|1WCQ|B Chain B, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
Sialidase To Phenylalanine.
gi|82407381|pdb|1WCQ|C Chain C, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
Sialidase To Phenylalanine
Length = 601
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313
Query: 229 AK 230
+K
Sbjct: 314 SK 315
>gi|55670665|pdb|1W8N|A Chain A, Contribution Of The Active Site Aspartic Acid To Catalysis
In The Bacterial Neuraminidase From Micromonospora
Viridifaciens.
gi|56554377|pdb|1W8O|A Chain A, Contribution Of The Active Site Aspartic Acid To Catalysis
In The Bacterial Neuraminidase From Micromonospora
Viridifaciens
Length = 601
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313
Query: 229 AK 230
+K
Sbjct: 314 SK 315
>gi|320531595|ref|ZP_08032538.1| BNR/Asp-box repeat protein, partial [Actinomyces sp. oral taxon 171
str. F0337]
gi|320136172|gb|EFW28177.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
F0337]
Length = 665
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D GR+W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 442 DHGRTWQAGQPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 496
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
P Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 497 EPVPDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 554
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 555 GASWVASKVFHEPF---VGYTTIAVQSDGSI 582
>gi|311748209|ref|ZP_07721994.1| putative neuramidase [Algoriphagus sp. PR1]
gi|126576699|gb|EAZ80947.1| putative neuramidase [Algoriphagus sp. PR1]
Length = 388
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 65/140 (46%), Gaps = 10/140 (7%)
Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFN 209
G ++++ + + T D G+++ PI IP + S+ +P +R + +
Sbjct: 207 GDPQDNFSEYQAHGFFTDDHGKTYGISEPISIPGSNESIAAELP---NDRMIMSIRNQKG 263
Query: 210 GIGRVCMSESCDGGLTWSYAK-PTQLLNPNSGIDGVKLKD--GRLLLAY----NTVSRGV 262
I + ++ S DGG TW + +L +P + L D GR +L + +T++R
Sbjct: 264 DIRQRILAYSSDGGDTWDSEEFEEELPDPVCQGSIISLTDSRGRFILVHSNNADTLNRNN 323
Query: 263 LKVALSKDDGDSWHDALTLE 282
L + S D+G +W + ++
Sbjct: 324 LSIKFSYDEGKTWPKTMLID 343
>gi|201067792|gb|ACH92894.1| sialidase [Actinomyces oris]
Length = 359
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 51/107 (47%), Gaps = 10/107 (9%)
Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
H + +R+ +G + S D G TW +PT +G+D V+L DG L
Sbjct: 193 HAGRLVQQYTIRTADGTVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247
Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+L + + S G KVA S D G +W + ++ ++NL +I+A
Sbjct: 248 MLNSRASDSSGFRKVARSTDGGQTWSEPVS-DQNLPDSVDNAQIIRA 293
>gi|157831015|pdb|1EUR|A Chain A, Sialidase
gi|157831016|pdb|1EUS|A Chain A, Sialidase Complexed With 2-Deoxy-2,3-Dehydro-N-
Acetylneuraminic Acid
Length = 365
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 147 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 204
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ +NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 205 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 257
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 258 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 317
Query: 229 AK 230
+K
Sbjct: 318 SK 319
>gi|421609699|ref|ZP_16050887.1| Exo-alpha-sialidase [Rhodopirellula baltica SH28]
gi|408499472|gb|EKK03943.1| Exo-alpha-sialidase [Rhodopirellula baltica SH28]
Length = 409
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 60/259 (23%), Positives = 108/259 (41%), Gaps = 35/259 (13%)
Query: 91 SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
+NG LL + G++ S MKRS + G TWS+ L G L + + ++L++G
Sbjct: 74 NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSLNNPQAVVLQSGR 133
Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
+L S NS+ ++++ + D G++W R + ++ +
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKREKKVTSVAAG 193
Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
P R++M G RV + S D G TW+ P + +
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253
Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ--- 296
V+L DGR++L T +G KVA+S + G +W L +++ L ++++
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQMDQTLIEPTCQASILRYSW 312
Query: 297 ASDGSVHITYTYNRTQIKH 315
S G I ++ +Q K
Sbjct: 313 PSKGKSRIIFSNPASQTKR 331
>gi|198273964|ref|ZP_03206496.1| hypothetical protein BACPLE_00100 [Bacteroides plebeius DSM 17135]
gi|198273042|gb|EDY97311.1| BNR/Asp-box repeat protein [Bacteroides plebeius DSM 17135]
Length = 364
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 85/211 (40%), Gaps = 22/211 (10%)
Query: 114 SYNKGVTWSEREQLP-PGILGPSKNKPILLENGLLLCGSSVESWNSWGS-----WMEVTV 167
S + G TWSE + L P ++ S + L N L G WN W ++
Sbjct: 131 SEDNGQTWSEIQSLKKPCVMAFSS--IVKLSNNDYL-GVYHRGWNDQDRAPLTLWSAISH 187
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANR-TLRVLMRSFNGIGRVCMSESCDGGLTW 226
D G +W I E S +P F++ ++ L + R N +G M S D G TW
Sbjct: 188 DGGLNWTPSVKI-AEVEGRSPCEPCVFYSPDKKQLVCIARENNRVGNSLMMFSNDEGKTW 246
Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-------TVSRGVLKVALSKD--DGDSWHD 277
S + T LKDGRLL+A+ T V V KD +G S
Sbjct: 247 SAMRETPWGVTGDRHVARYLKDGRLLIAFRDMAPDSPTKGHFVAWVGTYKDLIEGTSGQY 306
Query: 278 ALTLEENLA-MEFSYPAVIQASDGS-VHITY 306
+ L + A + YP + DG+ V ITY
Sbjct: 307 RIKLLHSYAGSDCGYPGLEVLPDGTIVAITY 337
>gi|358636892|dbj|BAL24189.1| hypothetical protein AZKH_1876 [Azoarcus sp. KH32C]
Length = 325
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 84/203 (41%), Gaps = 38/203 (18%)
Query: 88 KLPSNGLLLFYKIGQEVQKWS----GCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-L 142
K+P N L E W+ G + + + G TW+ + + S+++P+ +
Sbjct: 82 KVPVNATLTGVSFADERNGWAVGHWGVILHTEDGGETWTSQR------METSEDRPLFSV 135
Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
G++V W S + T D G++W K PN++ A+R L
Sbjct: 136 HFTDAQHGTAVGLW----SLLLTTTDGGKTWDKVSLPAPPNDT----------RADRNLM 181
Query: 203 VLMRSFNG----IG-RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
+ G +G R + S D G TW+Y +PT S G L DG LL+A
Sbjct: 182 HVFADRRGDLFVVGERGTVLHSADRGQTWAY-RPTGY--RGSFWSGTALADGTLLVAG-- 236
Query: 258 VSRGVLKVALSKDDGDSWHDALT 280
RG L S D G++WH +T
Sbjct: 237 -LRGSLY--RSSDGGETWHKVVT 256
>gi|83644164|ref|YP_432599.1| hypothetical protein HCH_01307 [Hahella chejuensis KCTC 2396]
gi|83632207|gb|ABC28174.1| hypothetical protein HCH_01307 [Hahella chejuensis KCTC 2396]
Length = 57
Score = 39.3 bits (90), Expect = 2.6, Method: Composition-based stats.
Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 281 LEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
LEE A+E+SYP + G +HI YT+ R IKHV
Sbjct: 10 LEE--AVEYSYPGAYISRSGDLHIAYTWRRKTIKHV 43
>gi|288963051|ref|YP_003453330.1| hypothetical protein AZL_f00260 [Azospirillum sp. B510]
gi|288915303|dbj|BAI76786.1| hypothetical protein AZL_f00260 [Azospirillum sp. B510]
Length = 382
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 49/221 (22%), Positives = 81/221 (36%), Gaps = 24/221 (10%)
Query: 107 WSGCMKRSYNKGVTWSEREQLP-----PGIL--------GPSKNKPILLENGLLLCGSSV 153
W G + S + G +W+ LP P I+ G + + + +G +L
Sbjct: 147 WGGYSRHSDDGGRSWTPHRDLPPLPGHPDIVPGRRPFHGGAVRGRAVEAADGTILQTGYT 206
Query: 154 E--SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH-TANRTLRVLMRSFNG 210
S ++ S + + D G SW I I E+ + H A+ L R+
Sbjct: 207 HHPSTGTYASHLFASTDRGESWGHRAVIAIDPEAKAGFCETALHLDADGVLHAFHRTTGL 266
Query: 211 IGRVCMSESCDGGLTWS-YAKPTQLLNPNSGIDGVKLKDGRLLLA--YNTVSRGVLKVAL 267
+ S S D G +W + + + +P D L DGRLLLA Y G+
Sbjct: 267 DDHLATSRSTDLGHSWEPWRRHAVVGHP---YDACPLPDGRLLLAGGYRHAPYGIRARLY 323
Query: 268 S--KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
+ D + + ++ A + YP DG + Y
Sbjct: 324 DPRRQQPDEGTELVLRDDGPAPDLGYPWAAVLPDGRAMVAY 364
>gi|75766432|pdb|2BZD|A Chain A, Galactose Recognition By The Carbohydrate-Binding Module
Of A Bacterial Sialidase.
gi|75766433|pdb|2BZD|B Chain B, Galactose Recognition By The Carbohydrate-Binding Module
Of A Bacterial Sialidase.
gi|75766434|pdb|2BZD|C Chain C, Galactose Recognition By The Carbohydrate-Binding Module
Of A Bacterial Sialidase
Length = 601
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 76/182 (41%), Gaps = 31/182 (17%)
Query: 71 IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
I AD P+ P W F G+ L Y G+ +Q+++ G + S + G
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200
Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
TW E + G+ NK + L +G +L +S +S S + V+ D G S YGP+
Sbjct: 201 TWRAGEAVGVGM---DANKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253
Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
I P + S+I+ P A ++ N + S+ SCD G TW
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313
Query: 229 AK 230
+K
Sbjct: 314 SK 315
>gi|149175555|ref|ZP_01854175.1| probable sialidase [Planctomyces maris DSM 8797]
gi|148845540|gb|EDL59883.1| probable sialidase [Planctomyces maris DSM 8797]
Length = 386
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 132/347 (38%), Gaps = 66/347 (19%)
Query: 20 NSAP-FKSCHA--STIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD------------ 64
N+ P KS HA ++ + G L Y G A D+K+ L +D
Sbjct: 14 NAKPHVKSVHAYFPSVASLPDGSLLSMYMLGEAFEAVDLKLHLSRSRDQGQTWEYQGSID 73
Query: 65 ----GRWQSPI--IADEEPNVPMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSY 115
GR S +A E + N V F P GL +G + + RS
Sbjct: 74 TSVTGRQTSTFGRLAVTETGELIANLVRFDRTDFPGEGLCNPQTLGMVPSEL--LIIRSL 131
Query: 116 NKGVTWSEREQLPPGILGPSKN--KPI-LLENGLLLCGSSVESWNSWGSWME-------- 164
++G TWSE + P + GP P+ +L +G + +S +W W +
Sbjct: 132 DQGKTWSEPTLVTPPLEGPEFEMCSPLTILNDGRWIWPTS--TWRDWNGALPNGNRMLAL 189
Query: 165 VTVDAGRSWRKYGPI-YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE----S 219
V+ D G SW +Y I + N +L + L + ++ + S
Sbjct: 190 VSRDQGSSWDEYLDIMHSENNNLIFWESKVVEYPEGRLLAVAWCYDEASSTDLPNHYAIS 249
Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSK-DDGDSW--- 275
DGG +WS TQ+ + L+DG LL Y + + L LS+ DD +W
Sbjct: 250 DDGGTSWSEPASTQI--QGQTLTPCLLEDGSLLNIYRRMDQPGLWACLSRLDDAGNWING 307
Query: 276 -------HDAL--------TLEENLA-MEFSYPAVIQASDGSVHITY 306
H+ L + +N A ++F P + + SD + +T+
Sbjct: 308 DQQPLWGHNQLEGITATGENMSDNFAVLKFGAPHITRLSDSELFVTF 354
>gi|424513260|emb|CCO66844.1| unknown protein [Bathycoccus prasinos]
Length = 847
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 63/156 (40%), Gaps = 25/156 (16%)
Query: 165 VTVDAGRSWRKYGPIYIPNESL-----SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
++ D G++W K G I P SL + I +P + + M G V S S
Sbjct: 487 ISSDNGKTWTKKGDIRHPKTSLLEGAIAEISALPQGSDEPQPALTMLFRTDCGCVFRSMS 546
Query: 220 CDGGLTWS--YAKPTQLLNPNSGIDGVKL-KDGRLLLAYNT---------------VSRG 261
D G TWS + P L N N+ +D L G L +A+N + R
Sbjct: 547 FDFGATWSDPHPSPGNLPNVNTKLDMTSLYPTGHLAMAFNNHRRWEHIGSKEHPCKLCRT 606
Query: 262 VLKVALSKDDGDSWHDALTLEENLA--MEFSYPAVI 295
L ALS + G+ W ++ + + YP+++
Sbjct: 607 HLHFALSSNGGEDWTTVAAFDDLVQPDVRIHYPSIL 642
>gi|383119129|ref|ZP_09939868.1| hypothetical protein BSHG_1859 [Bacteroides sp. 3_2_5]
gi|251946348|gb|EES86725.1| hypothetical protein BSHG_1859 [Bacteroides sp. 3_2_5]
Length = 567
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 66/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
+GRLLL + NT +R V VA+S DDG +W
Sbjct: 261 NGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|343522379|ref|ZP_08759345.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 175 str.
F0384]
gi|343401788|gb|EGV14294.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 175 str.
F0384]
Length = 771
Score = 38.5 bits (88), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 94/247 (38%), Gaps = 48/247 (19%)
Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
S + G+TWS L P + LG I L N + L ++ ++W +
Sbjct: 510 SDDDGLTWSRPRDLNPELKQPWMRFLGTCPGNGISLRNAPHAGRLIVPLYFNNDQNWLAM 569
Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
+ + + D G +W+ + P +E+ S+ + + L + M
Sbjct: 570 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERHDGVLVLFM 629
Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
R+ + GRV +SES D G TW A P PN+ G D + + L+L
Sbjct: 630 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 689
Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
+ RG + LS D G SW + T + Y + DG + I + N Q
Sbjct: 690 PF----RGCGVIRLSLDGGRSWKYSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 741
Query: 314 KHVVLLP 320
+ LP
Sbjct: 742 LYFSRLP 748
>gi|201067786|gb|ACH92891.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAADDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|418935311|ref|ZP_13489091.1| glycosyl hydrolase protein, partial [Rhizobium sp. PDO1-076]
gi|375057995|gb|EHS54139.1| glycosyl hydrolase protein, partial [Rhizobium sp. PDO1-076]
Length = 200
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 2/80 (2%)
Query: 21 SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPN 78
++P HA+ + + G +FGG+ EG D+ I++ G W P E+
Sbjct: 38 ASPTVQNHAANLAFLADGTLTCVWFGGTMEGMGDISIYMSRLAPGATAWSVPEKMTEDVT 97
Query: 79 VPMWNPVLFKLPSNGLLLFY 98
NP++F P L Y
Sbjct: 98 KSEQNPIVFNAPDGKAWLIY 117
>gi|375359189|ref|YP_005111961.1| hypothetical protein BF638R_2942 [Bacteroides fragilis 638R]
gi|301163870|emb|CBW23425.1| putative exported protein [Bacteroides fragilis 638R]
Length = 567
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 65/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ D G TW ++P+ + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DDGQTWGESEPSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|201067830|gb|ACH92913.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 10/107 (9%)
Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
H + +R+ G + S D G TW +PT +G+D V+L DG L
Sbjct: 193 HAGRLVQQYTIRTAGGAVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247
Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+L + + S G KVA S D G +W + ++ ++NL +I+A
Sbjct: 248 MLNSRASDSSGFRKVARSTDGGQTWSEPVS-DQNLPDSVDNAQIIRA 293
>gi|376289488|ref|YP_005161735.1| hypothetical protein CDC7B_0282 [Corynebacterium diphtheriae C7
(beta)]
gi|372102885|gb|AEX66482.1| hypothetical protein CDC7B_0282 [Corynebacterium diphtheriae C7
(beta)]
Length = 678
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
M S + G+TWS L P + G S I L+ G LL + +
Sbjct: 452 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 511
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + D G +W + + L + + ++ ++ + R+ + G V + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566
Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
DGG +WS +++P L N DG+ + LL + RGVL+ +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620
Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+DG SW + Y + Q DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651
>gi|375292226|ref|YP_005126765.1| hypothetical protein CDB402_0252 [Corynebacterium diphtheriae INCA
402]
gi|371581897|gb|AEX45563.1| hypothetical protein CDB402_0252 [Corynebacterium diphtheriae INCA
402]
Length = 678
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
M S + G+TWS L P + G S I L+ G LL + +
Sbjct: 452 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 511
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + D G +W + + L + + ++ ++ + R+ + G V + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566
Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
DGG +WS +++P L N DG+ + LL + RGVL+ +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620
Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+DG SW + Y + Q DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651
>gi|239919163|gb|ACS34894.1| sialidase [Corynebacterium diphtheriae]
Length = 687
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
M S + G+TWS L P + G S I L+ G LL + +
Sbjct: 461 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 520
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + D G +W + + L + + ++ ++ + R+ + G V + S
Sbjct: 521 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 575
Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
DGG +WS +++P L N DG+ + LL + RGVL+ +S
Sbjct: 576 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 629
Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+DG SW + Y + Q DGS+
Sbjct: 630 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 660
>gi|201067846|gb|ACH92921.1| sialidase [Actinomyces oris]
gi|201067850|gb|ACH92923.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 10/107 (9%)
Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
H + +R+ G + S D G TW +PT +G+D V+L DG L
Sbjct: 193 HAGRLVQQYTIRTAGGAVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247
Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
+L + + G KVA S D G +W + ++ +ENL +I+A
Sbjct: 248 MLNSRASDGSGFRKVARSTDGGQTWSEPVS-DENLPDSVDNAQIIRA 293
>gi|201067718|gb|ACH92857.1| sialidase [Actinomyces oris]
gi|201067720|gb|ACH92858.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067848|gb|ACH92922.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGQPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
P Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVPDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWVASKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067840|gb|ACH92918.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDKPEIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GPSWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067752|gb|ACH92874.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|443722941|gb|ELU11586.1| hypothetical protein CAPTEDRAFT_201074 [Capitella teleta]
Length = 375
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 95/225 (42%), Gaps = 21/225 (9%)
Query: 96 LFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG--ILGP--------SKNKPILLENG 145
L + I + ++ M S + G TWS+RE LP I GP S K L+ G
Sbjct: 134 LAHNISNKARQHMPVMITSGDGGQTWSDREVLPFKGLIFGPGHGIQKQQSPAKGRLIIPG 193
Query: 146 LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-LSVIQPVPFHTANRTLRVL 204
L VE + + + D G +W+ P+ +E+ ++ + + R L
Sbjct: 194 YL--NQKVEGRANQAVVL-FSDDHGNTWQTSEPLNTGDETAIAELADGRIYMNTRQHAAL 250
Query: 205 MRSFNGIGRVCMSESCDGGLTWSYA-KPTQLLNPNSGIDGVKLKDGRLLLA--YNTVSRG 261
R+ GR +S S DGG+ W A K L P + D LL A +T +R
Sbjct: 251 GRAPAPNGR-WVSTSSDGGVNWLAAEKDLALHTPVCQGSVEQYGDAGLLFANPASTRARV 309
Query: 262 VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
+ V S+D G SW +++T+ A Y + S+G V + +
Sbjct: 310 NMTVRYSEDQGSSWKNSVTVYPGAA---GYSDLGALSNGDVAVLF 351
>gi|201067838|gb|ACH92917.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 66/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++WR P+ + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWRAGTPVGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAK-----PTQLLN-------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
P + N PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDKNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWMTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067828|gb|ACH92912.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPDPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067748|gb|ACH92872.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 66/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G +S S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVSHSTDGGQTWS 273
Query: 228 YAK-----PTQLLN-------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
P + N PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDKNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067738|gb|ACH92867.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|449135021|ref|ZP_21770484.1| Exo-alpha-sialidase [Rhodopirellula europaea 6C]
gi|448886316|gb|EMB16724.1| Exo-alpha-sialidase [Rhodopirellula europaea 6C]
Length = 409
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 56/262 (21%), Positives = 111/262 (42%), Gaps = 41/262 (15%)
Query: 91 SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
++G LL + G++ S MKRS + G TWS+ + G L + + ++L++G
Sbjct: 74 NDGTLLAFAEGRQGGDHSDNDMIMKRSTDNGSTWSDVVLMNDQGKLSLNNPQAVVLDSGR 133
Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
+L S S+ ++++ + D G++W + ++ + + V
Sbjct: 134 VLLMYQRSKLGEHKASDGFGAESYFTFVQTSDDDGQTWSAPHDV---SKQVKREKNVTSV 190
Query: 196 TANRTLRVLMRSFNGIGRVCM--------------SESCDGGLTWSYAK--PTQLLNPNS 239
A + +++R+ + GR+ M + S D G TW+ P +
Sbjct: 191 AAGPGIGIILRNGSHKGRIIMPFNQGPYNDWRVYAAYSDDDGQTWNIGDVAPGDGKGHAN 250
Query: 240 GIDGVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
+ V+L DGR++L T +G KV +S D G +W L +++ L ++++
Sbjct: 251 EVQMVELADGRVMLNARTQGKGSTKHRKVTVSDDGGQTWS-KLQIDQTLIEPTCQASILR 309
Query: 297 ---ASDGSVHITYTYNRTQIKH 315
S+G I + +Q K
Sbjct: 310 YSWPSEGKSRILFANPASQTKR 331
>gi|201067774|gb|ACH92885.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067722|gb|ACH92859.1| sialidase [Actinomyces oris]
gi|201067728|gb|ACH92862.1| sialidase [Actinomyces oris]
Length = 359
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067652|gb|ACH92824.1| sialidase [Actinomyces oris]
Length = 367
Score = 37.7 bits (86), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067784|gb|ACH92890.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067756|gb|ACH92876.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067768|gb|ACH92882.1| sialidase [Actinomyces oris]
gi|201067780|gb|ACH92888.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067714|gb|ACH92855.1| sialidase [Actinomyces oris]
gi|201067842|gb|ACH92919.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067750|gb|ACH92873.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067672|gb|ACH92834.1| sialidase [Actinomyces naeslundii]
Length = 359
Score = 37.7 bits (86), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
+P N +D ++ D R LLL+++ SR +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + EN Y + SDGS+
Sbjct: 332 GASWATSKVFNENF---VGYTTIAVQSDGSI 359
>gi|201067770|gb|ACH92883.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067844|gb|ACH92920.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067754|gb|ACH92875.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067794|gb|ACH92895.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067674|gb|ACH92835.1| sialidase [Actinomyces naeslundii]
Length = 359
Score = 37.7 bits (86), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
+P N +D ++ D R LLL+++ SR +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + EN Y + SDGS+
Sbjct: 332 GASWATSKVFNENF---VGYTTIAVQSDGSI 359
>gi|374375918|ref|ZP_09633576.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
19437]
gi|373232758|gb|EHP52553.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
19437]
Length = 390
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 102/266 (38%), Gaps = 59/266 (22%)
Query: 94 LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----LPPGILGPSKNKPILL--ENGLL 147
LLL+ +G + +GC R S +EQ P + S P L+ + G L
Sbjct: 14 LLLWVTLGWLLLIMNGCSAR------LQSAKEQTTTVFAPDAVYKSTRIPALVYTKRGSL 67
Query: 148 L--CGSSVESWNSWGSW---MEVTVDAGRSWRKYGPIYIPNESLS--VIQPVPFHTANRT 200
L C + + + W M ++D GRSW PI I + V P P AN
Sbjct: 68 LAFCEGRIGTASDWADMNLIMRRSIDGGRSW---APIVIIDSMKGGPVGNPTPIVDANGI 124
Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT---------------QLLNPNSGIDGVK 245
+ +L + G SE D G TW AKP ++L P G G++
Sbjct: 125 IHLLYQRDYAEGFYTFSE--DDGKTW--AKPVNITATYNAFRPEYNWKVLAPGPG-HGIQ 179
Query: 246 LKDGRLLLAY-----------NTVSRGVLKVALSKDDGDSWHDALTLEENLAM----EFS 290
L++GRLL A + S + S D G +W + ++ A S
Sbjct: 180 LQNGRLLSAVWLANSSKTKPRRSHSPSCVATIYSDDHGKTWKRGAIIADSSAAVANPNES 239
Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
P +Q DG+V ++ Q++
Sbjct: 240 MP--VQLEDGTVLMSIRNPSKQLRRA 263
>gi|225155223|ref|ZP_03723717.1| Exo-alpha-sialidase [Diplosphaera colitermitum TAV2]
gi|224803978|gb|EEG22207.1| Exo-alpha-sialidase [Diplosphaera colitermitum TAV2]
Length = 374
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 107/266 (40%), Gaps = 53/266 (19%)
Query: 81 MWNPVLFKLPS---------------NGLLLFYKIGQEVQ-KWSGC---MKRSYNKGVTW 121
+W LF P+ NG+LL + G++ + W+ + S + G TW
Sbjct: 9 IWKTTLFSAPALGYALTRIPALVCTKNGVLLAFCEGRDDKSDWADINIIYRHSTDGGETW 68
Query: 122 SEREQLPPGILGPSKNK-PILLENGLL--LCGSSVESWNSWGSWMEVTVDAGRSWRKYGP 178
S+ + G GP+ N PI ++G + +C + + + + D G +W +
Sbjct: 69 SDARIILDGEGGPASNSTPIAGKDGTVHFICQKNYKD-----VYYYRSTDDGLTWSEPRD 123
Query: 179 I------YIPNESLSVIQPVPFHTAN-RTLRVLMRSF----------NGIGR-VCMS--E 218
I Y VI P P H + R+L+ + G R C+S
Sbjct: 124 ITSVFDHYKAEYDWEVIAPGPGHAIELKNGRLLVPVWLCDPAGPALVGGDHRPSCVSTIA 183
Query: 219 SCDGGLTWS----YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK-VALSKDDGD 273
S DGGLTW A T ++ S +L DGR+++ + S+G + V++S D
Sbjct: 184 SDDGGLTWKRGDIIANTTDIIRNPSECTVAELSDGRVMINIRSESKGYRRLVSISADGTT 243
Query: 274 SWHDALTLEENLAMEFSYPAVIQASD 299
W + +E L + + A+D
Sbjct: 244 GWTKPV-FDEALYDPICMASFLSATD 268
>gi|201067798|gb|ACH92897.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.7 bits (86), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|344204493|ref|YP_004789636.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343956415|gb|AEM72214.1| hypothetical protein Murru_3194 [Muricauda ruestringensis DSM
13258]
Length = 870
Score = 37.7 bits (86), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 78/180 (43%), Gaps = 34/180 (18%)
Query: 112 KRSY----NKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
RSY + G TWSE L P S K +L+ NG ++ S+ ++ S +
Sbjct: 251 SRSYFYSNDDGATWSEVLVLSPD--NYSILKSLLISNGKIIVSSTSGNYES--------L 300
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G SW + I ++++ ++ F G G + S D G++W+
Sbjct: 301 DNGMSW-TWNNTGIRTQNITTFASNDYYL-----------FTGTGGNGLFRSADNGVSWT 348
Query: 228 YAKPTQLLNPNS-GIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLA 286
A + + PN+ I + D +L++A S GV LS D+GD+W + N+A
Sbjct: 349 -AINSGIDRPNAFTIHEMIEVDNKLIIA---TSGGVF---LSNDNGDTWEKKINPSVNIA 401
>gi|383317755|ref|YP_005378597.1| outer membrane receptor protein [Frateuria aurantia DSM 6220]
gi|379044859|gb|AFC86915.1| outer membrane receptor protein [Frateuria aurantia DSM 6220]
Length = 809
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 11/121 (9%)
Query: 156 WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL-RVLMRSFNGIGRV 214
W +GS + VDA S N+ +S F T +L RV MR N V
Sbjct: 591 WRFYGSRVSAVVDAYAS-------NFSNKQISGYDQNTFQTVYLSLGRVHMRGLNTELNV 643
Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDS 274
+S+ ++SY + N NSG DG+ G+ LL R +L V +S G +
Sbjct: 644 KLSQDWSIYGSYSYTQAIMQDNLNSGGDGIYYTRGKTLL---NTPRNLLNVGVSFHHGPA 700
Query: 275 W 275
W
Sbjct: 701 W 701
>gi|358640105|dbj|BAL27401.1| hypothetical protein AZKH_p0518 [Azoarcus sp. KH32C]
Length = 321
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 81/203 (39%), Gaps = 38/203 (18%)
Query: 88 KLPSNGLLLFYKIGQEVQKWS----GCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-L 142
K+P N L E W+ G + + + G TW+ + + S+++P+ +
Sbjct: 78 KVPVNATLTGVSFADERNGWAVGHWGVILHTDDGGETWTRQR------VETSEDRPLFSV 131
Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
G++V W S + TVD G++W + PN++ A+R L
Sbjct: 132 HFTDAQHGTAVGLW----SLLLRTVDGGKTWDRVSLPPPPNDT----------RADRNLM 177
Query: 203 VLMRSFNG----IG-RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
+ G +G R + S D G TWSY S G L DG LL+A
Sbjct: 178 HVFADRRGELFVVGERGTVLHSNDHGQTWSYQATGY---RGSFWSGTSLSDGTLLIAG-- 232
Query: 258 VSRGVLKVALSKDDGDSWHDALT 280
RG L S D GDSWH T
Sbjct: 233 -LRGSLY--RSTDGGDSWHKVAT 252
>gi|32472463|ref|NP_865457.1| sialidase [precursor] [Rhodopirellula baltica SH 1]
gi|32443699|emb|CAD73141.1| Sialidase [Precursor] [Rhodopirellula baltica SH 1]
Length = 409
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 108/259 (41%), Gaps = 35/259 (13%)
Query: 91 SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
++G LL + G++ S MKRS + G TWS+ L G L + + ++L++G
Sbjct: 74 NHGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSLNNPQAVVLQSGR 133
Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
+L S NS+ ++++ + D G++W R + ++ +
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKREKKVTSVAAG 193
Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
P R++M G RV + S D G TW+ P + +
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253
Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ--- 296
V+L DGR++L T +G KVA+S + G +W L +++ L ++++
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQMDQTLIEPTCQASILRYSW 312
Query: 297 ASDGSVHITYTYNRTQIKH 315
S G I ++ +Q K
Sbjct: 313 PSKGKSRIIFSNPASQTKR 331
>gi|417303698|ref|ZP_12090746.1| Exo-alpha-sialidase [Rhodopirellula baltica WH47]
gi|327540032|gb|EGF26628.1| Exo-alpha-sialidase [Rhodopirellula baltica WH47]
Length = 409
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 95/226 (42%), Gaps = 32/226 (14%)
Query: 91 SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
+NG LL + G++ S MKRS + G TWS+ L G L + + ++L++G
Sbjct: 74 NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGQLSLNNPQAVVLQSGR 133
Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
++ S NS+ + ++ + D G++W R + ++ +
Sbjct: 134 IVLMYQRSKLGEHKASDGYGPNSYFTLVQTSDDDGQTWSAPRDVSKQVKREKKVTSVAAG 193
Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
P R++M G RV + S D G TW+ P + +
Sbjct: 194 PGIGIVLQNGRHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253
Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENL 285
V+L DGR++L T +G KVA+S + G +W L +++ L
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQIDQTL 298
>gi|201067730|gb|ACH92863.1| sialidase [Actinomyces oris]
gi|201067836|gb|ACH92916.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGQTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|423250701|ref|ZP_17231716.1| hypothetical protein HMPREF1066_02726 [Bacteroides fragilis
CL03T00C08]
gi|423254027|ref|ZP_17234957.1| hypothetical protein HMPREF1067_01601 [Bacteroides fragilis
CL03T12C07]
gi|392651658|gb|EIY45320.1| hypothetical protein HMPREF1066_02726 [Bacteroides fragilis
CL03T00C08]
gi|392654585|gb|EIY48232.1| hypothetical protein HMPREF1067_01601 [Bacteroides fragilis
CL03T12C07]
Length = 567
Score = 37.4 bits (85), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 65/166 (39%), Gaps = 31/166 (18%)
Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
P+ + NG + ++ G + + D G +W+ + P+ +
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202
Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
++P + TL +LMR+ S+ DGG TW ++ + + +L
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESETSPFYGTITMPTLGRLA 260
Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
DGRLLL + NT +R V VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306
>gi|201067806|gb|ACH92901.1| sialidase [Actinomyces oris]
Length = 359
Score = 37.4 bits (85), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGQTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359
>gi|201067658|gb|ACH92827.1| sialidase [Actinomyces naeslundii]
gi|201067662|gb|ACH92829.1| sialidase [Actinomyces naeslundii]
gi|201067668|gb|ACH92832.1| sialidase [Actinomyces naeslundii]
gi|201067678|gb|ACH92837.1| sialidase [Actinomyces naeslundii]
gi|201067702|gb|ACH92849.1| sialidase [Actinomyces naeslundii]
gi|201067708|gb|ACH92852.1| sialidase [Actinomyces naeslundii]
gi|201067712|gb|ACH92854.1| sialidase [Actinomyces naeslundii]
Length = 359
Score = 37.4 bits (85), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
+P N +D ++ D R LLL+++ SR +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW EN Y + SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359
>gi|201067692|gb|ACH92844.1| sialidase [Actinomyces naeslundii]
Length = 359
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
+P N +D ++ D R LLL+++ SR +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW EN Y + SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359
>gi|201067650|gb|ACH92823.1| sialidase [Actinomyces naeslundii]
Length = 376
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ P + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273
Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
+P N +D ++ D R LLL+++ SR +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW EN Y + SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359
>gi|376283834|ref|YP_005157044.1| hypothetical protein CD31A_0333 [Corynebacterium diphtheriae 31A]
gi|371577350|gb|AEX41018.1| hypothetical protein CD31A_0333 [Corynebacterium diphtheriae 31A]
Length = 678
Score = 37.4 bits (85), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 48/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENGL----LLCGSSVESWNSW 159
M S + G++WS L P + G S I L+ G LL + +
Sbjct: 452 MLTSDDDGLSWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRGTHAGRLLIPMYYTKGDQF 511
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + D G +W + + L + + ++ ++ + R+ + G V + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566
Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
DGG +WS +++P L N DG+ + LL + RGVL+ +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620
Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+DG SW + Y + Q DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651
>gi|325067903|ref|ZP_08126576.1| BNR/Asp-box repeat protein [Actinomyces oris K20]
Length = 685
Score = 37.4 bits (85), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)
Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
D G++W+ PI + V++ ++ +L + R+ +G G ++ S DGG TWS
Sbjct: 491 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 545
Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
Q L PN+ D + K LLL+++ SR +++S DD
Sbjct: 546 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 603
Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
G SW + E Y + SDGS+
Sbjct: 604 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 631
>gi|376247613|ref|YP_005139557.1| hypothetical protein CDHC04_0246 [Corynebacterium diphtheriae HC04]
gi|376250431|ref|YP_005137312.1| hypothetical protein CDHC03_0263 [Corynebacterium diphtheriae HC03]
gi|372111935|gb|AEX77994.1| hypothetical protein CDHC03_0263 [Corynebacterium diphtheriae HC03]
gi|372114181|gb|AEX80239.1| hypothetical protein CDHC04_0246 [Corynebacterium diphtheriae HC04]
Length = 687
Score = 37.4 bits (85), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 48/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)
Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENGL----LLCGSSVESWNSW 159
M S + G++WS L P + G S I L+ G LL + +
Sbjct: 461 MLTSDDDGLSWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRGTHAGRLLIPMYYTKGDQF 520
Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
+ + D G +W + + L + + ++ ++ + R+ + G V + S
Sbjct: 521 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 575
Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
DGG +WS +++P L N DG+ + LL + RGVL+ +S
Sbjct: 576 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 629
Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
+DG SW + Y + Q DGS+
Sbjct: 630 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 660
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.136 0.433
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,754,777,526
Number of Sequences: 23463169
Number of extensions: 255997723
Number of successful extensions: 474583
Number of sequences better than 100.0: 951
Number of HSP's better than 100.0 without gapping: 553
Number of HSP's successfully gapped in prelim test: 398
Number of HSP's that attempted gapping in prelim test: 470861
Number of HSP's gapped (non-prelim): 1845
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)