BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 020796
         (321 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255581890|ref|XP_002531744.1| conserved hypothetical protein [Ricinus communis]
 gi|223528614|gb|EEF30633.1| conserved hypothetical protein [Ricinus communis]
          Length = 321

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 267/321 (83%), Positives = 291/321 (90%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M+KDC I+G + EEFTFP  SAPF +CHASTIVEVDK HFLVAYFGG+ EGAPDVKIWLQ
Sbjct: 1   MRKDCYIRGPLKEEFTFPEKSAPFDNCHASTIVEVDKDHFLVAYFGGTLEGAPDVKIWLQ 60

Query: 61  TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
           T+KDG WQSP+I DE+P VPMWNPVLFKL S  LLLFYKIGQEVQKWSGCMKRSY+KGVT
Sbjct: 61  TYKDGFWQSPVIVDEQPEVPMWNPVLFKLSSEELLLFYKIGQEVQKWSGCMKRSYDKGVT 120

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           W+ REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG+WMEVT DAGRSW+KYGPIY
Sbjct: 121 WTAREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGAWMEVTADAGRSWKKYGPIY 180

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           I N SLSVIQPV F TA  TLRVL+RSF+GIG++CMSES DGGL W YAK T+L NPNSG
Sbjct: 181 IVNTSLSVIQPVLFQTAKGTLRVLLRSFDGIGKICMSESLDGGLNWGYAKLTELPNPNSG 240

Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           IDGVKLKDGRLLLAYNT+SRGVLKVA+S+DDGDSW+D +TLEENL MEFSYPAVIQASDG
Sbjct: 241 IDGVKLKDGRLLLAYNTISRGVLKVAISEDDGDSWYDVITLEENLEMEFSYPAVIQASDG 300

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
           SVHI+YTYNRTQIK+VVL PN
Sbjct: 301 SVHISYTYNRTQIKYVVLQPN 321


>gi|224142297|ref|XP_002324495.1| predicted protein [Populus trichocarpa]
 gi|222865929|gb|EEF03060.1| predicted protein [Populus trichocarpa]
          Length = 321

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 268/321 (83%), Positives = 291/321 (90%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M KDC    LV EEFTFPANS+PF SCHASTIVEVDK HFLVAYFGG+ EGAPDVKIW+Q
Sbjct: 1   MSKDCHNWQLVKEEFTFPANSSPFNSCHASTIVEVDKNHFLVAYFGGTKEGAPDVKIWIQ 60

Query: 61  TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
           T+KDG WQSPII D++ +VPMWNPVLFKL S  LLLFYK+GQEVQKWSGCMKRSY+KGVT
Sbjct: 61  TYKDGYWQSPIIVDDQLDVPMWNPVLFKLASEELLLFYKVGQEVQKWSGCMKRSYDKGVT 120

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           WS+REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG+WMEVT DAGRSWRK+GPIY
Sbjct: 121 WSKREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGAWMEVTADAGRSWRKHGPIY 180

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           + N SLSVIQPVPF TA+ T RVL+RSF+ IGRVCMSES DGG+TW YAKPT+L NPNSG
Sbjct: 181 VENTSLSVIQPVPFQTADGTFRVLLRSFDDIGRVCMSESYDGGVTWGYAKPTELPNPNSG 240

Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           IDGVKLKDG +LLAYNT+SRGVLKVA+SKDDGDSWHD +TLEE L MEFSYPAVIQASDG
Sbjct: 241 IDGVKLKDGPVLLAYNTISRGVLKVAVSKDDGDSWHDVVTLEEKLDMEFSYPAVIQASDG 300

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
           SVHITYTYNRTQIKHVVL P+
Sbjct: 301 SVHITYTYNRTQIKHVVLQPS 321


>gi|363808210|ref|NP_001242743.1| uncharacterized protein LOC100809430 [Glycine max]
 gi|255636459|gb|ACU18568.1| unknown [Glycine max]
          Length = 354

 Score =  536 bits (1382), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 257/321 (80%), Positives = 282/321 (87%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M  +C I+G + EEFT+ A SAPFKSCHASTIVEVDK HFLVAYFGG+ EGAPDVKIW+Q
Sbjct: 34  MASNCSIEGPLVEEFTYQAGSAPFKSCHASTIVEVDKDHFLVAYFGGTFEGAPDVKIWVQ 93

Query: 61  TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
           T+K+GRWQ+P+IADEEPNVPMWNP LFKLPS+ LLLFYKIGQEVQKWSG MKRSY+KG T
Sbjct: 94  TYKNGRWQAPVIADEEPNVPMWNPALFKLPSDELLLFYKIGQEVQKWSGFMKRSYDKGKT 153

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           W+ REQLPPGILGP KNKPILLENG LLCGSSVESWNSWG+W+EVT D GRSW KYGPIY
Sbjct: 154 WTGREQLPPGILGPIKNKPILLENGNLLCGSSVESWNSWGAWVEVTTDFGRSWSKYGPIY 213

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           I N+ LSVIQPVP+ TA+  LRVL+RSF+GIGRVCMSES DGG TW YAKPTQL NPNSG
Sbjct: 214 IENKPLSVIQPVPYQTADGKLRVLLRSFDGIGRVCMSESSDGGKTWGYAKPTQLPNPNSG 273

Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           IDGVKL+DGRLLLAYNTVSR V KVALSKDDGDSW + LTLE+   MEFSYPAVIQ SDG
Sbjct: 274 IDGVKLRDGRLLLAYNTVSRSVPKVALSKDDGDSWCEVLTLEDTSGMEFSYPAVIQDSDG 333

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
            +HITYTYNRTQIK VVL P+
Sbjct: 334 RIHITYTYNRTQIKRVVLRPS 354


>gi|356547785|ref|XP_003542289.1| PREDICTED: uncharacterized protein LOC100779946 [Glycine max]
          Length = 366

 Score =  533 bits (1372), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 255/319 (79%), Positives = 278/319 (87%)

Query: 2   KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
           K    +KG V EEFTFP  S PF SCHASTIVEV KGHFLVAYFGGS EGAPDVKIWLQT
Sbjct: 40  KNSSYVKGPVVEEFTFPERSTPFNSCHASTIVEVGKGHFLVAYFGGSSEGAPDVKIWLQT 99

Query: 62  FKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
           +K GRW+SPIIADEEPNVPMWNPVLFKLPSN LLLFYKIG +VQKWSG MKRSY+KG+TW
Sbjct: 100 YKSGRWKSPIIADEEPNVPMWNPVLFKLPSNELLLFYKIGLDVQKWSGFMKRSYDKGITW 159

Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
           +EREQLP GILGP KNKP+LLENG LLCGSSVESWNSWG+W EVT D G +WRKYGPIYI
Sbjct: 160 TEREQLPSGILGPIKNKPLLLENGDLLCGSSVESWNSWGAWAEVTTDFGGAWRKYGPIYI 219

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
            N+   VIQPVP+ TA  TLRVL+RSF G+GR+ MSES DGG +W YAKPTQL NPNSGI
Sbjct: 220 ENKPHGVIQPVPYQTAKGTLRVLLRSFTGLGRIYMSESFDGGKSWGYAKPTQLPNPNSGI 279

Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           DGVKLKDGRLLLAYNT+SRGVLK+ALS+DDGDSWH+ALTLE+ L MEFSYPAVIQASDG 
Sbjct: 280 DGVKLKDGRLLLAYNTISRGVLKLALSEDDGDSWHEALTLEDTLGMEFSYPAVIQASDGR 339

Query: 302 VHITYTYNRTQIKHVVLLP 320
           VH+TYTY RTQIKHVV+ P
Sbjct: 340 VHVTYTYKRTQIKHVVVRP 358


>gi|357479849|ref|XP_003610210.1| hypothetical protein MTR_4g129200 [Medicago truncatula]
 gi|355511265|gb|AES92407.1| hypothetical protein MTR_4g129200 [Medicago truncatula]
          Length = 467

 Score =  532 bits (1370), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 253/316 (80%), Positives = 279/316 (88%), Gaps = 1/316 (0%)

Query: 7   IKGLVAEEFTFPANSA-PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
           I+GLV EEFTFPA S+ PF SCHASTIVEV KGHFLVAYFGG+ EGAPDVKIWLQT+K+G
Sbjct: 38  IQGLVVEEFTFPAKSSVPFNSCHASTIVEVGKGHFLVAYFGGTSEGAPDVKIWLQTYKNG 97

Query: 66  RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
            WQ+P+I DEEPNVPMWNPVLFKLPSN LLLFYKIGQ+ QKWSG +KRSY+KG+TW+ERE
Sbjct: 98  IWQAPVIGDEEPNVPMWNPVLFKLPSNVLLLFYKIGQDFQKWSGFIKRSYDKGITWTERE 157

Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
           QLPPGILGP KNKPILLENG LLCGSSVESWNSWGSW EVT D G++WRKYGPIYI N  
Sbjct: 158 QLPPGILGPIKNKPILLENGDLLCGSSVESWNSWGSWAEVTTDFGKTWRKYGPIYIKNNP 217

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           L VIQPVP+ TA  TLRVL+RSF GIGRV MSES DGG TW YA+PTQL NPNSGIDG+K
Sbjct: 218 LGVIQPVPYQTAKGTLRVLLRSFTGIGRVYMSESFDGGKTWGYAEPTQLHNPNSGIDGIK 277

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
           LKDGRLL+AYNT+SRGVLK+ALS+DDGDSWH ALTLE+ + MEFSYPAVIQASDG VH+T
Sbjct: 278 LKDGRLLIAYNTISRGVLKLALSEDDGDSWHQALTLEDTVGMEFSYPAVIQASDGRVHVT 337

Query: 306 YTYNRTQIKHVVLLPN 321
           YTYNRTQIK  V+  N
Sbjct: 338 YTYNRTQIKAKVMQTN 353


>gi|357479845|ref|XP_003610208.1| hypothetical protein MTR_4g129180 [Medicago truncatula]
 gi|355511263|gb|AES92405.1| hypothetical protein MTR_4g129180 [Medicago truncatula]
          Length = 354

 Score =  523 bits (1347), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 250/321 (77%), Positives = 280/321 (87%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M+  C I G + EE TF   SAPF +CHASTIVE DK HFLVAYFGG+ EGAPDVKIWLQ
Sbjct: 34  MENRCPIYGPLTEELTFLEGSAPFNNCHASTIVEFDKDHFLVAYFGGTSEGAPDVKIWLQ 93

Query: 61  TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
           T K+GRW+SP+IADE+PNVPMWNPVLFKLPS+ LLLFYKIGQEVQKWSG +KRSY+KG T
Sbjct: 94  TCKNGRWESPVIADEQPNVPMWNPVLFKLPSDELLLFYKIGQEVQKWSGFVKRSYDKGNT 153

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           W+EREQLPPGILGP KNKPILLENG L+CGSSVESWNSWG+W+EVT D GRSW K+GPIY
Sbjct: 154 WTEREQLPPGILGPIKNKPILLENGHLICGSSVESWNSWGAWVEVTSDFGRSWSKHGPIY 213

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           I NE LSVIQPVP+ TA+  LRVL+RSF+GIGRVC+SES DGG TW +AKPTQL NPNSG
Sbjct: 214 IENEPLSVIQPVPYQTADGKLRVLLRSFDGIGRVCISESSDGGQTWGHAKPTQLPNPNSG 273

Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           IDGVKL+DGR+LL YNT SRG+LKVALS DDGDSW++ALTLE++  MEFSYPAVIQASDG
Sbjct: 274 IDGVKLRDGRILLVYNTTSRGILKVALSNDDGDSWYEALTLEDSSGMEFSYPAVIQASDG 333

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
            +HITYTY RTQIKHV L PN
Sbjct: 334 LIHITYTYKRTQIKHVTLRPN 354


>gi|449464334|ref|XP_004149884.1| PREDICTED: uncharacterized protein LOC101219119 [Cucumis sativus]
 gi|449507997|ref|XP_004163188.1| PREDICTED: uncharacterized protein LOC101230872 [Cucumis sativus]
          Length = 398

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 245/318 (77%), Positives = 273/318 (85%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M K   + G + EEFTFPANS PF +CHASTIVEVDK H+LVAYFGG+ EGAPDVKIWLQ
Sbjct: 73  MGKGSYVDGPLQEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTLEGAPDVKIWLQ 132

Query: 61  TFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
            FK+G W SPI+ADEEP++PMWNPVLFKLPS+ LLLFYK+GQEVQKWSGCMKRSY+KG+T
Sbjct: 133 AFKNGSWHSPIVADEEPDIPMWNPVLFKLPSDELLLFYKVGQEVQKWSGCMKRSYDKGIT 192

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           W+ REQLPPGILGP KNKPILLENG+LLCGSSVESWNSWG+WMEVT D+GRSWRK+GPIY
Sbjct: 193 WTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEVTSDSGRSWRKFGPIY 252

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           + N SLSVIQPVP+ TAN  LRVL+RSF GI  +CMSES DGG  WS+A PT L NPNSG
Sbjct: 253 MKNRSLSVIQPVPYQTANGNLRVLLRSFTGIDSICMSESNDGGHNWSFAIPTNLPNPNSG 312

Query: 241 IDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           IDGVKL+DGRLLL YNTVSRGVLKV LS DDGDSW D +TLE+    EFSYPAVI+ASD 
Sbjct: 313 IDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDGDSWQDVMTLEDEPENEFSYPAVIEASDD 372

Query: 301 SVHITYTYNRTQIKHVVL 318
           S+HITYTY R QIKHVV 
Sbjct: 373 SIHITYTYRRKQIKHVVF 390


>gi|297793281|ref|XP_002864525.1| hypothetical protein ARALYDRAFT_495867 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310360|gb|EFH40784.1| hypothetical protein ARALYDRAFT_495867 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  515 bits (1327), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 247/309 (79%), Positives = 270/309 (87%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV + HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8   VLLETFTFPADSAPFKSCHASTIVEVVEDHFLTAYFGGTTEGAPDVKIWLQHFKDGQWDS 67

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPMWNPVLFKLPS  LLLFYKIGQEVQKWSGCMKRSY+KG TW+EREQLPP
Sbjct: 68  PVIVDEEPGVPMWNPVLFKLPSQELLLFYKIGQEVQKWSGCMKRSYDKGRTWTEREQLPP 127

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGRSWRK GPIYI  +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRSWRKQGPIYIQGKSLSVI 187

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LRVL+RSF GI ++C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGKLRVLLRSFTGIDKICISESSDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLKV +S DDGDSW D LTLEE+  MEFSYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKVGVSLDDGDSWTDVLTLEESPGMEFSYPAVIQAGDGNVHVTYTYN 307

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316


>gi|30696915|ref|NP_851204.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
 gi|119360001|gb|ABL66729.1| At5g57700 [Arabidopsis thaliana]
 gi|332009554|gb|AED96937.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
          Length = 352

 Score =  513 bits (1320), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 244/309 (78%), Positives = 272/309 (88%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 13  VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 72

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 73  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 132

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 133 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 192

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 193 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 252

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 253 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 312

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 313 RTQIKHVVL 321


>gi|145334835|ref|NP_001078763.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
 gi|332009556|gb|AED96939.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
          Length = 358

 Score =  513 bits (1320), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 244/309 (78%), Positives = 272/309 (88%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 19  VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 78

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 79  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 138

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 139 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 198

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 199 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 258

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 259 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 318

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 319 RTQIKHVVL 327


>gi|238481582|ref|NP_001154784.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
 gi|332009557|gb|AED96940.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  512 bits (1319), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 244/309 (78%), Positives = 272/309 (88%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 9   VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 68

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 69  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 128

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 129 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 188

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 189 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 248

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 249 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 308

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 309 RTQIKHVVL 317


>gi|30696919|ref|NP_568864.2| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
 gi|124301162|gb|ABN04833.1| At5g57700 [Arabidopsis thaliana]
 gi|332009555|gb|AED96938.1| alpha-rhamnosidase-like protein [Arabidopsis thaliana]
          Length = 347

 Score =  512 bits (1319), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 244/309 (78%), Positives = 272/309 (88%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8   VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316


>gi|21536870|gb|AAM61202.1| unknown [Arabidopsis thaliana]
          Length = 347

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 243/309 (78%), Positives = 272/309 (88%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8   VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE+G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 128 GILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI ++C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDKICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307

Query: 310 RTQIKHVVL 318
           RTQIKHVVL
Sbjct: 308 RTQIKHVVL 316


>gi|9759267|dbj|BAB09588.1| unnamed protein product [Arabidopsis thaliana]
          Length = 371

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 244/328 (74%), Positives = 272/328 (82%), Gaps = 19/328 (5%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 13  VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 72

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 73  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 132

Query: 130 GILGPSKNK-------------------PILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           GILGP KNK                   PILLE+G LLCGSSVESWNSWG+WMEVT DAG
Sbjct: 133 GILGPIKNKVLVALRRLDYSIKTKPFVLPILLEDGTLLCGSSVESWNSWGAWMEVTSDAG 192

Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
           R+WRK GPIYI  +SLSVIQPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A 
Sbjct: 193 RTWRKKGPIYIQGKSLSVIQPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAV 252

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           PT L NPNSGIDGVKLKDGRL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+S
Sbjct: 253 PTVLPNPNSGIDGVKLKDGRLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYS 312

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YPAVIQA DG+VH+TYTYNRTQIKHVVL
Sbjct: 313 YPAVIQAGDGNVHVTYTYNRTQIKHVVL 340


>gi|326520790|dbj|BAJ92758.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 418

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 230/312 (73%), Positives = 268/312 (85%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EE TF A SAPF SCHASTIVE+ K +FLVAYFGGS EGAPDV+IWLQ + DG W S
Sbjct: 102 IVKEELTFSAGSAPFSSCHASTIVEIGKDNFLVAYFGGSKEGAPDVRIWLQRYSDGHWHS 161

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G++WSEREQLPP
Sbjct: 162 PEVADEQDGVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGISWSEREQLPP 221

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI  E+L VI
Sbjct: 222 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 281

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+R+L+RSF  IGRVCM++S DGG+TWSYA  T+L NPNSGIDGVK+KDG
Sbjct: 282 QPVPYQTANGTIRMLLRSFETIGRVCMADSADGGVTWSYAHKTELPNPNSGIDGVKMKDG 341

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+L+AYNT SRG+LK+A+S+DDGDSW++ +TLE+   MEFSYPAVI+  D  VHITYTYN
Sbjct: 342 RILVAYNTFSRGILKIAVSRDDGDSWNEVMTLEDTKGMEFSYPAVIKTMDELVHITYTYN 401

Query: 310 RTQIKHVVLLPN 321
           RTQIKHVVL P+
Sbjct: 402 RTQIKHVVLRPS 413


>gi|326514456|dbj|BAJ96215.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 391

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 230/312 (73%), Positives = 268/312 (85%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EE TF A SAPF SCHASTIVE+ K +FLVAYFGGS EGAPDV+IWLQ + DG W S
Sbjct: 75  IVKEELTFSAGSAPFSSCHASTIVEIGKDNFLVAYFGGSKEGAPDVRIWLQRYSDGHWHS 134

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G++WSEREQLPP
Sbjct: 135 PEVADEQDGVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGISWSEREQLPP 194

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI  E+L VI
Sbjct: 195 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 254

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+R+L+RSF  IGRVCM++S DGG+TWSYA  T+L NPNSGIDGVK+KDG
Sbjct: 255 QPVPYQTANGTIRMLLRSFETIGRVCMADSADGGVTWSYAHKTELPNPNSGIDGVKMKDG 314

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+L+AYNT SRG+LK+A+S+DDGDSW++ +TLE+   MEFSYPAVI+  D  VHITYTYN
Sbjct: 315 RILVAYNTFSRGILKIAVSRDDGDSWNEVMTLEDTKGMEFSYPAVIKTMDELVHITYTYN 374

Query: 310 RTQIKHVVLLPN 321
           RTQIKHVVL P+
Sbjct: 375 RTQIKHVVLRPS 386


>gi|26452755|dbj|BAC43458.1| unknown protein [Arabidopsis thaliana]
          Length = 310

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 237/302 (78%), Positives = 264/302 (87%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ E FTFPA+SAPFKSCHASTIVEV K HFL AYFGG+ EGAPDVKIWLQ FKDG+W S
Sbjct: 8   VLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAPDVKIWLQHFKDGQWDS 67

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+I DEEP VPM+NPVLFKLPS+ LLLFYKIGQEVQKWSGCMKRSY+KG+TW+EREQLPP
Sbjct: 68  PVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKRSYDKGITWTEREQLPP 127

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLE G LLCGSSVESWNSWG+WMEVT DAGR+WRK GPIYI  +SLSVI
Sbjct: 128 GILGPIKNKPILLEGGTLLCGSSVESWNSWGAWMEVTSDAGRTWRKKGPIYIQGKSLSVI 187

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TA   LR+L+RSF GI R+C+SES DGG  WS+A PT L NPNSGIDGVKLKDG
Sbjct: 188 QPVPYQTAAGNLRILLRSFTGIDRICISESLDGGENWSFAVPTVLPNPNSGIDGVKLKDG 247

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           RL+LAYNT SRGVLK+ +S DDGDSW D LTLEE+  ME+SYPAVIQA DG+VH+TYTYN
Sbjct: 248 RLVLAYNTDSRGVLKLGVSLDDGDSWTDILTLEESPGMEYSYPAVIQAGDGNVHVTYTYN 307

Query: 310 RT 311
           RT
Sbjct: 308 RT 309


>gi|115472361|ref|NP_001059779.1| Os07g0516000 [Oryza sativa Japonica Group]
 gi|34393670|dbj|BAC83370.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
 gi|50508733|dbj|BAD31309.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
 gi|113611315|dbj|BAF21693.1| Os07g0516000 [Oryza sativa Japonica Group]
          Length = 424

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 234/312 (75%), Positives = 263/312 (84%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EEFTFPA SAPF SCHASTIVE +K  FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 108 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 167

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 168 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 227

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI  E+L VI
Sbjct: 228 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 287

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+RVL+RSF  IGRVCM++S DGG+TWSY   T L NPNSGIDGVK+KDG
Sbjct: 288 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 347

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+LLAYNT SRG LKVA+S DDGDSW+D +TLE+   MEFSYPAVIQ  D  +HITYTYN
Sbjct: 348 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMTLEDTEGMEFSYPAVIQTMDDLIHITYTYN 407

Query: 310 RTQIKHVVLLPN 321
           RTQIKHVVL P+
Sbjct: 408 RTQIKHVVLQPS 419


>gi|357122645|ref|XP_003563025.1| PREDICTED: uncharacterized protein LOC100823135 [Brachypodium
           distachyon]
          Length = 404

 Score =  493 bits (1268), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 230/312 (73%), Positives = 263/312 (84%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EE TFPA +APF SCHASTIVEV+K +FLVAYFGGS EGAPDVKIWLQ + DG W  
Sbjct: 88  ILKEELTFPAGTAPFNSCHASTIVEVEKDNFLVAYFGGSKEGAPDVKIWLQRYGDGYWHP 147

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLFKLPS  LLLFYKIGQEVQKWSG MKRS + G++WS+REQLP 
Sbjct: 148 PKVADEQDEVPMWNPVLFKLPSRELLLFYKIGQEVQKWSGAMKRSLDGGISWSQREQLPA 207

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIY+  E+L VI
Sbjct: 208 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYVEGETLGVI 267

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+RVL+RSF  IGRVCM++S DGG+TWSY   T+L NPNSGIDGVK+KDG
Sbjct: 268 QPVPYQTANGTIRVLLRSFETIGRVCMADSIDGGVTWSYVHKTELPNPNSGIDGVKMKDG 327

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+LLAYNT SRG LK+A+S DDGDSW + +TLE+   MEFSYPAVIQ  D  VHITYTYN
Sbjct: 328 RVLLAYNTFSRGTLKIAISTDDGDSWKEVMTLEDTKGMEFSYPAVIQTMDELVHITYTYN 387

Query: 310 RTQIKHVVLLPN 321
           RTQIKHVVL P+
Sbjct: 388 RTQIKHVVLQPS 399


>gi|222637136|gb|EEE67268.1| hypothetical protein OsJ_24447 [Oryza sativa Japonica Group]
          Length = 1184

 Score =  485 bits (1249), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 229/305 (75%), Positives = 257/305 (84%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EEFTFPA SAPF SCHASTIVE +K  FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 608 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 667

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 668 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 727

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI  E+L VI
Sbjct: 728 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 787

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+RVL+RSF  IGRVCM++S DGG+TWSY   T L NPNSGIDGVK+KDG
Sbjct: 788 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 847

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+LLAYNT SRG LKVA+S DDGDSW+D +TLE+   MEFSYPAVIQ  D  +HITYTYN
Sbjct: 848 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMTLEDTEGMEFSYPAVIQTMDDLIHITYTYN 907

Query: 310 RTQIK 314
           RTQIK
Sbjct: 908 RTQIK 912



 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 26/308 (8%)

Query: 14   EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
              T+  N    K    +   +V K  FLVAYFGG+ EGA DVKIWLQ + +G W SP + 
Sbjct: 901  HITYTYNRTQIKFDLPTLSSQVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHSPEVV 960

Query: 74   DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG 133
            DE PNVP+WNPVLF+LPS  +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPPGILG
Sbjct: 961  DEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPPGILG 1020

Query: 134  PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
            P KNK                          VT D G++W+KYGPIY+  +++ VIQPV 
Sbjct: 1021 PIKNK--------------------------VTKDHGKTWKKYGPIYVRGKTMGVIQPVL 1054

Query: 194  FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
            + T++ T+R+L+R  + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDGR++L
Sbjct: 1055 YQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDGRVVL 1114

Query: 254  AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
             YN+ SRGVLKVA+S+DDGD W D LTLEE   +EFSYPAVIQ SDG VH+TYTY RTQI
Sbjct: 1115 VYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYKRTQI 1174

Query: 314  KHVVLLPN 321
            KHVVL P+
Sbjct: 1175 KHVVLQPS 1182


>gi|218199713|gb|EEC82140.1| hypothetical protein OsI_26191 [Oryza sativa Indica Group]
          Length = 684

 Score =  483 bits (1242), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 228/305 (74%), Positives = 256/305 (83%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EEFTFPA SAPF SCHASTIVE +K  FLVAYFGGS EGAPDVKIWLQ + DG W +
Sbjct: 108 IVKEEFTFPAGSAPFNSCHASTIVETEKDSFLVAYFGGSKEGAPDVKIWLQRYSDGCWHT 167

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +ADE+  VPMWNPVLF+LPS+ LLLFYKIGQEVQKWSG MKRS + G TWS REQLPP
Sbjct: 168 PQVADEQDEVPMWNPVLFQLPSHELLLFYKIGQEVQKWSGAMKRSLDGGKTWSAREQLPP 227

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP L+E+G LLCGSSVESWNSWG+W+EVT DAGR+WRKYGPIYI  E+L VI
Sbjct: 228 GILGPIKNKPFLVEDGRLLCGSSVESWNSWGAWLEVTKDAGRTWRKYGPIYIEGETLGVI 287

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ TAN T+RVL+RSF  IGRVCM++S DGG+TWSY   T L NPNSGIDGVK+KDG
Sbjct: 288 QPVPYMTANGTIRVLLRSFETIGRVCMADSADGGVTWSYVHETDLPNPNSGIDGVKMKDG 347

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R+LLAYNT SRG LKVA+S DDGDSW+D + LE+   MEFSYPAVIQ  D  +HITYTYN
Sbjct: 348 RVLLAYNTFSRGTLKVAVSMDDGDSWNDVMALEDTEGMEFSYPAVIQTMDDLIHITYTYN 407

Query: 310 RTQIK 314
           RTQIK
Sbjct: 408 RTQIK 412



 Score =  360 bits (923), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 26/308 (8%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
             T+  N    K    +   +V K  FLVAYFGG+ EGA DVKIWLQ + +G W SP + 
Sbjct: 401 HITYTYNRTQIKFDLPTLSSQVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHSPEVV 460

Query: 74  DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG 133
           DE PNVP+WNPVLF+LPS  +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPPGILG
Sbjct: 461 DEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPPGILG 520

Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           P KNK                          VT D G++W+KYGPIY+  +++ VIQPV 
Sbjct: 521 PIKNK--------------------------VTKDHGKTWKKYGPIYVRGKTMGVIQPVL 554

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
           + T++ T+R+L+R  + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDGR++L
Sbjct: 555 YQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDGRVVL 614

Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
            YN+ SRGVLKVA+S+DDGD W D LTLEE   +EFSYPAVIQ SDG VH+TYTY RTQI
Sbjct: 615 VYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYKRTQI 674

Query: 314 KHVVLLPN 321
           KHVVL P+
Sbjct: 675 KHVVLQPS 682


>gi|242045796|ref|XP_002460769.1| hypothetical protein SORBIDRAFT_02g034675 [Sorghum bicolor]
 gi|241924146|gb|EER97290.1| hypothetical protein SORBIDRAFT_02g034675 [Sorghum bicolor]
          Length = 411

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 221/316 (69%), Positives = 262/316 (82%), Gaps = 5/316 (1%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V +EFTFPA  APF SCHASTIV++D+G+FLVAYFGGS EGAPDVKIWLQ +  G W  
Sbjct: 91  IVKKEFTFPARRAPFNSCHASTIVQIDEGNFLVAYFGGSTEGAPDVKIWLQRYSGGHWHP 150

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P++ADE+  VPMWNPVLF+LPS  LLLFYKIGQEVQKWSG MKRS + G+TW  REQLPP
Sbjct: 151 PVVADEQLGVPMWNPVLFQLPSRELLLFYKIGQEVQKWSGAMKRSLDGGLTWLPREQLPP 210

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT +AGR+WRKYGPIY+  + L VI
Sbjct: 211 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTQNAGRTWRKYGPIYVEGQPLGVI 270

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS-----GIDGV 244
           QPVP+ TAN  +RVL+RSF  IGRVCM++S DGG+TWS+ + T+L NPNS     GIDGV
Sbjct: 271 QPVPYRTANGIIRVLLRSFQTIGRVCMADSFDGGVTWSFVRGTELPNPNSDLKAEGIDGV 330

Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           K+KDGR++LAYNT SRG LK+A+S +DGDSW++ LTLE+   MEFSYPAVIQ  D  +H+
Sbjct: 331 KMKDGRVVLAYNTFSRGTLKLAVSLNDGDSWNEVLTLEDTSGMEFSYPAVIQTMDQLIHV 390

Query: 305 TYTYNRTQIKHVVLLP 320
           TYTYNRTQIKHVV+ P
Sbjct: 391 TYTYNRTQIKHVVIKP 406


>gi|23617011|dbj|BAC20707.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
 gi|50508734|dbj|BAD31310.1| alpha-rhamnosidase-like protein [Oryza sativa Japonica Group]
          Length = 371

 Score =  442 bits (1137), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 205/311 (65%), Positives = 255/311 (81%), Gaps = 2/311 (0%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEFTF   + PFK+CHASTIVEV K  FLVAYFGG+ EGA DVKIWLQ + +G W S
Sbjct: 7   VLREEFTFSEGNVPFKTCHASTIVEVQKNMFLVAYFGGTQEGADDVKIWLQRYYNGSWHS 66

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P + DE PNVP+WNPVLF+LPS  +LLFYK+G+ V+ WSGCMK S + GV WS+REQLPP
Sbjct: 67  PEVVDEVPNVPLWNPVLFQLPSGEILLFYKVGKTVESWSGCMKCSSDGGVIWSKREQLPP 126

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT D G++W+KYGPIY+  +++ VI
Sbjct: 127 GILGPIKNKPFLLKDGHLLCGTSVESWNSWGAWLEVTKDHGKTWKKYGPIYVRGKTMGVI 186

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPV + T++ T+R+L+R  + +GR+C++ES D G+ WSYA+PT+L NPNSGIDGVKLKDG
Sbjct: 187 QPVLYQTSSGTIRMLLRPSDEVGRICVAESKDSGVNWSYAQPTELPNPNSGIDGVKLKDG 246

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           R++L YN+ SRGVLKVA+S+DDGD W D LTLEE   +EFSYPAVIQ SDG VH+TYTY 
Sbjct: 247 RVVLVYNSTSRGVLKVAVSQDDGDKWEDVLTLEETHGVEFSYPAVIQTSDGLVHVTYTYK 306

Query: 310 RTQIKHVVLLP 320
           RTQIK  V+LP
Sbjct: 307 RTQIK--VVLP 315


>gi|302805374|ref|XP_002984438.1| hypothetical protein SELMODRAFT_268887 [Selaginella moellendorffii]
 gi|300147826|gb|EFJ14488.1| hypothetical protein SELMODRAFT_268887 [Selaginella moellendorffii]
          Length = 329

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 203/313 (64%), Positives = 242/313 (77%)

Query: 6   GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
           G  G+V EEF F    APF SCHASTIVE++  HF+VAYFGG+ EG  DV IW Q FK G
Sbjct: 7   GRIGIVKEEFLFEKGLAPFNSCHASTIVELEPEHFMVAYFGGTYEGDSDVVIWTQRFKHG 66

Query: 66  RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
            W  P +AD E  VPMWNPVLFK+P   +LLFYKIG EVQKWSG MKRS++ GVTWS R+
Sbjct: 67  VWSDPQVADSELGVPMWNPVLFKMPKGEVLLFYKIGPEVQKWSGFMKRSFDNGVTWSARQ 126

Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            LPPGILGP KNKP+LL +G LLCGSSVESW++WG+WMEVT D+G +WRK+GPIY+ +  
Sbjct: 127 ALPPGILGPIKNKPLLLRDGRLLCGSSVESWDAWGAWMEVTEDSGYTWRKHGPIYVQHTP 186

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           + VIQPVP+ T N T+RVL+R+   IG++CM+ S DGG +W+YA PTQL NPN G DGVK
Sbjct: 187 MGVIQPVPYLTDNGTIRVLLRATQEIGKICMASSSDGGRSWTYATPTQLPNPNCGFDGVK 246

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
           L+DG LLL YNT SRG+LKV +S DDGD+W + +TLEE    EFSY AVIQ+  G VH+T
Sbjct: 247 LRDGTLLLVYNTDSRGILKVGVSGDDGDTWTEEITLEEQEGGEFSYAAVIQSFTGLVHVT 306

Query: 306 YTYNRTQIKHVVL 318
           YTYNR QIKHVVL
Sbjct: 307 YTYNRVQIKHVVL 319


>gi|168030356|ref|XP_001767689.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681009|gb|EDQ67440.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 358

 Score =  426 bits (1096), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 199/313 (63%), Positives = 242/313 (77%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            ++ EEF F    APF SCHAS+IVE+  G FLVAYFGG+ EG  DV IW   F+ G W+
Sbjct: 37  AIIKEEFLFEDGKAPFPSCHASSIVELGSGTFLVAYFGGTHEGFSDVAIWTSRFEVGLWK 96

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
            P + D EP+VP WNPVLFK+P   LL+FYKIG+EVQKWSG +KRS + GVTWS+REQLP
Sbjct: 97  PPALVDREPDVPAWNPVLFKMPDGELLIFYKIGEEVQKWSGFIKRSSDNGVTWSDREQLP 156

Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
           PGILGP+KNKP+L+++G LLCGSS ESWN+WG+WMEVT DAGR+W K+GPI++    L +
Sbjct: 157 PGILGPTKNKPLLIKDGRLLCGSSTESWNAWGAWMEVTADAGRTWAKHGPIHLSGTPLGI 216

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           IQPVPF TA+ T+RVL+R  + +GR+CM+ S D GLTWS+A PT+L+N N G DGVKL D
Sbjct: 217 IQPVPFVTAHDTVRVLLRPSSMVGRICMATSQDAGLTWSFATPTELINCNCGFDGVKLHD 276

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
           GRLL+ YNT SRG+LKVA+S DDG SW D LTLE+    EFSY AVI ASD  +H TYTY
Sbjct: 277 GRLLILYNTTSRGILKVAISADDGLSWRDYLTLEDTEGCEFSYAAVILASDKHIHATYTY 336

Query: 309 NRTQIKHVVLLPN 321
            R QIKHVVL P+
Sbjct: 337 KRRQIKHVVLQPS 349


>gi|302764718|ref|XP_002965780.1| hypothetical protein SELMODRAFT_84263 [Selaginella moellendorffii]
 gi|300166594|gb|EFJ33200.1| hypothetical protein SELMODRAFT_84263 [Selaginella moellendorffii]
          Length = 387

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 199/309 (64%), Positives = 238/309 (77%)

Query: 6   GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
           G  G+V EEF F    APF SCHASTIVE++  HF+VAYFGG+ EG  DV IW Q FK G
Sbjct: 7   GRIGIVKEEFLFEKGLAPFNSCHASTIVELEPEHFMVAYFGGTYEGDSDVVIWTQRFKHG 66

Query: 66  RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
            W  P +AD E  VPMWNPVLFK+P   +LLFYKIG EVQKWSG MKRS++ GVTWS R+
Sbjct: 67  VWSDPQVADSELGVPMWNPVLFKMPKGEVLLFYKIGPEVQKWSGFMKRSFDNGVTWSARQ 126

Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            LPPGILGP KNKP+LL +G LLCGSSVESW++WG+WMEVT D+G +WRK+GPIY+ +  
Sbjct: 127 ALPPGILGPIKNKPLLLRDGRLLCGSSVESWDAWGAWMEVTEDSGYTWRKHGPIYVQHTP 186

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           + VIQPVP+ T N T+RVL+R+   IG++CM+ S DGG +W+YA PTQL NPN G DGVK
Sbjct: 187 MGVIQPVPYLTDNGTVRVLLRATQEIGKICMASSSDGGRSWTYATPTQLPNPNCGFDGVK 246

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
           L+DG LLL YNT SRG+LKV +S DDGD+W + +TLEE    EFSY AVIQ+  G VH+T
Sbjct: 247 LRDGTLLLVYNTDSRGILKVGVSGDDGDTWTEEITLEEQEGGEFSYAAVIQSFTGLVHVT 306

Query: 306 YTYNRTQIK 314
           YTYNR QIK
Sbjct: 307 YTYNRVQIK 315


>gi|259489888|ref|NP_001159249.1| uncharacterized protein LOC100304339 [Zea mays]
 gi|223942995|gb|ACN25581.1| unknown [Zea mays]
 gi|414590413|tpg|DAA40984.1| TPA: hypothetical protein ZEAMMB73_438581 [Zea mays]
          Length = 360

 Score =  409 bits (1051), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/274 (70%), Positives = 226/274 (82%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V EE   PA  APF SCHASTIV++++ +FLVAYFGGS EGAPDVKIWLQ + DG W  
Sbjct: 81  IVKEELILPARGAPFNSCHASTIVQIEEDNFLVAYFGGSREGAPDVKIWLQRYSDGHWHP 140

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P++ADE+  VPMWNPVLF+LPS  LLLFYKIGQEVQKWSG MKRS + GVTWS REQLPP
Sbjct: 141 PVVADEQDGVPMWNPVLFQLPSRELLLFYKIGQEVQKWSGAMKRSLDGGVTWSPREQLPP 200

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAG +WRKYGPI I  + L VI
Sbjct: 201 GILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAGWTWRKYGPICIKGQPLGVI 260

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ T + T+RVL+RSF  IG VCM++S D G+TWSY + T+L NPNSGIDGVK+KDG
Sbjct: 261 QPVPYRTDDGTIRVLLRSFETIGHVCMADSIDEGVTWSYVRETELPNPNSGIDGVKMKDG 320

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEE 283
           R+LLAYNT SRG LK+A+S +DGDSW + +TLE+
Sbjct: 321 RVLLAYNTFSRGTLKLAVSLNDGDSWDEVMTLED 354


>gi|195643224|gb|ACG41080.1| hypothetical protein [Zea mays]
          Length = 305

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/288 (66%), Positives = 229/288 (79%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEFTF A+  PF +CHASTIVEV+K  FLV+YFGGS EGAPDVKIW Q + DG W  
Sbjct: 14  IMKEEFTFQAHDRPFNNCHASTIVEVEKEIFLVSYFGGSIEGAPDVKIWTQRYSDGYWHP 73

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P++ADEE    MWNPVLF+LPS+ LLLFYKIG+  Q WSG MKRS N G++W EREQLPP
Sbjct: 74  PVVADEENATAMWNPVLFQLPSHELLLFYKIGEHPQNWSGAMKRSLNGGMSWLEREQLPP 133

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT DAGR+W KYGPI+I  E L VI
Sbjct: 134 GILGPIKNKPFLLDDGRLLCGTSVESWNSWGAWLEVTEDAGRTWSKYGPIFIQGEKLGVI 193

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ T N ++R+L+RS   IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDG
Sbjct: 194 QPVPYQTTNGSIRMLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDG 253

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           R+ LAYNTVSRG LKVA+S DDG SW + +TLE     EFSYPAVIQ 
Sbjct: 254 RVALAYNTVSRGTLKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQT 301


>gi|212722798|ref|NP_001131252.1| hypothetical protein [Zea mays]
 gi|194690998|gb|ACF79583.1| unknown [Zea mays]
 gi|414590410|tpg|DAA40981.1| TPA: hypothetical protein ZEAMMB73_041549 [Zea mays]
          Length = 308

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/288 (66%), Positives = 229/288 (79%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEFTF A+  PF +CHASTIVEV+K  FLV+YFGGS EGAPDVKIW Q + DG W  
Sbjct: 17  IMKEEFTFQAHDRPFNNCHASTIVEVEKEIFLVSYFGGSIEGAPDVKIWTQRYSDGYWHP 76

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P++ADEE    MWNPVLF+LPS+ LLLFYKIG+  Q WSG MKRS N G++W EREQLPP
Sbjct: 77  PVVADEENATAMWNPVLFQLPSHELLLFYKIGEHPQNWSGAMKRSLNGGMSWLEREQLPP 136

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP LL++G LLCG+SVESWNSWG+W+EVT DAGR+W KYGPI+I  E L VI
Sbjct: 137 GILGPIKNKPFLLDDGRLLCGTSVESWNSWGAWLEVTEDAGRTWSKYGPIFIQGEKLGVI 196

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QPVP+ T N ++R+L+RS   IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDG
Sbjct: 197 QPVPYQTTNGSIRMLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDG 256

Query: 250 RLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           R+ LAYNTVSRG LKVA+S DDG SW + +TLE     EFSYPAVIQ 
Sbjct: 257 RVALAYNTVSRGTLKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQT 304


>gi|242045798|ref|XP_002460770.1| hypothetical protein SORBIDRAFT_02g034680 [Sorghum bicolor]
 gi|241924147|gb|EER97291.1| hypothetical protein SORBIDRAFT_02g034680 [Sorghum bicolor]
          Length = 216

 Score =  321 bits (822), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 151/211 (71%), Positives = 177/211 (83%)

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           MKRS + GVTWS+REQLPPGILGP KNKP LLE+G LLCGSSVESWNSWG+W+EVT DAG
Sbjct: 1   MKRSLDGGVTWSQREQLPPGILGPIKNKPFLLEDGRLLCGSSVESWNSWGAWLEVTKDAG 60

Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
           R+WRKYGPI I  + L VIQPVP+ T N T+RVL+RSF  IGRVCM++S D G+TWSY  
Sbjct: 61  RTWRKYGPICIEGQPLGVIQPVPYRTDNGTIRVLLRSFETIGRVCMADSVDEGVTWSYVH 120

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
            T+L NPNSGIDGVK+KDGR+LLAYNT SRG LK+A+S +DGDSW + +TLE+   MEFS
Sbjct: 121 ETELPNPNSGIDGVKMKDGRVLLAYNTFSRGTLKIAVSSNDGDSWDEVMTLEDTKGMEFS 180

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           YPAVIQ+ D  +HITYTYNRTQ+KHVVL P+
Sbjct: 181 YPAVIQSLDELIHITYTYNRTQVKHVVLQPS 211


>gi|297736658|emb|CBI25675.3| unnamed protein product [Vitis vinifera]
          Length = 951

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 159/231 (68%), Positives = 183/231 (79%), Gaps = 18/231 (7%)

Query: 4   DCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK 63
           D  I+G V EEFTFP+NSAPF SCHASTIVEV K HFLVAYFGG+ EGAPDVKIWLQT++
Sbjct: 2   DHWIEGPVLEEFTFPSNSAPFNSCHASTIVEVGKLHFLVAYFGGTAEGAPDVKIWLQTYR 61

Query: 64  DGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +             N+ +   ++F     G+L   K+  + Q WSGCMKRS++ GVTW+E
Sbjct: 62  E-------------NISLEKCIMF-----GILAHNKVSFKCQGWSGCMKRSFDGGVTWTE 103

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
           REQLPPGILGP KNKPILLENGLLLCGSSVESWNSWG+WMEVT D+GRSWRKYGPI+I N
Sbjct: 104 REQLPPGILGPIKNKPILLENGLLLCGSSVESWNSWGAWMEVTEDSGRSWRKYGPIFIKN 163

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQL 234
           E+LSVIQPVP+ TAN TLRVL+RSF+GI RVCMSES DGG +W YAKPT L
Sbjct: 164 ETLSVIQPVPYQTANGTLRVLLRSFDGIDRVCMSESHDGGQSWDYAKPTAL 214


>gi|406959033|gb|EKD86495.1| hypothetical protein ACD_37C00268G0002 [uncultured bacterium]
          Length = 336

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 157/313 (50%), Positives = 205/313 (65%), Gaps = 6/313 (1%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEF F     PF SCHAST+ +   G  L A+F GS EGA DV IW  T ++ +W  
Sbjct: 25  IIKEEFIFI--DPPFASCHASTLTQTHSGTILCAWFAGSEEGANDVAIWFSTCEESKWDL 82

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P    E   VP WNP LF +PS  +LLFYK G+  Q+WSG +KRS+++GV WSE E LP 
Sbjct: 83  PKKIAETEGVPCWNPALFTMPSKEILLFYKTGRNPQQWSGLLKRSWDEGVHWSEEECLPA 142

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GI+GP+KNKP+LL NG LLCGSS+ESW  WG W+++T D G +W K  PI +  +   +I
Sbjct: 143 GIIGPAKNKPLLLPNGTLLCGSSIESWRRWGCWIDITADVGHTWHKSSPINVDAQLFGII 202

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP  F   N +L++L RS + IG +C +ES D G TWS A+P  L NPNS ID V + DG
Sbjct: 203 QPTLFFGKNDSLKLLARS-HQIGSICTAESYDQGETWSKAQPISLPNPNSAIDAVNIMDG 261

Query: 250 RLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
           R++L YN     R  L +A+SKD G +W+  + LEE    E+SYP+VIQ+ D  +HITYT
Sbjct: 262 RIVLVYNHSKEERYPLNIAVSKDGGGTWNTEVVLEEEPG-EYSYPSVIQSLDKEIHITYT 320

Query: 308 YNRTQIKHVVLLP 320
           +NR  IKHV+L P
Sbjct: 321 WNRKYIKHVILDP 333


>gi|359495950|ref|XP_002263578.2| PREDICTED: uncharacterized protein LOC100263718, partial [Vitis
           vinifera]
          Length = 216

 Score =  303 bits (776), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 153/223 (68%), Positives = 173/223 (77%), Gaps = 26/223 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEFTFP+NSAPF SCHASTIVEV K HFLVAYFGG+ EGAPDVKIWLQT+       
Sbjct: 19  IMLEEFTFPSNSAPFNSCHASTIVEVGKLHFLVAYFGGTAEGAPDVKIWLQTY------- 71

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                        NP++  L     +  +K G     WSGCMKRS++ GVTW+EREQLPP
Sbjct: 72  -------------NPLIVHLDRR-QIKGWKTG-----WSGCMKRSFDGGVTWTEREQLPP 112

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKPILLENGLLLCGSSVESWNSWG+WMEVT D+GRSWRKYGPI+I NE+LSVI
Sbjct: 113 GILGPIKNKPILLENGLLLCGSSVESWNSWGAWMEVTEDSGRSWRKYGPIFIKNETLSVI 172

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
           QPVP+ TAN TLRVL+RSF+GI RVCMSES DGG +W YAKPT
Sbjct: 173 QPVPYQTANGTLRVLLRSFDGIDRVCMSESHDGGQSWDYAKPT 215


>gi|116623694|ref|YP_825850.1| hypothetical protein Acid_4606 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116226856|gb|ABJ85565.1| hypothetical protein Acid_4606 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 321

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 146/308 (47%), Positives = 195/308 (63%), Gaps = 5/308 (1%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           L A+   F A+  P  SCHASTIVE+  G  + A+FGG+ EG PDV IW      G+WQ 
Sbjct: 13  LAAQTSEFVADPMPTPSCHASTIVELASGDLMTAWFGGAGEGRPDVAIWGARRHAGKWQP 72

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+    EPN+  WNPVLF      L L+YK G   Q+W+     S + GVTWS+ E LP 
Sbjct: 73  PVELAREPNIATWNPVLFHSGDGLLWLYYKFGPSPQQWTAARMWSRDDGVTWSKAEHLPA 132

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           G+ G  + KP++L  GL++ GSSVES+NSW +W+E + D G++W ++GPI +P E   +I
Sbjct: 133 GLYGAIRAKPLVLPKGLIVSGSSVESYNSWAAWVERSTDFGKTWTRHGPI-VPPEG-GII 190

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP         LR   RS +   R+C+S+S D G+TW+ A+ T+L NPNSGID V+LKDG
Sbjct: 191 QPSVVQVEGDHLRFYARS-DKAARICVSDSYDAGITWTPARVTELPNPNSGIDVVRLKDG 249

Query: 250 RLLLAYNTVSRGVLKVALS-KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
           R L  YN + RG   + L+  +DGD W    TLE +   E+SYPA+IQASDG +HITYT+
Sbjct: 250 RYLAVYNHIPRGRTPLNLAISNDGDHWTPLKTLESDPG-EYSYPAMIQASDGLIHITYTW 308

Query: 309 NRTQIKHV 316
            R +IKHV
Sbjct: 309 RRQKIKHV 316


>gi|338734050|ref|YP_004672523.1| alpha-rhamnosidase-like protein [Simkania negevensis Z]
 gi|336483433|emb|CCB90032.1| alpha-rhamnosidase-like protein [Simkania negevensis Z]
          Length = 348

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 204/332 (61%), Gaps = 23/332 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           + L+ +EF F  ++APF+SCHAST+ E ++G  +VAYF GS EG  DV I+L    D +W
Sbjct: 20  QTLLVDEFLF--SNAPFESCHASTLTETEEG-LIVAYFAGSKEGNSDVSIYLSRQCDNKW 76

Query: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
           Q+P+   E+   P WNPVLF +PS  +LLFYK G +  +WSG +  S + G TWS    L
Sbjct: 77  QAPVKVIEDWGAPTWNPVLFTMPSGKILLFYKAGYDPTRWSGFLTSSIDAGQTWSRPFLL 136

Query: 128 PPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI-------- 179
           P GILGP KNKP+LL++G LLCGSS++S+ +W    E T D G +W +  PI        
Sbjct: 137 PGGILGPVKNKPLLLQDGRLLCGSSIQSYLNWACSFEWTRDEGLTWERSNPIPYFEERRA 196

Query: 180 -YIPNES-------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
            + P++        + VIQP  +    + + +L RS   IG +C + S DGG TW+ A P
Sbjct: 197 PFFPDKKSASKDRPIGVIQPTFWTEDGQHITMLCRS-RRIGWICKATSSDGGRTWTRAYP 255

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
           T+L NP+SG D V++ DGR+ L YN     R  L +ALS D G++W D L LE+     F
Sbjct: 256 TELPNPDSGFDAVRMFDGRIALVYNHSKTKRTPLNLALSIDGGETWKDVLVLEDGPG-SF 314

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           +YPA+IQ  DG +HITYT+NR  IKH+ L P 
Sbjct: 315 AYPAIIQTQDGLLHITYTWNRKHIKHIALDPT 346


>gi|390956309|ref|YP_006420066.1| putative neuraminidase (sialidase) [Terriglobus roseus DSM 18391]
 gi|390411227|gb|AFL86731.1| putative neuraminidase (sialidase) [Terriglobus roseus DSM 18391]
          Length = 361

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 137/319 (42%), Positives = 194/319 (60%), Gaps = 20/319 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK-DGRWQSPIIADE 75
           F   +APF S HASTIVEV KG FL A+FGG+ EGAPDV IW+     DG W +PI    
Sbjct: 35  FIYQNAPFPSAHASTIVEVGKGEFLSAWFGGTKEGAPDVAIWMSRRSVDGAWSAPIEMAR 94

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
           EP  P WNPVLF   +  L L++K G    +W+   + S + G TWS+ E LP G+ GP 
Sbjct: 95  EPGTPTWNPVLFHAKTGRLWLYFKYGPHPAEWTAARRYSDDDGKTWSQVEHLPAGVYGPI 154

Query: 136 KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI------------PN 183
           + KP +  +G ++ GSSVES++SW  W+E + D G +W + GPI +            P+
Sbjct: 155 RAKPYVAPDGTIVSGSSVESYHSWSVWVERSTDNGMTWTRTGPITLPELLHAPASALDPD 214

Query: 184 ESLSVIQP--VPFHTAN--RTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
           +++ +IQP  +P +       LR+ MRS   I ++C+++S DGG TWS A+   + NPNS
Sbjct: 215 KAIGLIQPTVIPMNDGKPGMHLRLYMRSSIQINKICVADSTDGGKTWSDARKLNVPNPNS 274

Query: 240 GIDGVKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GID V+LKD R +L YN     R  L +A+SK DG+ + +   +E     EFSYPA+IQ 
Sbjct: 275 GIDIVRLKDRRFVLLYNDTPTGRTPLNLAVSK-DGEHFTNFAIVENENPGEFSYPALIQG 333

Query: 298 SDGSVHITYTYNRTQIKHV 316
           +DG++H+TYT+ R  I++ 
Sbjct: 334 TDGNLHMTYTWQRKTIRYA 352


>gi|326799722|ref|YP_004317541.1| exported exo-alpha-sialidase [Sphingobacterium sp. 21]
 gi|326550486|gb|ADZ78871.1| exported exo-alpha-sialidase [Sphingobacterium sp. 21]
          Length = 377

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/327 (44%), Positives = 196/327 (59%), Gaps = 22/327 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           +G++ EEF +    A F SCH++TI E   G  + A+FGG+ E  PDV+I++  F++G W
Sbjct: 27  QGILKEEFLY--EKAAFPSCHSATIAETPTG-LVAAFFGGTKERHPDVEIYVCRFENGTW 83

Query: 68  QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            +P+     +  E   +P WNPVL+++P   LLLFYKIG +   W G +KRS + G TWS
Sbjct: 84  TAPVSAADGVQAEGKRLPTWNPVLYQIPGGDLLLFYKIGPKPSAWWGMLKRSKDGGKTWS 143

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
             E+LP G +GP KNKP+LL+NG L C SS E  N W    EVT D  ++WRK GP+   
Sbjct: 144 TAEKLPNGHIGPVKNKPVLLDNGQLFCPSSTEG-NGWKIHFEVTPDFAKTWRKIGPLAA- 201

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            +S+  IQP      NRTL++L RS N    +  S S D G TWS  + T L N NSG D
Sbjct: 202 KDSIDAIQPSILIHGNRTLQLLARSRNRA--LVQSWSTDNGETWSPLEKTSLPNNNSGTD 259

Query: 243 GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
            V + DGR +L YN V          R  L V++SK DG SW  AL LE++   ++SYPA
Sbjct: 260 AVTMADGRHVLVYNHVLPPGNLAKGPRTPLNVSVSK-DGKSWSAALILEDSPISQYSYPA 318

Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
           VIQ  DG +H  YT+ R +IKHVV+ P
Sbjct: 319 VIQTKDGLLHFIYTWRRQKIKHVVVDP 345


>gi|325103077|ref|YP_004272731.1| exported exo-alpha-sialidase [Pedobacter saltans DSM 12145]
 gi|324971925|gb|ADY50909.1| exported exo-alpha-sialidase [Pedobacter saltans DSM 12145]
          Length = 383

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 140/327 (42%), Positives = 203/327 (62%), Gaps = 21/327 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           KG+V  EF +  + APF SCH++TI E   G  + A+FGG+ E  PDV+I++   ++G+W
Sbjct: 32  KGIVKNEFIY--DKAPFPSCHSATIAETPTG-LVYAFFGGTHERHPDVEIYVSRQENGKW 88

Query: 68  QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            +P+     I ++   +P WNPVL+++P+  L+LFYK+G +  +W G +KRS + G TWS
Sbjct: 89  LTPVSAANGIQEDGKRLPTWNPVLYQIPNGDLMLFYKVGPKPSEWWGMLKRSKDGGKTWS 148

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
             E+LP G LGP KNKP+LL NG L+C +S E  N W    E+T D G++WRK GP+   
Sbjct: 149 AAEKLPEGYLGPVKNKPVLLSNGNLICPTSTEG-NGWNIHFEITSDFGKTWRKVGPVGRG 207

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            ++++ IQP      N  L++L R+ N    +  S S D G TW+  K T L N NSG D
Sbjct: 208 EDNINAIQPSVLDHGNGKLQILARTRNWA--IAESWSYDNGETWTPLKKTTLPNNNSGTD 265

Query: 243 GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
            V +K+G+ +L YN V         +R  L VA+SK DG +W+  L LE++   ++SYPA
Sbjct: 266 AVTMKNGKHVLVYNHVLPPKDNPKGARTPLNVAVSK-DGVNWNAVLILEDSPISQYSYPA 324

Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
           VIQ SDG +H  YT+ R ++KHVV+ P
Sbjct: 325 VIQTSDGLLHFAYTWRREKMKHVVVDP 351


>gi|409100320|ref|ZP_11220344.1| exported exo-alpha-sialidase [Pedobacter agri PB92]
          Length = 368

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 23/327 (7%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G+V +EF +  + APF SCH++TI E   G  + ++FGG+ E  PDV+I++  F DG+W 
Sbjct: 30  GVVKQEFLY--DKAPFPSCHSATIAETPTG-LVASFFGGTKERNPDVEIYISRFIDGKWL 86

Query: 69  SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P+     +  +   +P WNPVL+++P   LLLFYKIG +  +W G MK S + G+TWSE
Sbjct: 87  APVSVANGVMPDGKRLPTWNPVLYQVPGGDLLLFYKIGPKPSEWWGLMKTSKDGGITWSE 146

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
             +LP G +GP KNKP+LL NG L   SS E  N W    EVT D G++WR  GP  +P 
Sbjct: 147 ATKLPDGYIGPVKNKPVLLSNGNLFAPSSKEG-NGWKIHFEVTKDNGKTWRTVGP--LPE 203

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
             +  IQP      N  L++L R+ N    +  S S D G TWS    T L N NSG D 
Sbjct: 204 NGIKAIQPSILQHGNGKLQILARTANRA--IVESWSEDNGETWSPLTKTSLPNNNSGTDA 261

Query: 244 VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
           V +KDGR +L YN V          R  L V++S+ DG +W  AL LE++   ++SYPAV
Sbjct: 262 VTMKDGRHVLVYNHVLPPGDLAKGPRTPLNVSVSR-DGKTWSAALILEDSPISQYSYPAV 320

Query: 295 IQASDGSVHITYTYNRTQIKHVVLLPN 321
           IQ SDG +H  YT+ R +IKHVV+ P+
Sbjct: 321 IQTSDGMLHFIYTWRREKIKHVVVDPS 347


>gi|431795896|ref|YP_007222800.1| neuraminidase [Echinicola vietnamensis DSM 17526]
 gi|430786661|gb|AGA76790.1| putative neuraminidase (sialidase) [Echinicola vietnamensis DSM
           17526]
          Length = 366

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 22/323 (6%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ EEF F  ++APF+ CHAST+V +  G  + A+FGG  E  P+V IW+       W  
Sbjct: 46  VITEEFVF--DNAPFRECHASTLVGLKDGSIMAAWFGGEYERHPNVGIWIARKTGTGWSQ 103

Query: 70  PI-IADEEPN----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P  +AD + N     P WNPVLF +  + L+LFYK G   Q+W G  K S ++G TWS  
Sbjct: 104 PQKVADGQVNDTLSYPTWNPVLFNMTPDSLVLFYKEGPSPQEWWGMYKVSVDEGKTWSSA 163

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
           ++LP GILGP KNKPI L NG +L  SSVE  + W + +E+T D G++W+K   + I +E
Sbjct: 164 QKLPEGILGPIKNKPITLRNGAVLAPSSVEDEDGWRAHVEITEDHGKTWKK---VAIDHE 220

Query: 185 -SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
               VIQP     A+  L+VL RS     R+  + S D G +W     T++LNPNSG D 
Sbjct: 221 GEYDVIQPSILQHADGRLQVLCRSQQ--NRIITAWSEDQGRSWGQLSATEVLNPNSGTDA 278

Query: 244 VKLKDGRLLLAYN--------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
           V L DGR LL YN        +  R  L++A+S  DG SW+D +TLE   + EFSYPA+I
Sbjct: 279 VTLADGRFLLVYNPEVSGKDWSDGRNELRLAVST-DGVSWNDVMTLEHEPSGEFSYPAII 337

Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
           Q +DG VHI+YT+ R +IK+VVL
Sbjct: 338 QTTDGKVHISYTWKRKKIKYVVL 360


>gi|149176178|ref|ZP_01854794.1| hypothetical protein PM8797T_29628 [Planctomyces maris DSM 8797]
 gi|148845045|gb|EDL59392.1| hypothetical protein PM8797T_29628 [Planctomyces maris DSM 8797]
          Length = 350

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 190/319 (59%), Gaps = 17/319 (5%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           GLV +EF +  + APF SCHASTI E  +G  + A+FGG+ E  PDV+IWL       W 
Sbjct: 33  GLVMQEFIY--DQAPFPSCHASTIAETPEG-LVCAFFGGTAEKNPDVEIWLSRNTGDGWS 89

Query: 69  SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P+     + D     P WNPVLF+     LLLFYK+G    +W G +K S++ G TW +
Sbjct: 90  APVSVADGVKDATKRWPCWNPVLFQTKPGTLLLFYKVGPNPSEWWGMLKISHDNGKTWGK 149

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
             +LP G +GP KNKP LL +G LLC +S E  +SW   ME T D G++W + GP+    
Sbjct: 150 ARRLPDGFVGPVKNKPFLLSDGTLLCPASTEH-DSWQLQMEWTPDLGKTWHRTGPLNDGR 208

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
           E +  IQP      ++ L++L RS    G++  + S D G +WS   PT L NPNSG D 
Sbjct: 209 E-IGAIQPSVLQYGDK-LQILCRSRQ--GKIVEAWSEDNGRSWSKVTPTSLPNPNSGTDA 264

Query: 244 VKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           V LKDGR LL YN   +G   L VA+S +DG  W   L LE+    E+SYPAVIQ +DG 
Sbjct: 265 VTLKDGRALLVYNPTKKGRSPLHVAIS-EDGKHWKTGLVLEDKKG-EYSYPAVIQTADGK 322

Query: 302 VHITYTYNRTQIKHVVLLP 320
           VHITYT+ R  +KHVV+ P
Sbjct: 323 VHITYTWRRELVKHVVIDP 341


>gi|332666300|ref|YP_004449088.1| BNR/Asp-box repeat domain-containing protein [Haliscomenobacter
           hydrossis DSM 1100]
 gi|332335114|gb|AEE52215.1| BNR/Asp-box repeat domain protein [Haliscomenobacter hydrossis DSM
           1100]
          Length = 338

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 188/328 (57%), Gaps = 24/328 (7%)

Query: 9   GLVAEEFTFPAN----SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
           GL A+   F +      APF SCHAST+ E  +G  + A+FGG+ E   DV IW     +
Sbjct: 13  GLTAQSINFTSEFIYEQAPFPSCHASTLAETPQG-LIAAWFGGTHERHKDVGIWSSRLVN 71

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W SP+     + +     P WNPVLF++P   LLLFYK+G     W G + RS + G+
Sbjct: 72  GKWTSPVELANGVQNASLRHPCWNPVLFQMPGAELLLFYKVGPSPADWWGMLMRSTDGGL 131

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           +WS  E+LP GILGP KNKP+LL++G LLC SS E  + W    E T DAGR+W+K   I
Sbjct: 132 SWSTPEKLPEGILGPIKNKPVLLKDGTLLCPSSSEH-DGWRVHFESTKDAGRTWQKTAAI 190

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
               E  S IQP     A+  L++L RS NG   V  + S D G TWS  K T + NPNS
Sbjct: 191 NDGKE-FSAIQPSVLFHADGRLQILCRSKNGF--VLEAFSKDNGQTWSPLKKTSIPNPNS 247

Query: 240 GIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           G D V L+DGR LL YN V+         R  L V+LS  DG  W     LE+    EFS
Sbjct: 248 GTDAVTLRDGRQLLVYNHVTKRSRQWGGDRSPLNVSLST-DGKHWKSMTVLEQEPKQEFS 306

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YPAVIQ  DG VHITYT+ R +IK+V +
Sbjct: 307 YPAVIQTQDGRVHITYTWKRQRIKYVSM 334


>gi|255532668|ref|YP_003093040.1| exported exo-alpha-sialidase [Pedobacter heparinus DSM 2366]
 gi|255345652|gb|ACU04978.1| exported exo-alpha-sialidase [Pedobacter heparinus DSM 2366]
          Length = 378

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 23/327 (7%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G++ +EF +    A F SCH++TI E   G  + AYFGG+ E  PDV+I++    +G W 
Sbjct: 31  GIITDEFLY--EKAAFPSCHSATIAETPTG-LVAAYFGGTHERHPDVEIYVSRQVNGTWL 87

Query: 69  SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P+     I +++  +P WNPVL+++P   LLLFYKIG +  +W G M+ S + G+TWSE
Sbjct: 88  APVSVANGIQNDKVRLPTWNPVLYQVPGGELLLFYKIGPKPSEWWGMMRSSKDGGITWSE 147

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
            ++LP G +GP KNKP+LL NG L C SS E    W    EVT D G++WR  GP  +  
Sbjct: 148 AQKLPEGQIGPVKNKPVLLSNGNLFCPSSTEG-KGWKVHFEVTKDNGKTWRLIGP--LEG 204

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
             ++ IQP      N  L++L RS N    +  S S D G TWS    T L N NSG D 
Sbjct: 205 GEINAIQPSILDHGNGKLQILARSRNRA--IVESWSQDNGETWSALAKTSLPNNNSGTDA 262

Query: 244 VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
           V +KDGR +L YN V         +R  L VA+SK DG +W  AL LE++   ++SYPAV
Sbjct: 263 VTMKDGRHVLVYNHVLPPGDLAKGARTPLNVAISK-DGKNWSAALILEDSPTSQYSYPAV 321

Query: 295 IQASDGSVHITYTYNRTQIKHVVLLPN 321
           IQ SDG +H  YT+ R +IKHVV+ P+
Sbjct: 322 IQTSDGLLHFIYTWRREKIKHVVVDPS 348


>gi|338209632|ref|YP_004653679.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
 gi|336303445|gb|AEI46547.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
          Length = 806

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 144/324 (44%), Positives = 186/324 (57%), Gaps = 23/324 (7%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV EEF F  ++ P  SCHAS+IV++   H +  +FGG+ EG PDV IWL  ++ G+W  
Sbjct: 27  LVKEEFIF--DTPPVPSCHASSIVQLGDQHLMATWFGGTAEGKPDVTIWLAEYRKGKWGQ 84

Query: 70  PI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P      I D     P WNPVLFK     L+L+YK+G   ++W G ++ S + G  WS  
Sbjct: 85  PKQMADGIMDAGKRYPCWNPVLFKTREGKLMLYYKVGPNPREWWGMVRYSTDNGKNWSAP 144

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIYIP 182
           E+LP GI+GP KNKPI L NG +L  +S ES   N W   +E T   G++W+K   I I 
Sbjct: 145 EKLPEGIMGPIKNKPIQLPNGDILHPTSTESVTGNIWHVHLEKTDKNGKNWQK---ISID 201

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           N     IQP      +  +++L RS   +  V  + S DGG TWS      L NPNSG D
Sbjct: 202 NGDFGAIQPSILTYPDGKMQMLCRSRQNV--VVQTWSSDGGKTWSPVSALSLPNPNSGTD 259

Query: 243 GVKLKDGRLLLAYNTVSRGV--------LKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
            V LK+G  +L YN + RG         L+VA SK DG +W D  TLE+    EFSYPAV
Sbjct: 260 AVTLKNGTQILIYNPLIRGSDWAKGRNKLRVAASK-DGTNWQDIYTLEDEGKGEFSYPAV 318

Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
           IQ +DG VHITYT+ R  +KHVVL
Sbjct: 319 IQTTDGLVHITYTHERKNVKHVVL 342


>gi|86142035|ref|ZP_01060559.1| putative neuraminidase [Leeuwenhoekiella blandensis MED217]
 gi|85831598|gb|EAQ50054.1| putative neuraminidase [Leeuwenhoekiella blandensis MED217]
          Length = 351

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 145/326 (44%), Positives = 185/326 (56%), Gaps = 25/326 (7%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V E F +    APF  CHAST+VE   G  + A+FGG  E  P+V I+     D  W +
Sbjct: 31  IVQEGFIY--EEAPFPQCHASTLVEAQDGSIIAAWFGGEYERHPEVSIYQSKLTDTVWST 88

Query: 70  PI-IAD-----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           P  IAD     +  + P WNPVLFK   N L+L+YK G     W G +K S + G +WS 
Sbjct: 89  PQKIADGKVENDTLSYPTWNPVLFKNAENTLMLYYKEGPSPSSWWGKLKTSEDNGASWSA 148

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYI 181
            EQLP GILGP KNKPI L NG ++  SSVES +   W S +E++ D G +W +   + I
Sbjct: 149 SEQLPDGILGPIKNKPIQLANGSIVSPSSVESEDGAVWKSHIELSSDNGYTWER---VAI 205

Query: 182 PN-ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           P+ +++ VIQP      N  L+ L+RS      +  SES D G TWS A    +LNPNSG
Sbjct: 206 PSVDTVKVIQPTLIQLQNGNLKALLRSDQNY--ILESESSDQGKTWSEATKGTMLNPNSG 263

Query: 241 IDGVKLKDGRLLLAYNTVSRGV--------LKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           ID V LK G  LL YN    G         L +A S  DG  W D L LE     EFSYP
Sbjct: 264 IDAVTLKSGNFLLVYNPTEAGADWSDGRNKLNLAYST-DGSHWEDVLQLENEAEGEFSYP 322

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
           A+IQ S+G VHITYT+NR++IK++ L
Sbjct: 323 AIIQDSEGLVHITYTHNRSKIKYLQL 348


>gi|326801668|ref|YP_004319487.1| neuraminidase [Sphingobacterium sp. 21]
 gi|326552432|gb|ADZ80817.1| putative neuraminidase [Sphingobacterium sp. 21]
          Length = 343

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 188/323 (58%), Gaps = 21/323 (6%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ +E  F   + PFK CHAST++EV  G  L A FGGS EG  DV IWL      +  +
Sbjct: 27  IIKQELVF--TNPPFKECHASTLLEVKPGILLAACFGGSQEGKEDVSIWLCKINSEKIST 84

Query: 70  P-IIADEEPN----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
             IIAD   N     P WNPVLFK  +    LFYK+G   + W G +K S + G+ WS  
Sbjct: 85  AQIIADGVINDTLRYPTWNPVLFKSQAGKTFLFYKVGPNPRAWWGMVKTSTDDGLNWSSP 144

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
           ++L  GILGP KNKPI L NG +L  SSVE + N W + +E++ D G SW+ Y PI   +
Sbjct: 145 QKLTDGILGPIKNKPIQLANGYILSPSSVEVTENKWRAHIELSKDDGESWQ-YIPIDTAS 203

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
           E  +VIQP      N+ L+VL RS    G +  S S DGGL W     T LLNPNSG D 
Sbjct: 204 E-YNVIQPSILTYKNQRLQVLCRSKE--GSIIQSWSEDGGLHWGSLNKTALLNPNSGTDA 260

Query: 244 VKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
           V L+DGR L+ YN          +RG L++A S  DG  W+D   LE+    E+SYPA+I
Sbjct: 261 VTLRDGRQLIVYNPDVPGKEWFNNRGKLRIACSM-DGQKWNDVAVLEDGTNEEYSYPAII 319

Query: 296 QASDGSVHITYTYNRTQIKHVVL 318
           Q SDG +H+ YTY+R  IKHVVL
Sbjct: 320 QTSDGLIHVLYTYDRKNIKHVVL 342


>gi|283778151|ref|YP_003368906.1| hypothetical protein Psta_0356 [Pirellula staleyi DSM 6068]
 gi|283436604|gb|ADB15046.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
          Length = 357

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 193/321 (60%), Gaps = 18/321 (5%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G++  EF    N  P   CHA+TI E   G  + A+FGG+ EG  DV IW+   ++G+WQ
Sbjct: 40  GVLVSEFI--DNPTPTPQCHATTIAETPAG-LVAAWFGGTREGNKDVVIWVARHREGKWQ 96

Query: 69  SPI------IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            P+      ++D E + P WNPVLF++P   LLLFYK+G     W G +  S + G TW 
Sbjct: 97  PPVEVASGKVSDTERH-PTWNPVLFQMPDGPLLLFYKVGPSPSTWWGMLTSSSDNGDTWE 155

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
              +LP  I GP KNKPILL+N  LLC SS E  N W   ME+T DAG++W +   +   
Sbjct: 156 VPTKLPADIAGPIKNKPILLDNKTLLCPSSTED-NGWRVHMEMTTDAGKTWTRTEAL-CD 213

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            +S+  IQP      ++ L++L R+    G++  S S DGG TWS  +   L+NPNSGID
Sbjct: 214 GKSVQAIQPTVLRHGDK-LQMLCRT-RLPGKIVESWSEDGGKTWSKLESIALVNPNSGID 271

Query: 243 GVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           G+ LK G  LL YN   ++R  L +ALS  DG +W+D L LE +   E+SYPA+IQ SDG
Sbjct: 272 GLTLKSGEHLLVYNHTLIARSPLNLALSS-DGKTWNDILDLETSKG-EYSYPAIIQTSDG 329

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
            VHITYT+ R + KHVV+ P+
Sbjct: 330 LVHITYTWKRLRAKHVVIDPS 350


>gi|373958303|ref|ZP_09618263.1| exported exo-alpha-sialidase [Mucilaginibacter paludis DSM 18603]
 gi|373894903|gb|EHQ30800.1| exported exo-alpha-sialidase [Mucilaginibacter paludis DSM 18603]
          Length = 411

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 146/329 (44%), Positives = 192/329 (58%), Gaps = 23/329 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR 66
           KG++ +EF +    APF  CHA+TI E D+G  + A+FGG+ E  PDV IW+ +  K+G+
Sbjct: 61  KGILTDEFIY--EKAPFPECHAATIAETDEG-LVAAWFGGTKERNPDVCIWISRQKKNGK 117

Query: 67  WQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
           W +P      I  +      WNPVL+++P   L+LFYKIG +   W G  K S + G TW
Sbjct: 118 WSTPENVANGIQPDTTRYACWNPVLYQVPHGQLILFYKIGPKPATWKGFYKTSDDGGKTW 177

Query: 122 SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
           SE++ LP  ILGP KNKP+LL NG LL  SS E    W    E++ D G++W+  GP+  
Sbjct: 178 SEQQALPNSILGPIKNKPVLLSNGQLLSPSSTEE-KGWDVHFEMSPDNGKTWKLIGPV-A 235

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
             E+   IQP      N +L+ L RS +    +  + S D GLTWS    T L N NSG 
Sbjct: 236 DGETQGGIQPSILFHKNGSLQALCRSKDRA--IVETWSTDNGLTWSPLAKTSLPNNNSGT 293

Query: 242 DGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           D V LKDGR LL YN V         +R  L VALSK DG  W  +L LE++   ++SYP
Sbjct: 294 DAVTLKDGRQLLVYNHVLPPGKLAKGARTPLNVALSK-DGKKWFASLILEDSPISQYSYP 352

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           AVIQ SDG VH  YT+ R +IKHVV+ P+
Sbjct: 353 AVIQTSDGLVHFVYTWRRQRIKHVVVDPS 381


>gi|255035915|ref|YP_003086536.1| BNR repeat-containing glycosyl hydrolase [Dyadobacter fermentans
           DSM 18053]
 gi|254948671|gb|ACT93371.1| BNR repeat-containing glycosyl hydrolase [Dyadobacter fermentans
           DSM 18053]
          Length = 361

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 140/328 (42%), Positives = 191/328 (58%), Gaps = 21/328 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           +G++ +E  F   +APF   HASTI E   G  + A+FGG+ E  PDV IW+   ++ +W
Sbjct: 27  EGILVDENIF--ETAPFPESHASTIAETPAG-LVAAWFGGTKERNPDVGIWVSRLENDKW 83

Query: 68  QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            +P+     + +E+     WNPVL+++P   L+LFYK+G  V  W G +  S + G TWS
Sbjct: 84  TAPVEVANGVVNEKLRYACWNPVLYQIPKGELMLFYKVGPNVAGWKGFLITSKDNGKTWS 143

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
           +   L  G LGP KNKP+LL NG L C SS E    W    EVT D G++WRK GP+   
Sbjct: 144 KPTALQEGYLGPVKNKPVLLANGELWCPSSTEGEGGWRLHFEVTPDFGKTWRKVGPLN-D 202

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            +++  IQP     AN  +++L RS +    +  S S DGG TWS   P+ L N NSG D
Sbjct: 203 GKTIKAIQPSLLTYANGDMQILCRSQS--RSIVESWSKDGGKTWSEVGPSALPNNNSGTD 260

Query: 243 GVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
            V LKDGR LL YN V          R  L VA+SK DG +W   L LE++   ++SYP+
Sbjct: 261 AVTLKDGRQLLVYNHVKPPAGQPKGPRTPLNVAVSK-DGKNWEAVLILEDSPISQYSYPS 319

Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLPN 321
           VIQ++DG VHI YT+ R +IKHV + P+
Sbjct: 320 VIQSTDGMVHIVYTWRRQRIKHVKIDPS 347


>gi|171913078|ref|ZP_02928548.1| hypothetical protein VspiD_17900 [Verrucomicrobium spinosum DSM
           4136]
          Length = 1006

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 181/329 (55%), Gaps = 17/329 (5%)

Query: 2   KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
           K      GL + EF +  + APF SCHASTI +   G  L ++FGG+ EG PDV IW+  
Sbjct: 670 KTQAAAAGLTSSEFIY--DKAPFPSCHASTIAQAADGTLLTSWFGGTAEGKPDVGIWVSR 727

Query: 62  FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
              G+W  P+     +  +    P WNPVLF+     L LFYK+G   Q W G ++ S +
Sbjct: 728 QVGGKWSEPVEVANGLQADGTRHPCWNPVLFQPKQGPLWLFYKVGPSPQTWWGMLRTSTD 787

Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW---NSWGSWMEVTVDAGRSW 173
            G TW E  +LP GILGP KNKP+ L NG +LC +S ES    + W    E T D G++W
Sbjct: 788 GGKTWGEAVRLPDGILGPIKNKPVQLANGDILCPTSTESPTKPSKWAVHFERTSDMGKTW 847

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
               P     E++  IQP         L  L RS  G  +V   ES D G TW   +   
Sbjct: 848 TAT-PALHDGEAIRAIQPSVLLAGGDKLIALGRSAQG--KVFEIESNDLGKTWGTIRLGS 904

Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSY 291
           L NPNSG D V LKDGR LL YN  S+G   L +A+SKD G +W  AL LE     E+SY
Sbjct: 905 LPNPNSGTDAVTLKDGRHLLIYNHTSKGRSPLNLAISKD-GVTWQAALVLESEKG-EYSY 962

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           PA+IQ  DG VH+TYT+ R ++KH V+ P
Sbjct: 963 PAIIQTPDGKVHVTYTWKREKVKHAVIDP 991


>gi|149280002|ref|ZP_01886127.1| hypothetical protein PBAL39_23968 [Pedobacter sp. BAL39]
 gi|149229199|gb|EDM34593.1| hypothetical protein PBAL39_23968 [Pedobacter sp. BAL39]
          Length = 333

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 140/324 (43%), Positives = 185/324 (57%), Gaps = 25/324 (7%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
           V EE  F    APF+ CHASTI E+     + A+FGG  EG+ DV IW    K+G+W  P
Sbjct: 15  VTEEMIF--QDAPFQQCHASTIAEISGARIMAAWFGGKHEGSNDVCIWSSVLKNGKWSEP 72

Query: 71  II-----ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
                  +DEE   P+WNPVLF   S  L L+YK+G+  ++W G    S + G TW+  +
Sbjct: 73  RKLADGKSDEETAYPLWNPVLFMNKSGKLFLYYKMGRSPREWWGMEMSSLDGGKTWTAPK 132

Query: 126 QLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE- 184
           +LP GILGP KNKP+ L  G++L  SS E+   W + +E + D G +W +     IP + 
Sbjct: 133 RLPDGILGPIKNKPVELVGGIILSPSSTETTKRWKAHIERSTDGGETWTR-----IPVDE 187

Query: 185 --SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
             +  VIQP      +  +++L RS +G+  V  S S D GL+WS    T LLNPNSG D
Sbjct: 188 KGAYDVIQPSVMVYPDGKIQLLCRSKHGV--VMESWSSDQGLSWSPLVKTSLLNPNSGTD 245

Query: 243 GVKLKDGRLLLAYNTV------SRGVLKVALSKD-DGDSWHDALTLEENLAM-EFSYPAV 294
           GV L  G+ L  YN          G  K+AL++  DG  W D L LE+     EFSYPA+
Sbjct: 246 GVSLASGKQLFVYNPQIPGNDWFNGRYKLALTESTDGKDWLDQLMLEDGKETDEFSYPAI 305

Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
           IQ+ DG +HITYTYNR  IK+VVL
Sbjct: 306 IQSQDGKIHITYTYNRKNIKYVVL 329


>gi|325110761|ref|YP_004271829.1| FG-GAP repeat-containing protein [Planctomyces brasiliensis DSM
           5305]
 gi|324971029|gb|ADY61807.1| FG-GAP repeat protein [Planctomyces brasiliensis DSM 5305]
          Length = 1006

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 27/337 (8%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           K+    GLV+E F +   +APFK CHASTI E  +G  + ++FGG+ EG  DV IW   F
Sbjct: 659 KEPRFPGLVSETFIY--ENAPFKECHASTICETTRG-LVASWFGGTKEGNKDVGIWTSYF 715

Query: 63  KDGRW-QSPIIAD----EEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYN 116
              RW ++ + AD    E    P WNPVLF+ P +   LLF+K+G   ++W G M  SY+
Sbjct: 716 DGQRWSRTSLAADGVQHEGLRYPCWNPVLFQPPGDAPTLLFFKVGPNPREWWGEMMVSYD 775

Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVT--VDA--GR 171
           +G T+ +R +LP GI GP + KPILL++G  LLCGSS E ++ W    E    VD     
Sbjct: 776 RGRTFRDRVRLPEGIDGPVRCKPILLDDGSTLLCGSSTE-YDGWRVHFESVQLVDGIPNG 834

Query: 172 SWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           +W++ GPI    E  + IQP     A+  L+VL R+   +  +  S S DGG TWS  K 
Sbjct: 835 TWKRVGPINDGKE-FNAIQPTFLQHADGRLQVLCRTKEAV--ITSSHSTDGGETWSTMKA 891

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEE 283
             L NPNSGI+ V LKDGR L+ YN +         RG+L +A+S DDG++W     +E 
Sbjct: 892 IDLPNPNSGIEAVTLKDGRHLMIYNHLGSGASGWGRRGLLNLAIS-DDGENWRKVAVVER 950

Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
               EFSYPA+IQ  DG VHITYT+ R ++KH+V+ P
Sbjct: 951 EQRGEFSYPAIIQTEDGKVHITYTWKRQRVKHLVVDP 987


>gi|256420851|ref|YP_003121504.1| alpha-L-rhamnosidase [Chitinophaga pinensis DSM 2588]
 gi|256035759|gb|ACU59303.1| alpha-L-rhamnosidase [Chitinophaga pinensis DSM 2588]
          Length = 1251

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 185/327 (56%), Gaps = 22/327 (6%)

Query: 8    KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
            KG+V +E+ F    APF   HA+TI E   G  + A+FGG+ E  PDV IW+   +   W
Sbjct: 919  KGIVEDEYIF--MDAPFPESHAATIAETSDG-LVAAWFGGTKERNPDVGIWVSRKEGNTW 975

Query: 68   QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
             +P+     I  +   V  WNPVL+++P   L L+YK G +V  W G M+ S + G TWS
Sbjct: 976  TAPVEVANGILSDTLRVACWNPVLYQVPGGELHLYYKTGTKVATWVGWMRTSNDGGRTWS 1035

Query: 123  EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
              + LP G LGP KNKP+LL+NG LLC SS E  + W    E T D G++W   GPI   
Sbjct: 1036 AAKALPEGFLGPVKNKPVLLDNGELLCPSSTEG-SGWKVHFECTPDNGKTWTMRGPIN-D 1093

Query: 183  NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
             ++ + IQP         L++L RS    G V  S S D G TWS    T+L N NSG D
Sbjct: 1094 GKTFNTIQPSILTYGKGKLQILCRSKE--GSVVQSWSTDNGRTWSPMSATELPNNNSGTD 1151

Query: 243  GVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
             V LKDGR L+ YN V         +R  L VA+S +DG  W  AL LE++   ++SYP+
Sbjct: 1152 AVTLKDGRQLIVYNHVKTPKGKSKGARTPLNVAIS-EDGIHWSAALVLEDSPVSQYSYPS 1210

Query: 294  VIQASDGSVHITYTYNRTQIKHVVLLP 320
            VIQ +DG VHI YT+ R +IKHV + P
Sbjct: 1211 VIQTADGYVHIVYTWRRQRIKHVKIDP 1237


>gi|431797036|ref|YP_007223940.1| hypothetical protein Echvi_1665 [Echinicola vietnamensis DSM 17526]
 gi|430787801|gb|AGA77930.1| hypothetical protein Echvi_1665 [Echinicola vietnamensis DSM 17526]
          Length = 379

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 192/334 (57%), Gaps = 21/334 (6%)

Query: 2   KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
           +K+    G+V +EF +    AP+ SCHA+TI E  +G  + A+FGG+ E  PDV IW+  
Sbjct: 25  QKETWRDGIVVDEFIY--EEAPYPSCHAATIAETPEG-LVAAWFGGTHERNPDVGIWVSH 81

Query: 62  FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
            KDG+W   +     + ++    P WNPVL+++P   L L+YK+G    +W G +  S +
Sbjct: 82  RKDGKWTESVEVANGVINDTLRYPTWNPVLYQVPDGDLQLYYKVGPHPSQWWGMLITSAD 141

Query: 117 KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
            G TWS  + LP   +GP KNKP+LL+NG L+  SS E  N W    EVT D G +W   
Sbjct: 142 GGHTWSAPKALPDSAIGPVKNKPVLLDNGALIAPSSTEG-NGWNIHFEVTPDFGETWENV 200

Query: 177 GPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
           GPI    + ++ IQP      N  L++L R+ N    +  S S D G TWS  + + L N
Sbjct: 201 GPIARGEDDINAIQPSVLDHGNGKLQILARTRNRA--IGTSWSEDYGKTWSPMEKSNLPN 258

Query: 237 PNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAM 287
            NSG D V L+DGR LL YN V          R  L V+LS DDG  W+ +L LE++   
Sbjct: 259 NNSGTDAVTLQDGRHLLVYNHVLPPGEKAKGPRTPLNVSLS-DDGIHWNASLILEDSEIS 317

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           ++SYP+VIQ++DG VHI YT+ R +IK+V + P+
Sbjct: 318 QYSYPSVIQSADGMVHIVYTWRRERIKYVKIDPS 351


>gi|375146864|ref|YP_005009305.1| alpha-L-rhamnosidase [Niastella koreensis GR20-10]
 gi|361060910|gb|AEV99901.1| alpha-L-rhamnosidase [Niastella koreensis GR20-10]
          Length = 1278

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 138/326 (42%), Positives = 188/326 (57%), Gaps = 22/326 (6%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            G+V +EF F   S  F   HA+TI E  KG  + A+FGG+ E  PDV IW+    + +W 
Sbjct: 939  GIVTDEFIFDRTS--FPESHAATIAETPKG-LVTAWFGGTKERNPDVCIWVSRQVNNKWT 995

Query: 69   SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
             P+     I ++      WNPVL++ P+  LLLFYK+G  V  W G + RS + GV+W +
Sbjct: 996  EPVNVANGIINDSLRYACWNPVLYQAPNGELLLFYKVGPNVAGWKGWLIRSKDSGVSWGK 1055

Query: 124  REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
             E LP G LGP KNKP+LL NG L+C SS E  + W    EVT D G++WR  GPI  P+
Sbjct: 1056 PEALPEGFLGPIKNKPVLLSNGDLICPSSREG-SGWTVLFEVTSDFGKTWRMVGPIN-PD 1113

Query: 184  ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
            + ++ IQP      +  +++L RS      V  S S D G TW+    + L N NSG D 
Sbjct: 1114 KKINAIQPSILTYKDGKMQILCRS--KTSSVVESWSTDNGKTWTPLAESTLPNNNSGTDA 1171

Query: 244  VKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
            V LKDGR L+ YN V         +R  L VA+S  DG  W  AL LE++   ++SYP+V
Sbjct: 1172 VTLKDGRQLIVYNHVATPKGAPKGARTPLNVAVS-SDGKLWQAALVLEDSPVSQYSYPSV 1230

Query: 295  IQASDGSVHITYTYNRTQIKHVVLLP 320
            IQ++DG VHI YT+ R ++K+V + P
Sbjct: 1231 IQSADGMVHIVYTWRRQRVKYVKVDP 1256


>gi|430746327|ref|YP_007205456.1| neuraminidase [Singulisphaera acidiphila DSM 18658]
 gi|430018047|gb|AGA29761.1| putative neuraminidase (sialidase) [Singulisphaera acidiphila DSM
           18658]
          Length = 358

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 176/318 (55%), Gaps = 15/318 (4%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G V  E  +   SAPF  CHASTI E   G  + ++FGG+ EG  DV IW+ +     W 
Sbjct: 38  GFVKSELIY--ESAPFPKCHASTIAEA-GGRLVASWFGGTDEGNSDVGIWVASHDGKAWT 94

Query: 69  SPIIA----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           +P+      D +   P WNPVLF+     LLLFYK+G     W G +  S + G TWS  
Sbjct: 95  TPVEVANGIDGDKRYPCWNPVLFQPKKGPLLLFYKVGPTPANWWGMVTTSDDAGKTWSTP 154

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
            +LP   LGP KNKPI L  G LLC SS E  + W    E T D G  W   GPI     
Sbjct: 155 RRLPDPFLGPIKNKPIQLPEGDLLCPSSSED-HGWRVHFERTADLGYHWELIGPIN-DGV 212

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           +   IQP      +  L+ L R+     +V  S S D G TWS    T L NPNSGIDGV
Sbjct: 213 TFGAIQPTLLTHPDGRLQALSRTKQ--AKVVESWSSDRGKTWSELAATTLPNPNSGIDGV 270

Query: 245 KLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            L DGR LL YN   RG   L +A+S +DG +W  AL LE+    EFSYPAVIQ +DG V
Sbjct: 271 TLADGRQLLVYNHTRRGRSPLNIAVS-EDGHAWRSALILEDQPG-EFSYPAVIQGADGLV 328

Query: 303 HITYTYNRTQIKHVVLLP 320
           H+TYT+NRT+I+HVV+ P
Sbjct: 329 HVTYTWNRTRIRHVVVDP 346


>gi|333377826|ref|ZP_08469559.1| hypothetical protein HMPREF9456_01154 [Dysgonomonas mossii DSM
           22836]
 gi|332883846|gb|EGK04126.1| hypothetical protein HMPREF9456_01154 [Dysgonomonas mossii DSM
           22836]
          Length = 365

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 190/329 (57%), Gaps = 25/329 (7%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           +G++ ++F +  +SA F S HA+TI E   G  + A+FGG  EG P+V I+        W
Sbjct: 35  EGILVDQFIY--DSAEFPSVHAATIAETPSG-LITAFFGGKYEGHPEVNIYTSRHTATGW 91

Query: 68  QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
             P+     I  +      +NPVLF+ P   LLLFYKIG+ VQ W+G + RSYN G TW+
Sbjct: 92  TPPVKVADGIVSDTLRKACYNPVLFQYPDGELLLFYKIGKNVQDWTGYLIRSYNDGQTWT 151

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
           + E LP G LGP KNKPIL+ N  L+C SS E+ + W   ME T D G++WRK  P  I 
Sbjct: 152 KPEALPEGFLGPIKNKPILIGNK-LICASSTEN-DGWQVHMEFTEDRGKTWRKTSP--IN 207

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           ++  ++IQP      N  L+++ RS N    +  + S D G TWS      L N NSG D
Sbjct: 208 SKPWNIIQPSILTLNNNVLQIVCRSQN--EHIISAFSKDEGETWSDPIALYLPNNNSGTD 265

Query: 243 GVKLKDGRLLLAYNTV----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            V LKDGR L+ YN V          +R  L VALS DDG +W  +L LE++   E+SYP
Sbjct: 266 AVTLKDGRQLMVYNHVAASESAYKDKARTPLNVALS-DDGITWKASLILEDSPIKEYSYP 324

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           +VIQ  DG +HI YT+ R +IK+V + P+
Sbjct: 325 SVIQGHDGMIHIVYTWRREKIKYVKVDPS 353


>gi|168701252|ref|ZP_02733529.1| hypothetical protein GobsU_17136 [Gemmata obscuriglobus UQM 2246]
          Length = 284

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 121/283 (42%), Positives = 170/283 (60%), Gaps = 5/283 (1%)

Query: 41  LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKI 100
           +  +FGG  EGA DV+IW  TF   +W  P +   EP  P WNPVLFK     L L+YK 
Sbjct: 1   MATWFGGKAEGAKDVEIWASTFDGKKWSEPKVFGTEPGQPCWNPVLFKSAKGTLFLWYKA 60

Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG 160
           G +   W+G ++ S + G TW++ E +P   +GP + KPI L NG +L G+S ES+ +W 
Sbjct: 61  GPKPDNWTGYVRTSADNGKTWTKPEMMPSTFMGPVRAKPIQLANGTILAGTSWESYRNWV 120

Query: 161 SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
            +++ + D G++W++  P  +P E  + IQP  F   +  + VLMRS N +  VC SES 
Sbjct: 121 PFVDRSTDEGKTWKRSNPFPVP-EKFNQIQPALFEAKDGKIVVLMRSRNPL-TVCRSESK 178

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDA 278
           DGG T+S A+ T L NP+SGID V+ K+G + L YN  S  R  + +A S DDG +W   
Sbjct: 179 DGGETFSPAEETSLANPSSGIDCVRTKEGDVFLIYNPTSVLRTPISLARSTDDGKTWKKV 238

Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LE     EFSYPA+I++S G++ ITYT+ RT IKH  + P 
Sbjct: 239 ADLETEPG-EFSYPAIIESSAGTLEITYTWKRTHIKHQSVDPK 280


>gi|284036722|ref|YP_003386652.1| BNR/Asp-box repeat domain-containing protein [Spirosoma linguale
           DSM 74]
 gi|283816015|gb|ADB37853.1| BNR/Asp-box repeat domain protein [Spirosoma linguale DSM 74]
          Length = 348

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 142/330 (43%), Positives = 192/330 (58%), Gaps = 28/330 (8%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            ++  EF +    APF  CHASTI E  +G  + A+FGG+ E  PDV IW+ +     W 
Sbjct: 27  AVLKSEFIY--EQAPFPECHASTIAETPQG-LMTAWFGGTRERHPDVGIWVSSATKSGWT 83

Query: 69  SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P+     +  +   +P WNPVLF++P   LLLFYK+G     W G +KRS + G +WS 
Sbjct: 84  TPVEVATGVQADGKRLPCWNPVLFQVPKGELLLFYKVGPSPSTWWGMLKRSTDGGKSWSA 143

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
            E+LP GI GP KNKP+LL +G+LLC SS E  N W    E T D G++W++   I   N
Sbjct: 144 AERLPDGIAGPIKNKPVLLPSGVLLCPSSSEDHN-WRVHFEQTSDWGKTWQRTEAI---N 199

Query: 184 ESLS--VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           + ++   IQP + FH   + L+ L RS    G +  + S DGG +WS  + T L NPNSG
Sbjct: 200 DGVNDGAIQPSILFHPGGQ-LQALCRS-QRTGFIAETWSTDGGKSWSPLQKTTLPNPNSG 257

Query: 241 IDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
            D V L DGR +L YN VS         R  L+VA+S  DG +W   L + EN   E+SY
Sbjct: 258 TDAVTLADGRQVLVYNPVSPTPGKSGGPRTPLEVAIS-SDGKTWK-TLAVLENTPGEYSY 315

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           PAVIQ +DG + ITYT+ R +I+HVV+ P 
Sbjct: 316 PAVIQTADGLLQITYTWKRQRIRHVVVDPK 345


>gi|408675471|ref|YP_006875219.1| neuraminidase [Emticicia oligotrophica DSM 17448]
 gi|387857095|gb|AFK05192.1| neuraminidase [Emticicia oligotrophica DSM 17448]
          Length = 341

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 145/309 (46%), Positives = 179/309 (57%), Gaps = 21/309 (6%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR-WQSPI-IAD---EEPNV 79
           KSCHASTIVE+ KG  + A+F G  E  P V IW+ T + G+ W   I IAD   E    
Sbjct: 36  KSCHASTIVELSKGKLMAAWFAGEHENHPQVCIWVATQERGKTWSKAIKIADGIYEGKQY 95

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
             WNPVLFK  S  L L+YK+G   ++W G +K S N+G TWS   +LP  ILGP +NK 
Sbjct: 96  ACWNPVLFKNSSGKLFLYYKVGINPREWWGMVKTSDNEGKTWSSPSRLPNDILGPIRNKS 155

Query: 140 ILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTA 197
           I L+NG +L  SS ES     W S ME++   G++W+K   I I      VIQP   +  
Sbjct: 156 IQLKNGSVLHPSSTESIRGDIWHSHMEISDANGKNWKK---IDIDCGEYGVIQPTILNHK 212

Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
           +  +++LMRS     ++  S S D G TWS   P  L NPNSGID V LK+G  LL YN 
Sbjct: 213 DGKIQILMRSRQ--NKIIQSWSEDNGQTWSKLSPINLPNPNSGIDAVTLKNGSFLLVYNP 270

Query: 258 V--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           +         R VLKVA S  DG++W D  TLE     EFSYPA+IQ  DG VHITYTY+
Sbjct: 271 LPSGKEWSNGRNVLKVARSI-DGNNWEDIYTLENQEKGEFSYPAIIQTGDGLVHITYTYD 329

Query: 310 RTQIKHVVL 318
           R  IK VVL
Sbjct: 330 RKLIKTVVL 338


>gi|196232131|ref|ZP_03130986.1| Glycosyl hydrolase family 32 domain protein [Chthoniobacter flavus
           Ellin428]
 gi|196223853|gb|EDY18368.1| Glycosyl hydrolase family 32 domain protein [Chthoniobacter flavus
           Ellin428]
          Length = 373

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 142/325 (43%), Positives = 181/325 (55%), Gaps = 22/325 (6%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G V  EF F   SAPF SCHASTI +   G  + A+FGG+ EGAPDV IWL     G+W 
Sbjct: 29  GYVHSEFIF--ESAPFPSCHASTIAQTASG-LVSAWFGGTREGAPDVGIWLSRHIGGQWS 85

Query: 69  SPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P+     +  +      +NPVLF+     LLLFYK G     WSG +K S + G  WS 
Sbjct: 86  APVEVANGLRSDGRRFACYNPVLFQPAKGPLLLFYKAGGNPNGWSGFLKTSLDDGADWSA 145

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSW---MEVTVDAGRSWRKYGPIY 180
            + LP G +GP KNKP+ L NG+LLCG+S+E+      W   +E T D G++W K     
Sbjct: 146 AKPLPDGFVGPVKNKPVQLANGMLLCGASIETPEKPSRWRVQIERTPDLGKTWEK---TE 202

Query: 181 IPNESL--SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
             N+ L  S IQP         L  + R+      +  + S D G TW+    T L NPN
Sbjct: 203 FLNDGLEISAIQPSILFLGGDKLLAIGRTRQ--AHLFRTTSADDGQTWNELSLTDLPNPN 260

Query: 239 SGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
           SG D V L+D R +L YN    G   L VA+S+ DG +W  AL L EN   E+SYPAVIQ
Sbjct: 261 SGTDAVTLRDHRHVLVYNHTDHGRSPLNVAVSR-DGRTWQAALVL-ENEPGEYSYPAVIQ 318

Query: 297 ASDGSVHITYTYNRTQIKHVVLLPN 321
            +DG VHITYT+ R +IKHVV+ P 
Sbjct: 319 TADGLVHITYTWKRHRIKHVVVDPT 343


>gi|311748214|ref|ZP_07721999.1| putative neuraminidase [Algoriphagus sp. PR1]
 gi|126576704|gb|EAZ80952.1| putative neuraminidase [Algoriphagus sp. PR1]
          Length = 336

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 185/315 (58%), Gaps = 14/315 (4%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++  EF +   +APF SCHAST+VE   G  + A+FGG+ E  PDV I+     DG W +
Sbjct: 27  ILKSEFIY--ENAPFPSCHASTLVETPDG-IMAAWFGGTYERHPDVSIYTALLSDGSWST 83

Query: 70  P-IIADEEPNV----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P ++AD   N     P WNPVL++ P+  L+LFYK G   ++W G  K S + G TWS+ 
Sbjct: 84  PKMVADGVENKDFRNPTWNPVLYRNPNGQLVLFYKEGPNPREWWGLYKTSDDGGKTWSKA 143

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
            Q+PPG+LGP KNK + L +G LL  SS E+   W   +E T    + W+K   I I N 
Sbjct: 144 IQIPPGMLGPVKNKSVTLADGTLLHPSSFETNGVWSMHVETTTSDIQDWKK---IAIDNG 200

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           +   IQP      N  L++L R+   +  +  + S DGG TWS    T L++ NSGID V
Sbjct: 201 AFHAIQPTVLTYPNGKLQMLARTQEHV--IGTTWSTDGGKTWSPVSSTGLVHNNSGIDAV 258

Query: 245 KLKDGRLLLAYNTVSRGVLKVAL-SKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
            LK+G  LL  N +  G  K++L   +DG +W +   +E+    EFSYPA+IQA DG+VH
Sbjct: 259 TLKNGVQLLLCNPIKEGRNKLSLMMSEDGVNWEEIHVMEDQPEGEFSYPAIIQAEDGTVH 318

Query: 304 ITYTYNRTQIKHVVL 318
           +TYTYNR +IK+V L
Sbjct: 319 MTYTYNREKIKYVSL 333


>gi|374376460|ref|ZP_09634118.1| BNR repeat-containing glycosyl hydrolase [Niabella soli DSM 19437]
 gi|373233300|gb|EHP53095.1| BNR repeat-containing glycosyl hydrolase [Niabella soli DSM 19437]
          Length = 374

 Score =  227 bits (579), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 144/330 (43%), Positives = 188/330 (56%), Gaps = 25/330 (7%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           +G++ +EF +  ++A F   HASTI E   G  + A+FGG+ EG  DV IW     + +W
Sbjct: 29  QGILVDEFIY--DTASFPQAHASTIAETPDG-LIAAWFGGTKEGNKDVCIWTSRLVNNKW 85

Query: 68  QSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            +P ++AD    +      +NPVL+  P+  LLLFYKIG  V  W+G MKRS + G TWS
Sbjct: 86  TAPQLVADGILNDSTRYACYNPVLYYAPTGELLLFYKIGPNVAGWTGWMKRSKDNGKTWS 145

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
            RE LP G LGP KNKP L+ NG+L+C SS E    W    E T D G +W K  PI   
Sbjct: 146 PREALPEGYLGPIKNKPELI-NGVLVCPSSTEV-GGWKVHFEYTKDWGHTWTKSAPIN-D 202

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            ++ S IQP      N  L+VL R+ N +  +  S S D G TWS      L N NSG D
Sbjct: 203 GKTFSAIQPSILKYKNGALQVLGRTRNTV--IYESWSKDHGKTWSPMTALDLPNNNSGTD 260

Query: 243 GVKLKDGRLLLAYNTV-----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
            V LKDGR LL YN V            R  L VA+SK DG +W+ AL LE++   ++SY
Sbjct: 261 AVTLKDGRQLLVYNHVLPDSTWVKGKGPRTPLNVAVSK-DGKNWYAALVLEDSPISQYSY 319

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           P+VIQ  DG VHI YT+ R +IK+V + P+
Sbjct: 320 PSVIQGKDGMVHIVYTWRRQRIKYVKIDPS 349


>gi|393721557|ref|ZP_10341484.1| hypothetical protein SechA1_17504 [Sphingomonas echinoides ATCC
           14820]
          Length = 365

 Score =  227 bits (579), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 181/331 (54%), Gaps = 26/331 (7%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++  EF  PA   P+   HASTIVE+  G    A+FGG+ E  PDV IW        W  
Sbjct: 36  ILLSEFIDPA--PPYPQAHASTIVELPDGTLAAAWFGGTGESRPDVTIWFARRGAHGWDR 93

Query: 70  PI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P+ +AD    +    P WNPVLF+ P   L LFYK+G   + W G +K S + G +WS  
Sbjct: 94  PVAVADGQSADRTRYPTWNPVLFQPPGAPLTLFYKVGPNPRGWWGMVKISSDGGRSWSAA 153

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVE----SWNSWGSWMEVTVDAGRSWRKYGPIY 180
           ++LP G+LGP KNKP++L +G  L  SS E      N W   ME + D GRSW    PI 
Sbjct: 154 QRLPEGVLGPIKNKPVVLADGSWLSPSSREEGTPEHNRWCIRMERSTDQGRSWTVEPPIA 213

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
            P   +  IQP   H A+  L V+ R+  G   + MS S D G +WS      L NPN+G
Sbjct: 214 SP-MGIEAIQPSVLHFADGRLAVVARTRQGA--LAMSWSRDTGKSWSPLAAIDLPNPNAG 270

Query: 241 IDGVKLKDGRLLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEEN-LAMEF 289
            D V L DGR L+ YN  +          R  L + LS+D GD W + LTLE+  L   +
Sbjct: 271 TDAVTLADGRQLIVYNHSAHWPDHPGEGPRWPLNIGLSRDGGD-WRNVLTLEDKPLPDGY 329

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           +YPAVIQ  DG VHI+YTYNRT I+HVV+ P
Sbjct: 330 AYPAVIQTRDGLVHISYTYNRTHIRHVVVDP 360


>gi|406831122|ref|ZP_11090716.1| BNR/Asp-box repeat domain-containing protein [Schlesneria
           paludicola DSM 18645]
          Length = 355

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 183/320 (57%), Gaps = 20/320 (6%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           L+  EF +   +APF SCHASTI E  +G  + A+FGG+ E  PDV IWL     G W S
Sbjct: 32  LLKREFVY--ETAPFPSCHASTIAETKEG-LVAAWFGGTKESDPDVGIWLSRHVAGAWTS 88

Query: 70  PI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P+     +       P WNPVLF+  +  L+LFYK+G    KW G +  S + G +WSE 
Sbjct: 89  PVEVANGVQSPTKREPCWNPVLFQPQTGPLMLFYKVGPNPDKWWGMLMVSDDSGKSWSEP 148

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
            +LP GI+GP KNKP+ L NG +L  SS E +  W    E + D G++W     +   N+
Sbjct: 149 RRLPDGIVGPIKNKPVQLANGDILSPSSSE-YAKWRVHFERSADLGKTWTATAAV---ND 204

Query: 185 SLSV--IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            +++  IQP      +  L+ + R+  G  ++    S + G TWS  K   L NPNSG D
Sbjct: 205 GIAIAAIQPSILTYRDGHLQAIGRTRQG--QIFDLTSSNQGATWSEMKLISLPNPNSGTD 262

Query: 243 GVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
            V L DGR LL YN +SRG   L VA+S DDG +W  AL LE     EFSYPAVIQ +DG
Sbjct: 263 AVTLADGRQLLIYNHISRGRSPLNVAVS-DDGYNWFAALVLESEPG-EFSYPAVIQTADG 320

Query: 301 SVHITYTYNRTQIKHVVLLP 320
            VH  YT+NR +I+HVV+ P
Sbjct: 321 LVHFAYTWNRKRIRHVVIDP 340


>gi|146301814|ref|YP_001196405.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium johnsoniae
           UW101]
 gi|146156232|gb|ABQ07086.1| hypothetical lipoprotein [Flavobacterium johnsoniae UW101]
          Length = 381

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 24/338 (7%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           ++K    +G++ +EF +  + AP+ SCHA TIVE   G  + ++FGG+ E  PDV I++ 
Sbjct: 23  LEKKTWKEGILVDEFIY--DKAPYPSCHAVTIVEATNGDLVASWFGGTHERHPDVCIYVA 80

Query: 61  TFKDG--RWQSPI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
               G  +W   + +AD    E P +P WNPVL+++P   L+LFYKIG +  +W G ++ 
Sbjct: 81  IKPKGSDKWGEGVKVADGVMKEGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVIRT 140

Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
           S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E  + W   ME T D G++W
Sbjct: 141 SSDGGKTWSEAQKMPDGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-SCDGGLTWSYAKPT 232
                +    + ++ IQP      + +++ + R+ N   R   S  S D G TWS  +  
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGSIQAIGRTRN---RAIFSTFSKDNGKTWSDVELI 256

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
            L N NSG D V L+DGR LL YN V          R  L V++SK DG  W+ AL LE+
Sbjct: 257 GLPNNNSGTDAVTLRDGRHLLVYNHVLPPGKEAKGPRTPLNVSVSK-DGIHWNAALVLED 315

Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           +   ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSMIQSSDGMVHIVYTWRREKLKYVKVDPS 353


>gi|399031119|ref|ZP_10731258.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
 gi|398070588|gb|EJL61880.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
          Length = 381

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 195/338 (57%), Gaps = 24/338 (7%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           ++K    +G++ +EF +  + AP+ SCHA TIVE   G  + A+FGG+ E  PDV I++ 
Sbjct: 23  LEKKTWKEGILVDEFIY--DKAPYPSCHAVTIVEATNGDLVSAWFGGTAERNPDVCIYVS 80

Query: 61  TFKDG--RWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
               G  +W   +     I  + P +P WNPVL+++P   L+LFYKIG +  +W G  + 
Sbjct: 81  VKPKGSDKWGEGVQVADGIMKDGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVTRT 140

Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
           S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E  + W   ME T D G++W
Sbjct: 141 SSDGGKTWSEAKKMPEGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC-DGGLTWSYAKPT 232
                +    + ++ IQP      + T++ + R+ N   R   S S  D G TWS  +  
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGTIQAIGRTRN---RAIFSTSSKDNGKTWSNVELI 256

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
            L N NSG D V LK+G+ LL YN V          R  L +++S DDG +W   L LE+
Sbjct: 257 GLPNNNSGTDAVTLKNGKHLLVYNHVLPPGKEAKGPRTPLNLSIS-DDGINWKAVLILED 315

Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           +   ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSIIQSSDGMVHIVYTWRREKLKYVKIDPS 353


>gi|395803588|ref|ZP_10482832.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
 gi|395434142|gb|EJG00092.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
          Length = 381

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 198/338 (58%), Gaps = 24/338 (7%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           ++K    +G++ ++F +  + AP+ SCHA TIVE   G  + ++FGG+ E  PDV I++ 
Sbjct: 23  LEKKTWKEGILVDQFVY--DKAPYPSCHAITIVEATNGDLVASWFGGTHERNPDVCIYVA 80

Query: 61  TFKDGR--WQSPI-IAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR 113
               G   W   I +AD    + P +P WNPVL+++P   L+LFYKIG +  +W G ++ 
Sbjct: 81  IKPKGSDTWNEGIKVADGVMKDGPRLPTWNPVLYQIPGGDLMLFYKIGPKPSEWWGVIRT 140

Query: 114 SYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
           S + G TWSE +++P G LGP KNKP+LL NG LLC SS+E  + W   ME T D G++W
Sbjct: 141 SSDGGKTWSEAKKMPDGFLGPIKNKPVLLSNGTLLCPSSIEG-DGWRLRMESTPDFGKTW 199

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-SCDGGLTWSYAKPT 232
                +    + ++ IQP      + +++ + R+ N   R   S  S D G TWS  +  
Sbjct: 200 VMGDTLPRGKQKINAIQPSILFHKDGSIQAIGRTRN---RAIFSTFSKDNGKTWSNVELI 256

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEE 283
            L N NSG D V LKDGR LL YN V          R  L V++SK DG  W+ AL LE+
Sbjct: 257 GLPNNNSGTDAVTLKDGRHLLVYNHVLPPGKEAKGPRTPLNVSVSK-DGIHWNAALVLED 315

Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           +   ++SYP++IQ+SDG VHI YT+ R ++K+V + P+
Sbjct: 316 SKISQYSYPSIIQSSDGMVHIVYTWRREKLKYVKVDPS 353


>gi|289664888|ref|ZP_06486469.1| BNR/Asp-box repeat domain protein [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
          Length = 367

 Score =  223 bits (569), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 139/322 (43%), Positives = 180/322 (55%), Gaps = 24/322 (7%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   GH L A+FGG+ EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHLLAAWFGGAHEGAADVGIWLSQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 ACAGIDGAAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRIVSTDDGAHWSPTQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+   N++  + 
Sbjct: 160 GILGPIKNKPVQLPNGRILSPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVID 215

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            IQP      +  L+ L RS     ++  + S DGGL W   +   L NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSSDGGLHWQPMQLLDLQNPNSGTDAVMLR 273

Query: 248 DGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           DGR LL YN           RG L VA+S +DG  W   LTLE +   EFSYPAVIQ  D
Sbjct: 274 DGRALLVYNPGIAGKDWWEGRGTLAVAVS-NDGTHWQRVLTLENSEKDEFSYPAVIQTRD 332

Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
           G VHI+YT+ R  IKHVVL P 
Sbjct: 333 GLVHISYTWKRRHIKHVVLDPT 354


>gi|87307124|ref|ZP_01089270.1| hypothetical protein DSM3645_01685 [Blastopirellula marina DSM
           3645]
 gi|87290497|gb|EAQ82385.1| hypothetical protein DSM3645_01685 [Blastopirellula marina DSM
           3645]
          Length = 372

 Score =  223 bits (568), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 142/330 (43%), Positives = 187/330 (56%), Gaps = 26/330 (7%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           G+V+ E+    + AP  SCHASTI     G F+ A+FGG+ EGA DV I+     DG+W 
Sbjct: 36  GVVSVEYL--TDGAPHDSCHASTIAHTPAG-FVAAWFGGTREGAKDVGIYFNRQVDGKWI 92

Query: 69  SPI-IAD---------EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
           +P+ +A+         +E   P WNPVL +     L+LFYK G     W G +  S + G
Sbjct: 93  TPVEVANGVQYVSPTGKEHRHPCWNPVLHQAQEGPLMLFYKCGPTPSTWWGMLMTSDDSG 152

Query: 119 VTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGP 178
           VTWSE ++LP GI GP KNK I   +G L+C SS E+ + W   +E+T D GR+W + GP
Sbjct: 153 VTWSEPQRLPEGIDGPVKNKAI-EHDGRLVCPSSSEN-HGWRLHLEITSDQGRAWTRVGP 210

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +      +  IQP      N   ++L R  N  G V  + S D G+TW+    T L NPN
Sbjct: 211 LN-DGTKIGAIQPSVLVYPNGRWQILARDRNNNGNVWSTWSDDQGITWTPLVSTGLPNPN 269

Query: 239 SGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           SG D V L DGR LL YN           R +L VA+S+ DG +W  AL LE++ A EFS
Sbjct: 270 SGTDAVMLADGRALLVYNHTHRSGPFPSGRNMLNVAVSQ-DGRTWEAALVLEKS-AGEFS 327

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           YPAVIQ  DG VHITYT+ R +I+HV + P
Sbjct: 328 YPAVIQTPDGLVHITYTWRRKKIRHVTVDP 357


>gi|189460070|ref|ZP_03008855.1| hypothetical protein BACCOP_00706 [Bacteroides coprocola DSM 17136]
 gi|189433231|gb|EDV02216.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides coprocola
            DSM 17136]
          Length = 1267

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 190/358 (53%), Gaps = 54/358 (15%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGR 66
             ++  EF +    APF  CHASTIVE+D G  + A+FGG+ E  PDV IW+  ++     
Sbjct: 911  AVIQNEFLY--EKAPFPECHASTIVELDNGDLVTAFFGGTKERNPDVCIWVCRKSHDSNT 968

Query: 67   WQSPIIADE------EPNVPM--------------WNPVLFKLPSNGLLLFYKIGQEVQK 106
            W +PI A +      +P+  +              WNPVLF++P   LLLF+KIG  V  
Sbjct: 969  WTAPIKAADGVFDLDDPDATIAGVTADIKDHRKACWNPVLFQVPGGDLLLFFKIGLNVPD 1028

Query: 107  WSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVT 166
            W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E    W   ME +
Sbjct: 1029 WTGWLVRSKDGGKTWSKREPLPKGFLGPIKNKPEFI-NGRIICPSSTEGNAGWRIHMEYS 1087

Query: 167  VDAGRSWRKYGPIYIPNESLS----------------VIQPVPFHTANRTLRVLMRSFNG 210
             D G++W+  GPI    E LS                VIQP      + TL+V+ R+ N 
Sbjct: 1088 DDMGKTWKTTGPIDAEQEFLSQHQGIGADDSLKRPIQVIQPSILKHKDGTLQVICRTRN- 1146

Query: 211  IGRVCMSESCDGGLTWSYAK-PTQLLNPNSGIDGVKLKDGRLLLAYNT---------VSR 260
               +  S S D G TWS      +L N NSG D V LKDGR +L YN            R
Sbjct: 1147 -SHLATSWSKDNGTTWSKVTLIDELPNNNSGTDAVTLKDGRHVLVYNNSKPQLRAKKAVR 1205

Query: 261  GVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
              L +ALS +DG  W   LTLE++   E+SYPA+IQ  DG +HIT+T+ R  IK++ +
Sbjct: 1206 TPLNLALS-EDGIHWKPVLTLEDSPIREYSYPAIIQGKDGKLHITFTWRRELIKYMKI 1262


>gi|261406844|ref|YP_003243085.1| hypothetical protein GYMC10_3012 [Paenibacillus sp. Y412MC10]
 gi|261283307|gb|ACX65278.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
          Length = 327

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 129/326 (39%), Positives = 191/326 (58%), Gaps = 26/326 (7%)

Query: 12  AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI 71
           A+EF F  +  PF SCHAST+V +  G  L A+FGGS EGAPDV IW     DG W +P+
Sbjct: 7   AKEFIFEEDR-PFASCHASTLVVLPDGDVLAAWFGGSREGAPDVAIWTARRTDGGWSAPV 65

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
              +E  +P WNPVL+  P   LLLFYK+G  V +W   +  S + G +WSE ++L PG 
Sbjct: 66  KVADEEGLPHWNPVLYLRPDGKLLLFYKVGPRVAEWHTRIIHSDDNGFSWSEPKELVPGD 125

Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
           +   GP KNKPI+L NG LL  +S+E   +W ++++++ D G +W +   + + +  L  
Sbjct: 126 IGGRGPVKNKPIMLRNGTLLAPNSLEP--AWDAFIDISSDHGDTWTQMAIVPLDHGKLKL 183

Query: 188 --VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
             +IQP  + + + ++ +L RS  G   +  S+S DGG TW  A PT + N NSG D  +
Sbjct: 184 KGIIQPTLWESEDGSVHLLARSTEGA--MYRSDSQDGGFTWCEAYPTDMPNNNSGFDLAR 241

Query: 246 LKDGRLLLAYNTV-----------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
           L DG L +AYN              R  L + LS+DDG +W + L L+  ++ ++SYPAV
Sbjct: 242 LSDGTLAMAYNPTVPREDDPKGKGPRTPLVLRLSRDDGATWGEELPLDSGIS-QYSYPAV 300

Query: 295 IQASDGSVHITYTYNRTQIK--HVVL 318
           + A   +++I+YT+ R +I   H+VL
Sbjct: 301 V-AHGNNIYISYTWRRERIAFYHIVL 325


>gi|325914142|ref|ZP_08176495.1| putative neuraminidase (sialidase) [Xanthomonas vesicatoria ATCC
           35937]
 gi|325539645|gb|EGD11288.1| putative neuraminidase (sialidase) [Xanthomonas vesicatoria ATCC
           35937]
          Length = 381

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ +P    HAST++E   G  L A+FGG+ EGA DV IWL      RW+ P  IAD 
Sbjct: 36  FIADPSPTPQAHASTLLETRNGRLLAAWFGGAHEGAADVGIWLTERDAQRWRPPRRIADG 95

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS   +LP 
Sbjct: 96  TRATTDGATVPAWNPVLFQSATGSVQLYYKLGPNPRQWWGLRITSADDGAHWSVPRRLPD 155

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L +G +L  SS E    W + +E + D G  W++  P+  P+  +  I
Sbjct: 156 GILGPIKNKPVQLPDGRILAPSSSED-RGWRAHLEWSDDDGEHWQRGMPLNDPS-VIGAI 213

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W   +   + NPNSG D V L+DG
Sbjct: 214 QPSLLLHGDGRLQALGRSQQ--NKLFSTFSYDGGLHWQPMQLIDVENPNSGTDAVMLRDG 271

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VALS +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 272 RALLVYNPAIAGKDWWDGRGTLAVALS-NDGVHWQRVLTLEDSADDEFSYPAVIQTRDGL 330

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 331 VHVSYTWKRRRIKHVVLDPT 350


>gi|188990640|ref|YP_001902650.1| exo-alpha-sialidase [Xanthomonas campestris pv. campestris str.
           B100]
 gi|167732400|emb|CAP50594.1| exported exo-alpha-sialidase [Xanthomonas campestris pv.
           campestris]
          Length = 355

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 141/322 (43%), Positives = 181/322 (56%), Gaps = 26/322 (8%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG+ EGA DV IWL       WQ+P  IAD 
Sbjct: 36  FIADQAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   +P WNPVLF+  +  + L+YK+G   + W G    S + GV WS  ++LP 
Sbjct: 96  AHAGVDGAALPAWNPVLFQSATGPVQLYYKLGPNPRDWWGLRITSTDDGVHWSPPQRLPN 155

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+     ++  I
Sbjct: 156 GILGPIKNKPVQLPNGRILAPSSSED-RGWRTHLEWSDDDGAHWQRGLPLN-DAATIGAI 213

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVKL 246
           QP     A+  L+ L RS     ++  + S DGGL W   +P QLL   NPNSG D V L
Sbjct: 214 QPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAVML 268

Query: 247 KDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
            DGR LL YN           RG L VA+S DDG  W   LTLE++   EFSYPAVIQ  
Sbjct: 269 HDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQTR 327

Query: 299 DGSVHITYTYNRTQIKHVVLLP 320
           DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 DGLVHISYTWKRQRIKHVVIDP 349


>gi|390989306|ref|ZP_10259605.1| BNR/Asp-box repeat family protein [Xanthomonas axonopodis pv.
           punicae str. LMG 859]
 gi|372556064|emb|CCF66580.1| BNR/Asp-box repeat family protein [Xanthomonas axonopodis pv.
           punicae str. LMG 859]
          Length = 386

 Score =  220 bits (560), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 136/320 (42%), Positives = 179/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP-IIAD- 74
           F A+ AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHMLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRPIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W  ALTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRALTLEDSAGDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|384428900|ref|YP_005638260.1| BNR-Asp-box repeat domain protein [Xanthomonas campestris pv.
           raphani 756C]
 gi|341938003|gb|AEL08142.1| BNR-Asp-box repeat domain protein [Xanthomonas campestris pv.
           raphani 756C]
          Length = 355

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 142/324 (43%), Positives = 183/324 (56%), Gaps = 30/324 (9%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F AN AP    HAST++E   G  L A+FGG+ EGA DV IWL       WQ+P  IAD 
Sbjct: 36  FIANPAPTAQAHASTLLETRDGTLLAAWFGGTHEGAADVGIWLAQRGPQHWQTPRRIADG 95

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   +P WNPVLF+     + L+YK+G   + W G    S + G  WS  ++LP 
Sbjct: 96  AHTGLDGAALPAWNPVLFQPRYGPVQLYYKLGPSPRDWWGLRITSTDDGAQWSPPQRLPN 155

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV- 188
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+   N++ ++ 
Sbjct: 156 GILGPIKNKPLQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGLPL---NDAATIG 211

Query: 189 -IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGV 244
            IQP     A+  L+ L RS     ++  + S DGGL W   +P QLL   NPNSG D V
Sbjct: 212 AIQPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAV 266

Query: 245 KLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
            L+DGR LL YN           RG L VA+S DDG  W   LTLE++   EFSYPAVIQ
Sbjct: 267 MLRDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQ 325

Query: 297 ASDGSVHITYTYNRTQIKHVVLLP 320
             DG VHI+YT+ R +IKHVV+ P
Sbjct: 326 TRDGLVHISYTWKRQRIKHVVIDP 349


>gi|412988193|emb|CCO17529.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
          Length = 436

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 127/325 (39%), Positives = 184/325 (56%), Gaps = 26/325 (8%)

Query: 20  NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNV 79
           +  P  S HAS+I E+  G FLV++F G  E  PDV I+   +++G+W +P +      V
Sbjct: 81  DHKPKPSHHASSICEISSGQFLVSWFAGQFESKPDVGIYTARYQNGKWSAPELVVHPKGV 140

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
           P WNPVLFK  +  L LFYK G+    W G   +S + G TWS    +P GI+GP+KNKP
Sbjct: 141 PTWNPVLFKRSNGDLFLFYKHGRTPANWKGYYVKSRDGGNTWSSSVAMPNGIVGPAKNKP 200

Query: 140 ILLENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNE------SLSVIQ 190
           + L++G +L  SS E + + G   E+ V+A      ++ Y    +  +         VIQ
Sbjct: 201 LELDDGTILLPSSRE-FGARGKDWEIVVEAMTMEDEFKTYAKFAVSEKGGGASVQQRVIQ 259

Query: 191 PVPFHT------ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           PV +        A   +R++ RS +      + +S   G  W+ A  +Q+  PN+G D V
Sbjct: 260 PVFWKQSMLAGDAANGIRMMARSRSSYAVAAVGDST--GRIWTDAVFSQIPCPNTGFDVV 317

Query: 245 KLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLA--MEFSYPAVIQ 296
           KL+DGRLL+ YN         RGVL +A+S DDGD+W   + LE++    +EFSYPA+IQ
Sbjct: 318 KLRDGRLLIVYNHSFKHAIAGRGVLALAVSHDDGDTWERVMRLEDSGGRLLEFSYPAIIQ 377

Query: 297 ASDGSVHITYTYNRTQIKHVVLLPN 321
           ASDG VH+TYT+ R  IKHV++ P+
Sbjct: 378 ASDGMVHVTYTWRRANIKHVIIDPS 402


>gi|188577373|ref|YP_001914302.1| BNR/Asp-box repeat domain-containing protein [Xanthomonas oryzae
           pv. oryzae PXO99A]
 gi|188521825|gb|ACD59770.1| BNR/Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 350

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 137/321 (42%), Positives = 179/321 (55%), Gaps = 24/321 (7%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG+ EGA DV IWL      +WQ+P  +AD 
Sbjct: 23  FIADPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRHLADG 82

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  L L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 83  AHAGVDGAAVPAWNPVLFQSATGPLQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 142

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+   N++  + 
Sbjct: 143 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVIG 198

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            IQP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+
Sbjct: 199 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 256

Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           DGR LL YN           RG L VA+S +DG  W   LTLE +   EFSYPAVIQ  D
Sbjct: 257 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 315

Query: 300 GSVHITYTYNRTQIKHVVLLP 320
           G VHI+YT+ R +IKHVVL P
Sbjct: 316 GLVHISYTWKRQRIKHVVLDP 336


>gi|58581386|ref|YP_200402.1| hypothetical protein XOO1763 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84623320|ref|YP_450692.1| hypothetical protein XOO_1663 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58425980|gb|AAW75017.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84367260|dbj|BAE68418.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 367

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 137/322 (42%), Positives = 179/322 (55%), Gaps = 24/322 (7%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG+ EGA DV IWL      +WQ+P  +AD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRHLADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  L L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 AHAGVDGAAVPAWNPVLFQSATGPLQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+   N++  + 
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPL---NDASVIG 215

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            IQP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 273

Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           DGR LL YN           RG L VA+S +DG  W   LTLE +   EFSYPAVIQ  D
Sbjct: 274 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 332

Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
           G VHI+YT+ R +IKHVVL P 
Sbjct: 333 GLVHISYTWKRQRIKHVVLDPT 354


>gi|399031783|ref|ZP_10731638.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
 gi|398069838|gb|EJL61168.1| putative neuraminidase (sialidase) [Flavobacterium sp. CF136]
          Length = 351

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 128/317 (40%), Positives = 176/317 (55%), Gaps = 22/317 (6%)

Query: 16  TFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI---- 71
           ++ +N     + HA+T+VE+  G  + A+FGG  EGA DV I+L T+K+  W SP     
Sbjct: 35  SYISNEPITNNSHAATLVEIKPGQLMAAWFGGKHEGAKDVGIYLSTYKEKTWSSPQNIIK 94

Query: 72  -IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
            +  +   +P WNPVLFK  S  L LFYK+G+  ++W G M  S + G TWS  + LP G
Sbjct: 95  PLIKQGDTLPCWNPVLFKSKSKKLYLFYKVGKNPREWFGAMITSNDDGTTWSNPKYLPEG 154

Query: 131 ILGPSKNKPILLENGLLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIP-NESLS 187
           ILGP KNKPI    G +LCGSS ES   N W S +E   +A   W K   I I  N++  
Sbjct: 155 ILGPIKNKPIETTQGTILCGSSTESIENNQWRSHIETYTEATDKWTK---INIENNKNFD 211

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           +IQP     +++ +++L RS +   ++  S S D G  W++     ++N NSG+D + L 
Sbjct: 212 IIQPTFLVHSDKIIQMLCRSKH--NKLITSWSKDNGQNWNHTDTLNVVNSNSGVDALTLS 269

Query: 248 DGRLLLAYNTVSRG--------VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
               LL  N + +G        +L V  SK DG  W     LE     EFSYPA+IQASD
Sbjct: 270 KKSFLLVNNPLKQGKDWFNGRNILDVEYSK-DGLYWTKLFDLENQKEGEFSYPAIIQASD 328

Query: 300 GSVHITYTYNRTQIKHV 316
           G +H+ YTYNR  IKHV
Sbjct: 329 GKIHVLYTYNRKYIKHV 345


>gi|418519411|ref|ZP_13085463.1| hypothetical protein WS7_00040 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410704855|gb|EKQ63334.1| hypothetical protein WS7_00040 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 386

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|294665214|ref|ZP_06730512.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605002|gb|EFF48355.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 386

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 134/320 (41%), Positives = 179/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLARRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLRDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSARDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +I+HVVL P 
Sbjct: 335 VHVSYTWKRRRIRHVVLDPT 354


>gi|78048775|ref|YP_364950.1| neuraminidase [Xanthomonas campestris pv. vesicatoria str. 85-10]
 gi|78037205|emb|CAJ24950.1| putative neuraminidase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
          Length = 386

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADRAPTPQAHASTLLETRDGRLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|346725888|ref|YP_004852557.1| neuraminidase [Xanthomonas axonopodis pv. citrumelo F1]
 gi|346650635|gb|AEO43259.1| neuraminidase (sialidase) [Xanthomonas axonopodis pv. citrumelo F1]
          Length = 386

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGRLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPTPRQWWGLRLLSTDDGAHWSPPQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|380512551|ref|ZP_09855958.1| sialidase/neuraminidase [Xanthomonas sacchari NCPPB 4393]
          Length = 362

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 141/313 (45%), Positives = 171/313 (54%), Gaps = 19/313 (6%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD-EEPN- 78
           AP   CHAST+VE   G  L A+FGG  EGA DV IW+       WQ +  +AD  +P  
Sbjct: 44  APTAQCHASTLVETRDG-LLAAWFGGRHEGADDVAIWVARRDAHGWQPAQRVADGAQPQG 102

Query: 79  --VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK 136
             +P WNPVLF+     L LFYK+G + ++W G    S + G  WS  E+LP GILGP K
Sbjct: 103 KPLPTWNPVLFQPAQGPLRLFYKVGPDPKRWWGMQTTSSDAGHHWSTPERLPDGILGPIK 162

Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
           NKP+ L +G +L  SS E    W + ME + D G  W + GP       +  IQP     
Sbjct: 163 NKPVQLPSGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPAMNDPARIGAIQPSVLLH 220

Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN 256
            +  L+ + RS      V  + S D G TW       L NPNSG D V L DGR LL YN
Sbjct: 221 KDGRLQAVGRSQQ--NHVFSTWSRDNGRTWEPMTLLDLANPNSGTDAVVLADGRSLLVYN 278

Query: 257 TVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
                      RGVL VALS DDG  W   LTLE++   EFSYPAVIQ  DG VHI+YT+
Sbjct: 279 PTEAGKDWWDGRGVLAVALS-DDGQHWTRVLTLEDSPKDEFSYPAVIQTRDGKVHISYTW 337

Query: 309 NRTQIKHVVLLPN 321
            RT+IKHVVL P 
Sbjct: 338 KRTRIKHVVLDPK 350


>gi|21243816|ref|NP_643398.1| hypothetical protein XAC3089 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109411|gb|AAM37934.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 386

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMYLLEVENPNSGTDAVVLHDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|294627334|ref|ZP_06705920.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598416|gb|EFF42567.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 387

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 179/319 (56%), Gaps = 20/319 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLARRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLRDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSARDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLP 320
           VH++YT+ R +I+HVVL P
Sbjct: 335 VHVSYTWKRRRIRHVVLDP 353


>gi|381169762|ref|ZP_09878925.1| BNR/Asp-box repeat family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689780|emb|CCG35412.1| BNR/Asp-box repeat family protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 386

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 177/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A  AP    HAST++E   GH L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIAAPAPTPQAHASTLLETRDGHLLAAWFGGEHEGAADVGIWLAQRGPQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++L  
Sbjct: 100 AHAGVDGTAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLAD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGVHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W      ++ NPNSG D V L DG
Sbjct: 218 QPSLLMYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLEVENPNSGTDAVVLHDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAGDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|295135992|ref|YP_003586668.1| neuraminidase [Zunongwangia profunda SM-A87]
 gi|294984007|gb|ADF54472.1| putative neuraminidase [Zunongwangia profunda SM-A87]
          Length = 345

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 134/317 (42%), Positives = 174/317 (54%), Gaps = 21/317 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI----- 71
           F    A F S HASTI E++ G  + A+FGG+ E  PDV I+        W SP      
Sbjct: 31  FIYTKASFPSAHASTIEELENGDIIAAWFGGTHERHPDVSIYTSIKTVSGWSSPKKVATG 90

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
              +  + P WNPVLFK  +N L LFYKIG     W G  K S + G TWSE++ LP  I
Sbjct: 91  YQSDTLSYPTWNPVLFKT-NNKLFLFYKIGPSPSTWWGAYKISTDHGKTWSEKKMLPEDI 149

Query: 132 LGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           LGP KNKPI L+NG ++  SS E  +   W + ME++ D G++WRK  P+   N +  VI
Sbjct: 150 LGPIKNKPIQLKNGRIVSPSSEELQDGEIWKAHMEISDDYGKTWRKV-PVN-ENSAYKVI 207

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      N  L+   RS      V  S S D G TWS    ++L NPNSGID V L +G
Sbjct: 208 QPSIVQLQNGNLKAFFRS--DQDAVLESTSIDNGETWSDFSKSELANPNSGIDAVSLING 265

Query: 250 RLLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
             L+ YN ++        R  L +A SK DG+ W D L LE     E+SYPA+IQ  +G 
Sbjct: 266 GFLMVYNPMASGDNWWEGRSKLYLAYSK-DGEHWQDILKLENQEKGEYSYPAIIQTENGE 324

Query: 302 VHITYTYNRTQIKHVVL 318
           ++ITYTY+R +IK+  L
Sbjct: 325 IYITYTYDREKIKYFRL 341


>gi|325925747|ref|ZP_08187120.1| hypothetical protein XPE_1074 [Xanthomonas perforans 91-118]
 gi|325543804|gb|EGD15214.1| hypothetical protein XPE_1074 [Xanthomonas perforans 91-118]
          Length = 386

 Score =  217 bits (552), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 178/320 (55%), Gaps = 20/320 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG  EGA DV IWL      +WQ+P  IAD 
Sbjct: 40  FIADPAPTPQAHASTLLETRGGRLLAAWFGGEHEGAADVGIWLAQRGVQQWQTPRRIADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 AHAGVDGVAVPAWNPVLFQSTTGPVQLYYKLGPTPRQWWGLRLLSTDDGAHWSPPQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+    +++  I
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLND-AKAIGAI 217

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+DG
Sbjct: 218 QPSLLLYPDGRLQALGRSQQ--NKLFSTFSLDGGLHWQPMHLLDVENPNSGTDAVMLRDG 275

Query: 250 RLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           R LL YN           RG L VA+S +DG  W   LTLE++   EFSYPAVIQ  DG 
Sbjct: 276 RALLVYNPAIAGKDWWDGRGTLAVAVS-NDGVHWQRVLTLEDSAHDEFSYPAVIQTRDGL 334

Query: 302 VHITYTYNRTQIKHVVLLPN 321
           VH++YT+ R +IKHVVL P 
Sbjct: 335 VHVSYTWKRRRIKHVVLDPT 354


>gi|343083099|ref|YP_004772394.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351633|gb|AEL24163.1| hypothetical protein Cycma_0384 [Cyclobacterium marinum DSM 745]
          Length = 382

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 188/343 (54%), Gaps = 28/343 (8%)

Query: 2   KKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
           K+  G   LV  E  +P +  P    HAST+VE   G  + A+F G+ E  PDV IW+  
Sbjct: 43  KQKVGEGALVHAELIYPLDDRPTPQVHASTLVETPSG-IVAAFFAGTHEKNPDVGIWVSH 101

Query: 62  FKDGRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
             DG+W  P+     + +++   P WNPVLF+     L+LFYK+G + + W G +  S +
Sbjct: 102 LGDGQWTRPVEVANGVQNKDLRYPTWNPVLFQPKEGPLMLFYKVGPDPRGWWGMLMTSED 161

Query: 117 KGVTWSEREQL--PPGI---LGPSKNKPILLENGLLLCGSSVESWNS----WGSWMEVTV 167
            G TWSE  +L   P +   LGP KNK + LE+G ++  +S+E  +     W    E++ 
Sbjct: 162 DGKTWSEPRKLGEDPAVGPLLGPVKNKAVQLEDGTIIAPTSIERKHDDDVFWMVHFEMSK 221

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W   GPI    E    IQP      + +L++L R+  G+  +  S S D G TWS
Sbjct: 222 DNGQTWEVVGPINDGVE-FDAIQPSILFHKDGSLQILCRTRQGV--ISESWSTDNGQTWS 278

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------RGVLKVALSKDDGDSWHDAL 279
             K T L NPNSG D V L DGR LL YN  +        R +L +A+S DDG  W   +
Sbjct: 279 PMKATSLPNPNSGTDAVTLADGRQLLIYNHSTKEGEEPKGRNILNLAIS-DDGKDWKPFM 337

Query: 280 TLE-ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           TLE E     +SYPA+IQ+ DG VH+TYTY+R  +K+VV+ P+
Sbjct: 338 TLENEPNESGYSYPAIIQSKDGMVHMTYTYDRKTVKYVVVDPS 380


>gi|384420180|ref|YP_005629540.1| BNR-Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463093|gb|AEQ97372.1| BNR-Asp-box repeat domain protein [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 367

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 135/322 (41%), Positives = 178/322 (55%), Gaps = 24/322 (7%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F  + AP    HAST++E   G  L A+FGG+ EGA DV IWL      +WQ+P  +AD 
Sbjct: 40  FITDPAPTPQAHASTLLETRDGRLLAAWFGGAHEGAADVGIWLAQRGPQQWQTPRRLADG 99

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   VP WNPVLF+  +  + L+YK+G   ++W G    S + G  WS  ++LP 
Sbjct: 100 AHAGVDGAAVPAWNPVLFQSATGPVQLYYKLGPNPRQWWGLRLLSTDDGAHWSPPQRLPD 159

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES--LS 187
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+   N++  + 
Sbjct: 160 GILGPIKNKPVQLPNGRILAPSSSED-RRWRAHLEWSDDDGAHWQRGMPL---NDASVIG 215

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            IQP      +  L+ L RS     ++  + S DGGL W       + NPNSG D V L+
Sbjct: 216 AIQPSLLLYPDGRLQALGRSQQ--NKLFSTFSNDGGLHWQPMHLLDVENPNSGTDAVMLR 273

Query: 248 DGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           DGR LL YN           RG L VA+S +DG  W   LTLE +   EFSYPAVIQ  D
Sbjct: 274 DGRALLVYNPAIAGKDWWEGRGTLAVAVS-NDGTHWQRMLTLENSEKDEFSYPAVIQTRD 332

Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
           G VHI+YT+ R +IKHVVL P 
Sbjct: 333 GLVHISYTWKRQRIKHVVLDPT 354


>gi|21232340|ref|NP_638257.1| hypothetical protein XCC2909 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66767527|ref|YP_242289.1| hypothetical protein XC_1200 [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21114112|gb|AAM42181.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66572859|gb|AAY48269.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 355

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 139/322 (43%), Positives = 180/322 (55%), Gaps = 26/322 (8%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IAD- 74
           F A+ AP    HAST++E   G  L A+FGG+ EGA DV IWL       WQ+P  IAD 
Sbjct: 36  FIADQAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   +P WNPVLF+  +  + L+YK+G   + W G    S + G  WS  ++LP 
Sbjct: 96  AHAGVDGAALPAWNPVLFQSATGPVQLYYKLGPNPRDWWGLRITSSDNGAHWSPPQRLPN 155

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L +G +L  SS E    W + +E + D G  W++  P+     ++  I
Sbjct: 156 GILGPIKNKPVQLPSGRILAPSSGED-RGWRAHLEWSDDDGAHWQRGLPLN-DAATIGAI 213

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVKL 246
           QP     A+  L+ L RS     ++  + S DGGL W   +P QLL   NPNSG D V L
Sbjct: 214 QPSLLLHADGRLQALGRSQQ--NKLFSAFSDDGGLHW---QPMQLLDVENPNSGTDAVML 268

Query: 247 KDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
            DGR LL YN           RG L VA+S DDG  W   LTLE++   EFSYPAVIQ  
Sbjct: 269 HDGRALLVYNPGIAGKQWWDGRGTLAVAVS-DDGVQWRRVLTLEDSPKDEFSYPAVIQTR 327

Query: 299 DGSVHITYTYNRTQIKHVVLLP 320
           DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 DGLVHISYTWKRQRIKHVVIDP 349


>gi|372222469|ref|ZP_09500890.1| alpha-L-rhamnosidase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 406

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 131/328 (39%), Positives = 188/328 (57%), Gaps = 21/328 (6%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR 66
           K +  EEF F    +P + CHAS+++E+D G  L  +FGGS EGAPDVKIWL +  K G+
Sbjct: 48  KAIDLEEFVF--TQSPTRDCHASSLLELDNGDLLCTWFGGSKEGAPDVKIWLARKPKGGQ 105

Query: 67  WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ 126
           WQ P+    +    ++NPVL KL    + ++Y +   +    G +  S + G TWS  ++
Sbjct: 106 WQPPVRIAGDDGQTVFNPVLVKLRGGDIQIYY-VSPGID--DGHVITSRDNGYTWSSPKK 162

Query: 127 LPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP-NES 185
           LP G +GP  NKP+ LE+G ++ G+S++        +E++ D   +W K GPI  P N  
Sbjct: 163 LPEGFVGPIVNKPVYLEDGTIIAGASLQGNPGKRIHVEISKDDAETWFKVGPISDPVNTK 222

Query: 186 LSVIQPVPFHTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
             +IQP     + + L++L RS   +   ++  S S DGG+TWS      L N NS ID 
Sbjct: 223 YQIIQPTILVHSQKRLQILARSNEKDENTKIAQSWSEDGGITWSLVTDITLPNNNSAIDA 282

Query: 244 VKLKDGRLLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEENLA-MEFSYP 292
           V L DGR LL YN  +          RG+L VA+SK DG +W  A  LE     +++SYP
Sbjct: 283 VTLDDGRHLLVYNHSTREDSIGGRKGRGILTVAVSK-DGINWDAAAVLEYRTGTVQYSYP 341

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVLLP 320
           AVIQ +DG VH+TY+++R  IKHVV+ P
Sbjct: 342 AVIQTNDGLVHVTYSWHRKLIKHVVIDP 369


>gi|325919351|ref|ZP_08181384.1| hypothetical protein XGA_0321 [Xanthomonas gardneri ATCC 19865]
 gi|325550164|gb|EGD20985.1| hypothetical protein XGA_0321 [Xanthomonas gardneri ATCC 19865]
          Length = 465

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 180/324 (55%), Gaps = 28/324 (8%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP------ 70
           F A+ AP    HAST++E   G  L A+FGG+ EGA DV IWL       WQ+P      
Sbjct: 36  FIADPAPTPQAHASTLLETRDGTLLAAWFGGAHEGAADVGIWLAQRGPQHWQTPRRIADG 95

Query: 71  -IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
                +   +P WNPVLF+  +  + L+YK+G   + W G    S + G  WS  ++LP 
Sbjct: 96  AHAGADGAALPAWNPVLFQSTTGPVQLYYKLGPNPRDWWGLRITSTDDGAHWSPPQRLPD 155

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           GILGP KNKP+ L NG +L  SS E    W + +E + D G  W++  P+  P+  +  I
Sbjct: 156 GILGPIKNKPVQLPNGRILAPSSSED-RGWRAHLEWSDDDGAHWQRGMPLNEPS-VIGAI 213

Query: 190 QP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL---NPNSGIDGVK 245
           QP +  HT  R ++ L RS     ++  + S DGGL W   +P QLL   NPNSG D V 
Sbjct: 214 QPSLLLHTDGR-VQALGRSQQ--NKLFSTFSTDGGLHW---QPMQLLDVENPNSGTDAVM 267

Query: 246 LKDGRLLLAYNT--------VSRGVLKVALS-KDDGDSWHDALTLEENLAMEFSYPAVIQ 296
           L+DGR LL YN           RG L VA++  DDG  W   LTLE +   EFSYPAVIQ
Sbjct: 268 LRDGRALLVYNPGIAGKDWWDGRGTLAVAVALSDDGVHWQRVLTLENSAKDEFSYPAVIQ 327

Query: 297 ASDGSVHITYTYNRTQIKHVVLLP 320
             DG VHI+YT+ R +IKHVV+ P
Sbjct: 328 TRDGLVHISYTWKRQRIKHVVIDP 351


>gi|300770074|ref|ZP_07079953.1| BNR repeat-containing glycosyl hydrolase [Sphingobacterium
           spiritivorum ATCC 33861]
 gi|300762550|gb|EFK59367.1| BNR repeat-containing glycosyl hydrolase [Sphingobacterium
           spiritivorum ATCC 33861]
          Length = 342

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 134/324 (41%), Positives = 176/324 (54%), Gaps = 22/324 (6%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V   F F  N   F  CHAST+ E   G    ++FGG+ EG  DV IW   F   +W +
Sbjct: 24  VVKTGFLFEENRY-FAQCHASTLAETASGEIRASWFGGTHEGNKDVVIWSSAFDGQKWSA 82

Query: 70  PI-----IADEEPNVPMWNPVLFKLP-SNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           P+     I+ +      WNPVLFK    N L L+YK+G   ++W G +K S + G TWS 
Sbjct: 83  PVSVADGISHDTVRYACWNPVLFKAKEDNTLYLYYKVGPNPREWWGAVKTSADNGKTWSS 142

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
              LP GILGP KNKP+ L NG++L  SSVE + + W   +E + D  +SW  Y PI   
Sbjct: 143 ARLLPKGILGPVKNKPLELSNGIILSPSSVEVTEDRWIGNIERSDDQQKSWTSY-PI-DH 200

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           N   ++IQP     A+  L+VL RS  G   V  S S D G  WS    T ++NPNS  D
Sbjct: 201 NSPFNIIQPSILQHADGRLQVLCRSKEGF--VMTSWSSDEGKNWSALSKTNMINPNSATD 258

Query: 243 GVKLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
            +++K+G  L+ YN           R  L++A SK DG  W D L LE+    EFSYP +
Sbjct: 259 AIRVKNG-FLIVYNPDIPGKDWWEGRARLRLAYSK-DGIIWKDILVLEDQDKGEFSYPTI 316

Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
           IQ +   VHITYTYNR  IKH++L
Sbjct: 317 IQDTKDLVHITYTYNRKNIKHIIL 340


>gi|302539930|ref|ZP_07292272.1| putative cytoplasmic protein [Streptomyces hygroscopicus ATCC
           53653]
 gi|302457548|gb|EFL20641.1| putative cytoplasmic protein [Streptomyces himastatinicus ATCC
           53653]
          Length = 356

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 127/330 (38%), Positives = 181/330 (54%), Gaps = 38/330 (11%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWN 83
           F  CHAST++ +  G  L A+F GS EGA DV IWL     G W +P+   +EP +P WN
Sbjct: 15  FGHCHASTLLPLPDGDVLAAWFAGSREGADDVAIWLARRTGGGWDAPVKVADEPGLPHWN 74

Query: 84  PVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI-----LGPSKNK 138
           P LF      +LLFYK G  +  W   + RS + G++WS   +L PG       GP KNK
Sbjct: 75  PALFATGHGEVLLFYKTGHRIPDWRTRVLRSRDGGLSWSAPAELVPGADGAGGRGPVKNK 134

Query: 139 PILLENGLLLCGSSVESWNS----WGSWMEVTVDAGRSWRKYGPIYIPNESL---SVIQP 191
           P+ L +G  L  +SVE   S    W ++ + + D G SWR+  P+ +   +     VI+P
Sbjct: 135 PLRLADGGWLAPASVEEPQSAGGRWDAFTDRSDDGGASWRRSAPVPLDRPAFPGAGVIEP 194

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
             + +A   +R+L+RS    GRVC S S DGG+ WS A+P  + N NSG+D V+L DGR+
Sbjct: 195 ALWESAPGEVRMLLRS--SCGRVCRSASHDGGVIWSTARPLDVPNNNSGLDAVRLADGRV 252

Query: 252 LLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLA------------------ME 288
           +LA+N V     +R  L +++S+DDG +W  A+ LE+  A                   E
Sbjct: 253 VLAHNPVAAEWGARTPLVLSVSEDDGITWRRAVVLEDRPASGPGAIVPDETGVRTDGHSE 312

Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           FSYPA++  +DG V +TYT+ R  I  V +
Sbjct: 313 FSYPAIVPWADG-VAVTYTWQRRGIAFVTV 341


>gi|372221459|ref|ZP_09499880.1| neuraminidase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 346

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 22/320 (6%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-- 71
           EF F     PFK CHA T+V ++ G  + ++F G+ E   DV I+    K+G W  P   
Sbjct: 30  EFVF--EKGPFKECHAPTLVVLEDGSIMASWFAGTHERHEDVAIYTSIKKEGSWSYPKLV 87

Query: 72  ---IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
              +A      P WNPVLFK P+  L L+YK+G     W G  K S + G TWSE ++L 
Sbjct: 88  ANGVAANGKKYPTWNPVLFKNPNGLLYLYYKVGPSPSTWWGVYKTSKDNGKTWSEAKKLS 147

Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESL 186
            GILGP KNKP+LL+NG +L  SSVE  N   W + ME ++D+G++W+K          L
Sbjct: 148 DGILGPIKNKPLLLKNGNILSPSSVELENGEIWLAHMEHSIDSGKTWQKIKIPQKKKHKL 207

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
             IQP      ++++  L RS     +V  S S + G+TWS AK   L NPNSGID + +
Sbjct: 208 --IQPTLLTLKDQSILALFRS--NQDKVVYSTSKNNGITWSKAKAMDLENPNSGIDALTM 263

Query: 247 KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
           K+G  LL YN           R  L +  S+ DG  W   ++LE +   E+SYPA++Q  
Sbjct: 264 KNGNHLLVYNPALSGDQWWDGRSKLNLGYSQ-DGMQWKSIVSLENHQKGEYSYPAIVQDQ 322

Query: 299 DGSVHITYTYNRTQIKHVVL 318
           DG +H+ YTY+R +IK+V L
Sbjct: 323 DGYIHLVYTYDRKKIKYVKL 342


>gi|440733196|ref|ZP_20912962.1| sialidase/neuraminidase [Xanthomonas translucens DAR61454]
 gi|440363986|gb|ELQ01133.1| sialidase/neuraminidase [Xanthomonas translucens DAR61454]
          Length = 364

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 172/317 (54%), Gaps = 19/317 (5%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
           F    AP   CHAST++E   G  L A+FGG  EGA DV IW+       WQ +  +AD 
Sbjct: 41  FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRSAQGWQPAQRVADG 99

Query: 75  ---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
              +  ++P WNPVLF+     L LFYK+G + ++W G    S + GV WS  E+LP GI
Sbjct: 100 MQPQGASLPAWNPVLFQPAKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159

Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
           LGP KNKP+ L +G +L  SS E    W + ME + D G  W + GP       +  IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-TGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
                 + +++ + RS      V  + S D G +W      +L NPNSG D   L DGR 
Sbjct: 218 SALVHTDGSVQAIGRSQQ--NHVFSAWSRDHGRSWEPMTLLELANPNSGTDAAVLADGRS 275

Query: 252 LLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           LL YN           RG L VALS  DG+ W   LTLE++   EFSYPAVIQ  DG VH
Sbjct: 276 LLVYNPTEAGKNWWDGRGTLAVALSA-DGNHWTRVLTLEDSAKDEFSYPAVIQTRDGLVH 334

Query: 304 ITYTYNRTQIKHVVLLP 320
           I+YT+ R +IKHVVL P
Sbjct: 335 ISYTWKRIRIKHVVLDP 351


>gi|433679939|ref|ZP_20511604.1| hypothetical protein BN444_03961 [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430814943|emb|CCP42233.1| hypothetical protein BN444_03961 [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 364

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 173/317 (54%), Gaps = 19/317 (5%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
           F    AP   CHAST++E   G  L A+FGG  EGA DV IW+       WQ +  +AD 
Sbjct: 41  FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRSAQGWQPAQRVADG 99

Query: 75  EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
           ++P    +P WNPVLF+     L LFYK+G + ++W G    S + GV WS  E+LP GI
Sbjct: 100 KQPQGAPLPAWNPVLFQPSKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159

Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
           LGP KNKP+ L +G +L  SS E    W + ME + D G  W + GP       +  IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-TGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
                 + +++ + RS      V  + S D G +W      +L NPNSG D   L DGR 
Sbjct: 218 SALVHTDGSVQAIGRSQQ--NHVFSAWSRDHGRSWEPMTLLELANPNSGTDAAVLADGRS 275

Query: 252 LLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           LL YN           RG L VALS  DG+ W   LTLE++   EFSYPAVIQ  DG VH
Sbjct: 276 LLVYNPTEAGKNWWDGRGTLAVALSA-DGNHWTRVLTLEDSAKDEFSYPAVIQTRDGLVH 334

Query: 304 ITYTYNRTQIKHVVLLP 320
           I+YT+ R +IKHVVL P
Sbjct: 335 ISYTWKRIRIKHVVLDP 351


>gi|393781506|ref|ZP_10369701.1| hypothetical protein HMPREF1071_00569 [Bacteroides salyersiae
            CL02T12C01]
 gi|392676569|gb|EIY70001.1| hypothetical protein HMPREF1071_00569 [Bacteroides salyersiae
            CL02T12C01]
          Length = 1267

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 132/359 (36%), Positives = 188/359 (52%), Gaps = 54/359 (15%)

Query: 8    KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR- 66
            K ++ ++F +    A F  CH +T+ E  KG  + A+FGG+ E  PD  IW+     G  
Sbjct: 910  KAILKDQFLY--EQAAFPECHGATVAETPKGDLVAAFFGGTKERNPDCCIWVCRKVKGSD 967

Query: 67   -WQSPIIADEEPNVPM-----WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT 120
             W  P+   +    P      WNPVLF+LP   L+LFYK G +V  W+G + RS N G T
Sbjct: 968  VWTPPVKVADGVITPTLREACWNPVLFQLPKGELMLFYKRGDKVANWTGWLIRSKNGGKT 1027

Query: 121  WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI- 179
            WS  E+LP G LGP KNKP  + NG ++C SS E  N W    E++ D G++WRK GP+ 
Sbjct: 1028 WSSPEKLPDGFLGPVKNKPEYI-NGRIICPSSTEGENGWRIHFEISDDRGKTWRKVGPLE 1086

Query: 180  ----------------------------YIPNESLS---VIQPVPFHTANRTLRVLMRSF 208
                                         + +++L     IQP      +  L++L R+ 
Sbjct: 1087 ADLALLTQHQGGSNAGKAEEEDDPEGGEVVRDKNLKPIYAIQPSLLKHKDGRLQILCRTR 1146

Query: 209  NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------- 259
            N  G++  + S D G +W+  + T+++N NSG D V LKDGR LL YN            
Sbjct: 1147 N--GKLAEAWSADNGDSWTKLRLTEVVNNNSGTDAVTLKDGRHLLVYNDFGALPGTRKGV 1204

Query: 260  RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            R  L VA+S +DG +W  A+ LE++   ++SYP+VIQ  DG VHI YT+ R +IK+V L
Sbjct: 1205 RTPLSVAIS-EDGMTWDAAVVLEDSPISQYSYPSVIQGKDGKVHIVYTWRRQRIKYVEL 1262


>gi|432881173|ref|ZP_20097545.1| hypothetical protein A317_03829 [Escherichia coli KTE154]
 gi|431412123|gb|ELG95208.1| hypothetical protein A317_03829 [Escherichia coli KTE154]
          Length = 348

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 179/319 (56%), Gaps = 18/319 (5%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           ++ +E  FP      K  HASTI E+  G  LV++FGG+ EG  DVKIW  +  +  W  
Sbjct: 39  IIKQELIFPL--GLIKESHASTIAEMSNGTILVSWFGGTKEGDDDVKIWQSSMTESGWSK 96

Query: 70  PIIA-----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P +      D +P +P WNPVLF +  N   LFYK+G+    W G    S N G++WS++
Sbjct: 97  PRVVAYGSEDGKP-LPTWNPVLFNI-HNKTYLFYKVGKTPSTWEGKYITSTNNGLSWSKK 154

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
           + LP GI GP KNKPIL    +L   SS E+   W   +E T +     + +  I +   
Sbjct: 155 QALPNGIYGPIKNKPILSGKNVLFP-SSEENDKGWFVHIETTSNM----KNFNKIEVSKP 209

Query: 185 S-LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
           +  S IQP      +  +++  RS  G+  +  S+S D G +W+  + + + + NSG+DG
Sbjct: 210 TGTSAIQPTLLKFRDNEIKMYTRSDQGV--LMESQSSDSGHSWTTLRKSTIPSTNSGVDG 267

Query: 244 VKLKDGRLLLAYN-TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           + L+DG  L+ +N    R +L ++ S D+G +W + LTLE     EFSYPA+IQ+S+G +
Sbjct: 268 IVLRDGTGLIVHNPGKDRSILSISRSNDNGKTWGNVLTLESQEGKEFSYPAIIQSSNGKI 327

Query: 303 HITYTYNRTQIKHVVLLPN 321
           HITYTYNR  IKH+ L  N
Sbjct: 328 HITYTYNRESIKHITLEEN 346


>gi|285018990|ref|YP_003376701.1| sialidase/neuraminidase [Xanthomonas albilineans GPE PC73]
 gi|283474208|emb|CBA16709.1| putative sialidase/neuraminidase protein [Xanthomonas albilineans
           GPE PC73]
          Length = 358

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/312 (44%), Positives = 167/312 (53%), Gaps = 20/312 (6%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW-QSPIIADEE----- 76
           P   CHAST+VE D G  L A+FGG  EGA DV IW+       W  +  +AD       
Sbjct: 39  PTAQCHASTLVETDDG-LLAAWFGGQHEGADDVGIWVARRDAHGWLPAQRVADGAQAHGA 97

Query: 77  PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK 136
           P +P WNPVLF+     L LFYK+G + + W G    S + G  WS  E+LP GILGP K
Sbjct: 98  PPLPAWNPVLFQAALGPLRLFYKVGPDPKGWWGMQITSSDGGRHWSAPERLPTGILGPIK 157

Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
           NKP+ L +G +L  SS E    W + ME + D G  W + GP       +  IQP     
Sbjct: 158 NKPVQLASGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPAMNDPAKIGAIQPSVLVH 215

Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN 256
           AN  ++ + RS      V  + S D G TW       L NPNSG D V L DGR LL YN
Sbjct: 216 ANGRVQAVGRSRQ--NHVFSTWSHDQGRTWEPMTLLNLANPNSGTDAVVLADGRSLLVYN 273

Query: 257 TVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
                      RG+L VALS +DG  W   LTLE +   EFSYPAVIQ  DG +HI+YT+
Sbjct: 274 PTEAGKEWWDGRGILAVALS-NDGLHWTRVLTLENSPKDEFSYPAVIQTRDGQIHISYTW 332

Query: 309 NRTQIKHVVLLP 320
            RT IKHVVL P
Sbjct: 333 KRTHIKHVVLDP 344


>gi|424796165|ref|ZP_18221930.1| exported exo-alpha-sialidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422795084|gb|EKU23838.1| exported exo-alpha-sialidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 364

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 172/316 (54%), Gaps = 21/316 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ-SPIIAD- 74
           F    AP   CHAST++E   G  L A+FGG  EGA DV IW+       WQ +  +AD 
Sbjct: 41  FVNADAPTAQCHASTLLETRDG-LLAAWFGGRHEGADDVGIWVARRGAQGWQRAQRVADG 99

Query: 75  EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
           ++P    +P WNPVLF+     L LFYK+G + ++W G    S + GV WS  E+LP GI
Sbjct: 100 KQPQGAPLPAWNPVLFQPAKGPLRLFYKVGPDPKRWWGMQTTSRDGGVHWSAPERLPDGI 159

Query: 132 LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
           LGP KNKP+ L +G +L  SS E    W + ME + D G  W + GP       +  IQP
Sbjct: 160 LGPIKNKPVQLADGRILSPSSSED-AGWVAHMEWSDDNGAHWTR-GPALNDPARIGAIQP 217

Query: 192 -VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
            V  HT  R   +     N    V  + S D G +W+      L NPNSG D V L DGR
Sbjct: 218 SVLVHTDGRVQAIGRSQQN---HVFSTWSRDHGRSWAPMTLLDLANPNSGTDAVVLADGR 274

Query: 251 LLLAYNTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            LL YN           RG L VALS  DG+ W   LTLE++   EFSYPAVIQ  DG V
Sbjct: 275 SLLVYNPTEAGKDWWDGRGTLAVALSG-DGNHWARVLTLEDSAKDEFSYPAVIQTRDGLV 333

Query: 303 HITYTYNRTQIKHVVL 318
           HI+YT+ R +IKHVVL
Sbjct: 334 HISYTWKRIRIKHVVL 349


>gi|146299804|ref|YP_001194395.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium johnsoniae
           UW101]
 gi|146154222|gb|ABQ05076.1| hypothetical lipoprotein [Flavobacterium johnsoniae UW101]
          Length = 349

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 126/311 (40%), Positives = 171/311 (54%), Gaps = 26/311 (8%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ------SPIIADEEPNV 79
           + HAST+VE++    L A+FGG  EGA DV I++ ++K+ +W        P+I D +  +
Sbjct: 44  ASHASTLVELENNTLLAAWFGGKYEGAKDVGIYISSYKEQKWSVPKELIKPLIKDGD-TL 102

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
           P WNPVLFK  S  L LFYK+G+  ++W G M  S ++G TWS  + LP GILGP +NKP
Sbjct: 103 PCWNPVLFKSKSQNLYLFYKVGKNPREWFGAMIVSKDEGKTWSNSKYLPKGILGPIRNKP 162

Query: 140 ILLENGLLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE--SLSVIQPVPFH 195
           I    G++LCGSS ES   N W  ++E   +   SW     I   N+  +  +IQP    
Sbjct: 163 IETTPGVILCGSSTESVDDNKWRVFIETYTEGTDSW----TIADINDKKNFDIIQPTFLV 218

Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
            +++ +++L RS +   ++  S S D G TW       ++N NSG+D V L D   LL  
Sbjct: 219 HSDKEIQILSRSRH--NKLISSWSEDNGKTWQKTDSINVVNSNSGVDAVTLSDKSFLLVN 276

Query: 256 NTVS--------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
           N +         R VL V  SK DG +W     LE     EFSYPA+IQ SD  VHI YT
Sbjct: 277 NPLKMGKDWFNGRNVLDVEYSK-DGVNWKKLFDLENQKEGEFSYPAIIQTSDKKVHILYT 335

Query: 308 YNRTQIKHVVL 318
           YNR  IKH   
Sbjct: 336 YNRKFIKHTAF 346


>gi|256426128|ref|YP_003126781.1| neuraminidase [Chitinophaga pinensis DSM 2588]
 gi|256041036|gb|ACU64580.1| putative neuraminidase [Chitinophaga pinensis DSM 2588]
          Length = 370

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 124/324 (38%), Positives = 176/324 (54%), Gaps = 23/324 (7%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV EEF       P  + HASTI  +  G  L ++FGGS E APDV I+   F  G+W +
Sbjct: 53  LVREEFIM--EKPPVPAAHASTITALPDGDLLASWFGGSRESAPDVCIYTARFHKGKWSA 110

Query: 70  PIIA-----DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           P +      ++      WNPVL    +  ++LFYK+G   ++W G +  S + G++WS  
Sbjct: 111 PEVVASGVINDSTRYAAWNPVLITTKAGRVILFYKVGPNPREWWGMLISSTDNGLSWSAP 170

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWN--SWGSWMEVTVDAGRSWRKYGPIYIP 182
           E+LP  ILGP KNKP+ L +G +L  SS ES +   W   +E++   G  W++   + I 
Sbjct: 171 ERLPGNILGPIKNKPVQLADGSILHPSSTESLDEKDWHIHVELSDSTGHQWKR---VPID 227

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            ++  VIQP     A + L++L RS      +  + S D G TWS      +LNPNSGID
Sbjct: 228 CDTFGVIQPSALFHAGKRLQMLCRSRQNA--IVQTWSTDNGQTWSPLTKQSMLNPNSGID 285

Query: 243 GVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
            V  ++G+ +L YN           R  L+VA+S  DG+ W D   L +    EFSYPA+
Sbjct: 286 AVTTRNGQQVLVYNPAVSGKDWWNGRNELRVAVSI-DGEHWKDVYVLVKEAKGEFSYPAI 344

Query: 295 IQASDGSVHITYTYNRTQIKHVVL 318
           IQ  DG +HITYTY+R  I++V L
Sbjct: 345 IQTKDGILHITYTYDRKNIRYVAL 368


>gi|332188854|ref|ZP_08390562.1| BNR/Asp-box repeat family protein [Sphingomonas sp. S17]
 gi|332011120|gb|EGI53217.1| BNR/Asp-box repeat family protein [Sphingomonas sp. S17]
          Length = 358

 Score =  210 bits (534), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 128/322 (39%), Positives = 173/322 (53%), Gaps = 26/322 (8%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-----IADE 75
           +AP+   HASTI E+  G    A+F GS E  PDV+IW        W +P+     I+ +
Sbjct: 39  NAPYPQAHASTIAELPDGTLAAAWFAGSGESRPDVRIWFARRGTKGWDAPVPVADGISAD 98

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
               P WNPVLF+ P   L L+YK+G   ++W G + RS + G  W+  E+LP G+LGP 
Sbjct: 99  GRRYPTWNPVLFQPPGEPLHLYYKVGPNPRQWWGTVIRSSDGGRHWTRPERLPDGVLGPI 158

Query: 136 KNKPILLENGLLLCGSSVE----SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP 191
           KNKP++  +G  L  SS E      N W   +E + D G+SW   GP       +  IQP
Sbjct: 159 KNKPVIAPDGAWLSPSSREEGTAEANIWSLRIERSTDKGKSW-TVGPRIASPMHIEAIQP 217

Query: 192 -VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
            + FH   R L ++ R+  G   +  S S D G+TWS      L NPN+G D V L DGR
Sbjct: 218 SILFHRDGR-LELVARTRQGA--LAQSWSRDKGVTWSPIAAIDLPNPNAGTDAVTLADGR 274

Query: 251 LLLAYNTVS----------RGVLKVALSKDDGDSWHDALTLEEN-LAMEFSYPAVIQASD 299
            L+ YN  +          R  L + LS DDG  W +ALTLE   +   ++YPAV+Q  D
Sbjct: 275 QLVVYNHSAHAPDTPGDGPRWPLNIGLS-DDGVRWRNALTLESKPMPDGYAYPAVVQTRD 333

Query: 300 GSVHITYTYNRTQIKHVVLLPN 321
           G VH+TYT+NR  I+HVV+ P 
Sbjct: 334 GLVHVTYTWNRQHIRHVVIDPR 355


>gi|325299331|ref|YP_004259248.1| alpha-L-rhamnosidase [Bacteroides salanitronis DSM 18170]
 gi|324318884|gb|ADY36775.1| alpha-L-rhamnosidase [Bacteroides salanitronis DSM 18170]
          Length = 1273

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 139/353 (39%), Positives = 185/353 (52%), Gaps = 52/353 (14%)

Query: 14   EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPI 71
            E  F    A F  CHASTIVE++ G  + A+FGG+ E  PDV IW+     G   W  P+
Sbjct: 917  ENQFLYEKASFPECHASTIVELENGDLVAAFFGGTKERNPDVCIWVCRKPKGSDTWTEPV 976

Query: 72   -IAD-----EEPNVPM--------------WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM 111
             +AD      +P+  +              WNPVLF++P   LLLF+KIG  V  W+G +
Sbjct: 977  KVADGVFDLNDPDAKIAGITADIKDHRKACWNPVLFQVPGGDLLLFFKIGLNVPDWTGWL 1036

Query: 112  KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
             RS + G TWS RE LP G LGP KNKP  + NG +LC SS E    W   +E + D G+
Sbjct: 1037 VRSKDGGKTWSRREALPQGFLGPIKNKPEFI-NGRILCPSSTEGEQGWRIHIEYSDDMGK 1095

Query: 172  SWRKYGPI----YIPNE------------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
            +W+  GPI      P++             + VIQP      + TL+V+ R+ N    + 
Sbjct: 1096 TWKTTGPIPAELEFPSQYRNMNADEKEKRPILVIQPSILKHKDGTLQVICRTRN--SHLA 1153

Query: 216  MSESCDGGLTWSYAKPTQ-LLNPNSGIDGVKLKDGRLLLAYNTVS---------RGVLKV 265
             S S D G TWS     + L N NSG D V L+DGR  L YN  S         R  + +
Sbjct: 1154 TSWSKDNGSTWSKVTLIEGLPNNNSGTDAVTLQDGRHALVYNNFSPLLGDKKGVRTPINL 1213

Query: 266  ALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            ALS DDG  W   LTLE++   E+SYPA+IQ  DG +HIT+T+ R  IK++ +
Sbjct: 1214 ALS-DDGIHWTPVLTLEDSPVREYSYPAIIQGKDGKLHITFTWRRELIKYMEI 1265


>gi|395803564|ref|ZP_10482808.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
 gi|395434118|gb|EJG00068.1| BNR repeat-containing glycosyl hydrolase [Flavobacterium sp. F52]
          Length = 350

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 124/318 (38%), Positives = 171/318 (53%), Gaps = 20/318 (6%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-- 71
           E ++ A+     + HAST+VE      L A+FGG  EGA DV I++  +KD +W +P   
Sbjct: 32  ESSYIADPPVTTNSHASTLVEYKPNEILAAWFGGKYEGAKDVGIYISAYKDKKWSTPKEL 91

Query: 72  ---IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
              +      +P WNPVLFK  S  L LFYKIG+  ++W G MK S + G TW E + LP
Sbjct: 92  IQPLIKNGDTLPCWNPVLFKSKSQNLYLFYKIGKNPREWFGAMKVSKDNGTTWGEPKYLP 151

Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNS--WGSWMEVTVDAGRSWRKYGPIYIPNESL 186
            GILGP +NKPI    G++LCGSS ES N+  W   +E   +A  SW+K       N++ 
Sbjct: 152 EGILGPIRNKPIETAPGVILCGSSTESVNTDQWRVHVEEYTEATDSWKKIA--VENNQNF 209

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
            +IQP     +   L++L RS +   +V  S S D G +W       ++N NSGID + +
Sbjct: 210 DIIQPTFLIHSATDLQMLSRSKH--NKVVSSWSGDNGKSWIRTNTINVINSNSGIDALTV 267

Query: 247 KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
                LL  N +         R +L V  SK DG +W     LE+    EFSYPA+IQ +
Sbjct: 268 NKNLFLLVNNPLPMGKDWFNGRNILDVEYSK-DGLNWTKLFDLEKQEKGEFSYPAIIQTA 326

Query: 299 DGSVHITYTYNRTQIKHV 316
           D  +H+ YTY+R  IKH 
Sbjct: 327 DKRIHVLYTYDRKYIKHT 344


>gi|298480648|ref|ZP_06998844.1| alpha-rhamnosidase [Bacteroides sp. D22]
 gi|298273082|gb|EFI14647.1| alpha-rhamnosidase [Bacteroides sp. D22]
          Length = 1300

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 133/347 (38%), Positives = 182/347 (52%), Gaps = 32/347 (9%)

Query: 1    MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
            +  + G +G + +EF    N  PF   HAS+I E  +G  + A+FGG+ E  PD  IW+ 
Sbjct: 946  LDAENGRQGFITDEFI--CNYPPFPESHASSIAETKEG-LVAAWFGGTKEKNPDCCIWVS 1002

Query: 61   TFKDGRWQSPI-IAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN 116
                  W  P  +AD   +      WNPVL   PS  L L+YKIG  V  WSG + RS++
Sbjct: 1003 RHTPQGWSKPQKVADGILDGTKYACWNPVLTVTPSGELQLYYKIGINVSGWSGHVIRSHD 1062

Query: 117  KGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
             G +WS  E LP G LGP KNKP+ +    ++C SS E  N W    E++ D G +W+K 
Sbjct: 1063 CGKSWSRPESLPEGFLGPIKNKPVWI-GKRMICPSSTEGENGWQVHFEISDDEGHTWKKT 1121

Query: 177  GPI----YIPNE----------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
             PI     I  E          ++  IQP      +  L+ L R+ N  G +  + S D 
Sbjct: 1122 NPIDAQEIIETETIKRSNAQKSTIQAIQPTILVHRDGKLQALCRTQNN-GSIASTWSYDQ 1180

Query: 223  GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDS 274
            G TW+  + T+L N NSGIDGV LKDGR LL YN           R  + VA+S +DG  
Sbjct: 1181 GETWTPLEFTELPNNNSGIDGVTLKDGRFLLVYNHYRYIRGKKKKRTPINVAIS-NDGIH 1239

Query: 275  WHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            W   + LE++   ++SYP+VIQ  DG VHI YT+ R +IK+  L P+
Sbjct: 1240 WQAGVVLEDSPINQYSYPSVIQGKDGKVHIVYTWRRQRIKYACLDPD 1286


>gi|423221594|ref|ZP_17208064.1| hypothetical protein HMPREF1062_00250 [Bacteroides cellulosilyticus
            CL02T12C19]
 gi|392645921|gb|EIY39641.1| hypothetical protein HMPREF1062_00250 [Bacteroides cellulosilyticus
            CL02T12C19]
          Length = 1289

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 180/342 (52%), Gaps = 32/342 (9%)

Query: 6    GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
            G KGL+ ++F        F   HA+TIVE  +G  + A+FGG+ E  PD  IW+      
Sbjct: 940  GRKGLLEDDFI--CRKPSFPESHAATIVEGRRG-IVAAWFGGTKEKNPDCCIWVSRKTKT 996

Query: 66   RWQSP-IIAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
             W  P ++AD   +      WNPVL + P   L L+YKIG  V  WSG +  S + G TW
Sbjct: 997  GWTEPQMVADGVLDGTKYACWNPVLTETPKGELQLYYKIGVNVAGWSGHVVTSKDGGKTW 1056

Query: 122  SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
            S+   LP G LGP KNKP+ +    ++C SS E  N W    E++ D G++WRK GPI  
Sbjct: 1057 SKSRALPEGFLGPIKNKPVWI-GKRMICPSSTEDENGWRIHFEISDDEGKTWRKIGPITA 1115

Query: 182  --------------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                           ++++ VIQP      +  L+ L R+ N  G V  + S D G +WS
Sbjct: 1116 SEIVETDQVKFSNQKHKTIQVIQPTILTHKDGKLQALCRTQNN-GSVATTWSEDNGESWS 1174

Query: 228  YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
                  L N NSGIDG+ LKDGR LL YN           R  + VA+S DDG  W  A+
Sbjct: 1175 PVTFIDLPNNNSGIDGITLKDGRHLLVYNHYRYIKGKRKERTPINVAIS-DDGMHWKAAV 1233

Query: 280  TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LE++   ++SYP+VIQ+ DG VHI YT+ R +IK+  L P 
Sbjct: 1234 VLEDSPINQYSYPSVIQSKDGKVHIVYTWRRQRIKYACLDPQ 1275


>gi|300726577|ref|ZP_07060019.1| putative alpha-rhamnosidase [Prevotella bryantii B14]
 gi|299776145|gb|EFI72713.1| putative alpha-rhamnosidase [Prevotella bryantii B14]
          Length = 1251

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 184/342 (53%), Gaps = 41/342 (11%)

Query: 8    KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG-- 65
            +  + ++F +   S P    HASTI E+  G  L AYFGG+ E  PDV IW+     G  
Sbjct: 908  EAFMEDQFLYEYTSFP--QAHASTITELKNGDLLAAYFGGTKEKNPDVCIWVSRKPKGAD 965

Query: 66   RWQSPII----ADEEPNVPM----------WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM 111
            +W++P++     D+  + P+          WNPVLF +P   +  F+KI +E+  W G +
Sbjct: 966  KWEAPMMIADGTDDYLHRPLNSTDKARKACWNPVLFTMPDGEIWCFFKIAEEISDWQGWV 1025

Query: 112  KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
             +S + G TWS+RE LP G +GP KNKP+L+ +  LLC SS E  + W   +E+     +
Sbjct: 1026 VKSKDGGKTWSKREMLPKGFIGPVKNKPVLIGD-RLLCPSSTEG-DWWKFHVEIYNIKTK 1083

Query: 172  SWRKYGPIYIPNESLS---------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
             W+  GPI   +  L+          IQP      +  L VLMRS N   ++  S S D 
Sbjct: 1084 EWKYVGPIEAEDAVLTDDQKVHPIKCIQPSILQLKDGRLMVLMRSHN--AKLAKSYSSDN 1141

Query: 223  GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGD 273
            G TWS  + +++ N  SG D V LKDGR +L YN            R  L +A+S +DG+
Sbjct: 1142 GETWSKVELSEVENNQSGTDAVTLKDGRHVLIYNNFETLMGTKKGPRTPLSIAVS-EDGE 1200

Query: 274  SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
             WH +LTLE++   ++SYPA+IQ  DG++H  YT+ R +I +
Sbjct: 1201 HWHHSLTLEDSPVSQYSYPAIIQGKDGNLHCVYTWRRERIAY 1242


>gi|224538250|ref|ZP_03678789.1| hypothetical protein BACCELL_03141, partial [Bacteroides
            cellulosilyticus DSM 14838]
 gi|224520136|gb|EEF89241.1| hypothetical protein BACCELL_03141 [Bacteroides cellulosilyticus DSM
            14838]
          Length = 1248

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 179/342 (52%), Gaps = 32/342 (9%)

Query: 6    GIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG 65
            G KGL+ ++F        F   HA+TIVE  +G  + A+FGG+ E  PD  IW+      
Sbjct: 899  GRKGLLEDDFI--CRKPSFPESHAATIVEGRRG-IVAAWFGGTKEKNPDCCIWVSRKTKT 955

Query: 66   RWQSP-IIAD---EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTW 121
             W  P ++AD   +      WNPVL + P   L L+YKIG  V  WSG +  S + G TW
Sbjct: 956  GWTEPQMVADGVLDGTKYACWNPVLTETPKGELQLYYKIGVNVAGWSGHVVTSKDGGKTW 1015

Query: 122  SEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
            S+   LP G LGP KNKP+ +    ++C SS E  N W    E++ D G++WRK GPI  
Sbjct: 1016 SKSRALPEGFLGPIKNKPVWI-GKRMICPSSTEDENGWRIHFEISDDEGKTWRKIGPITA 1074

Query: 182  --------------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                           ++++ VIQP      +  L+ L R+ N  G V  + S D G +WS
Sbjct: 1075 SEIVETDQVKFSNQKHKTIQVIQPTILTHKDGKLQALCRTQNN-GSVATTWSEDNGESWS 1133

Query: 228  YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
                  L N NSGIDG+ LKDGR LL YN           R  + VA+S DDG  W  A+
Sbjct: 1134 PVTFIDLPNNNSGIDGITLKDGRHLLVYNHYRYIKGKRKERTPINVAIS-DDGMHWKAAV 1192

Query: 280  TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LE++   ++SYP+VIQ  DG VHI YT+ R +IK+  L P 
Sbjct: 1193 VLEDSPINQYSYPSVIQGKDGKVHIVYTWRRQRIKYACLDPQ 1234


>gi|32471321|ref|NP_864314.1| hypothetical protein RB1257 [Rhodopirellula baltica SH 1]
 gi|32443162|emb|CAD71993.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 375

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 128/342 (37%), Positives = 181/342 (52%), Gaps = 31/342 (9%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           +D    G++ ++F +  + APF SCHASTI +   G    A+FGG  EGA DV IW+   
Sbjct: 40  QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 96

Query: 63  KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
               W +P ++AD    +    P WNPVLF  PS    LF K+G   + W G +  S + 
Sbjct: 97  DGNSWSAPRMVADGVQPDGSRHPCWNPVLFHAPSGATWLFIKVGPSPKDWWGEVLFSDDS 156

Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
           G T+ +R +LP G+LGP + KP L+++G  LLCGSS E  + W    E     G++    
Sbjct: 157 GKTFRDRTRLPEGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 215

Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                W +   I+   E  + IQP     A+  LR L R+   +  +  ++S D G TW+
Sbjct: 216 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 272

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
               T + NPNSGID V  KDGR  L  N +         R  L +++S  DG ++ +  
Sbjct: 273 KPIATDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 331

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            LE     EFSYPA+IQA DG +H+TYT+ R +IKHV+  PN
Sbjct: 332 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPN 373


>gi|255036622|ref|YP_003087243.1| hypothetical protein Dfer_2863 [Dyadobacter fermentans DSM 18053]
 gi|254949378|gb|ACT94078.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
          Length = 371

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/313 (39%), Positives = 173/313 (55%), Gaps = 17/313 (5%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           +EF F  +  PF  CHAST+V +D G FL+A+FGG+ E  PDV IWL   + G+W +P+ 
Sbjct: 45  KEFVF-GDQRPFPQCHASTLVRLDDGQFLIAWFGGTEEKNPDVGIWLSKGRPGQWSAPVE 103

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PP 129
             +      WNPVL K     ++L++K+G+E+  W   +K S + G TWS+  +L     
Sbjct: 104 VAKIREDAHWNPVLQKTSDGKVILYFKVGKEIAHWETWVKTSSDNGETWSDAYELVKGDK 163

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS-- 187
           G  GP K+K I L NG  L G+S E  N W  +++ + D G++W       I    +   
Sbjct: 164 GGRGPVKDKLIELSNGDWLAGASNEV-NRWEVFVDRSTDKGKTWTASPYFKIDTTEIKGK 222

Query: 188 -VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
             IQP  + +    + +L+R+  G+  +  S+S D G TWS    T L NPNSGID  KL
Sbjct: 223 GAIQPTLWESEPGQVHMLVRTTGGV--IGRSDSKDNGKTWSTITKTSLPNPNSGIDLAKL 280

Query: 247 KDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQASDG 300
            DG L+LAYN       SR  L + LS DDG +W D + +       E+SYPAVI   D 
Sbjct: 281 NDGTLVLAYNPDDHNWGSRSPLSLILSYDDGQNWTDKIDIATGKKEDEYSYPAVISWGD- 339

Query: 301 SVHITYTYNRTQI 313
           SV +TYTYNR +I
Sbjct: 340 SVAVTYTYNRRKI 352


>gi|312130600|ref|YP_003997940.1| hypothetical protein Lbys_1885 [Leadbetterella byssophila DSM
           17132]
 gi|311907146|gb|ADQ17587.1| hypothetical protein Lbys_1885 [Leadbetterella byssophila DSM
           17132]
          Length = 349

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 126/338 (37%), Positives = 186/338 (55%), Gaps = 32/338 (9%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
           +GL+ E   +P +  P   CHAST++EV+ G  L A+F G+ E   DV I +  FK+G+W
Sbjct: 18  RGLIHEGLVYPLDKKPTAQCHASTLLEVNDG-ILCAFFAGTHEKHADVGIRVARFKEGKW 76

Query: 68  QSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
             P+       ++    P WNPVLF+     + LFYK+G +V  W G    S ++G+TWS
Sbjct: 77  SWPVEVVNGYVNDTLRYPTWNPVLFRPKDGPIYLFYKVGPDVDHWWGAYVTSEDEGLTWS 136

Query: 123 E-----REQLPPGILGPSKNKPILLENGLLLCGSSVESWNS-----WGSWMEVTVDAGRS 172
                 +  +   +LGP KNK I L +G ++  +S+E   +     W  + EV+ D G++
Sbjct: 137 TPQVMGKHAIVGDLLGPVKNKAIQLPDGTIISPTSMERRGTENGRDWRIYFEVSKDNGKN 196

Query: 173 WRKYGPIYIPNESLS--VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
           W+   PI   N+ +    IQP      N  L++L R+   +  +  S S D G TWS   
Sbjct: 197 WQVIPPI---NDGVQYDAIQPSILIHKNGDLQILARTLQDV--LVTSWSKDKGKTWSPLT 251

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENL 285
           P+ L NPNSG D + L+DGR +L YN       +R VL VA+S +DG  W   +++ EN+
Sbjct: 252 PSGLPNPNSGTDALTLQDGRHVLVYNHSVRSGENRNVLNVAVS-EDGVDW-KMVSVLENV 309

Query: 286 AME--FSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            +   +SYPAVIQ  DG VHI+YTY R  IKH++  P+
Sbjct: 310 PIHSGYSYPAVIQTKDGKVHISYTYARQSIKHMIFDPS 347


>gi|449138024|ref|ZP_21773326.1| BNR/Asp-box repeat domain protein [Rhodopirellula europaea 6C]
 gi|448883351|gb|EMB13882.1| BNR/Asp-box repeat domain protein [Rhodopirellula europaea 6C]
          Length = 362

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 128/342 (37%), Positives = 183/342 (53%), Gaps = 31/342 (9%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           +D    G++ ++F +  + APF SCHASTI +   G F+ A+FGG  EGA DV IW+  +
Sbjct: 26  QDDAPAGVLQQQFIY--DEAPFPSCHASTICQTPNG-FVAAWFGGEREGAKDVSIWVSDY 82

Query: 63  KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
               W +P  +AD    +    P WNPVLF+ PS    LF K+G   + W G +  S + 
Sbjct: 83  DGKAWSAPRKVADGVQADRSRHPCWNPVLFQAPSGVTWLFIKVGPSPRDWWGEVLFSDDG 142

Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
           G T+ +R +LP G+LGP + KP L+++G  LLCGSS E  + W    E  +  G +    
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDDGKTLLCGSSTEH-DGWRVHFERLMLTGEANAEA 201

Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                W +   I+   E  + IQP     A+  LR L R+   +  +  ++S D G TW+
Sbjct: 202 VSETHWDRGEAIHDGKE-FAAIQPAILRLADGRLRALCRTMQSV--IVQTDSDDNGRTWT 258

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
               T + NPNSGID V  KDGR  L  N +         R  L +++S  DG+++ +  
Sbjct: 259 KPISTDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGETYREIA 317

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            LE     EFSYPA+IQA DG  H+TYT+ R +IKHV+  P 
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKFHMTYTWKRKKIKHVIFDPT 359


>gi|417303952|ref|ZP_12090991.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica WH47]
 gi|327539742|gb|EGF26347.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica WH47]
          Length = 361

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 184/342 (53%), Gaps = 31/342 (9%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           +D    G++ ++F +  + APF SCHASTI +   G    A+FGG  EGA DV IW+   
Sbjct: 26  QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82

Query: 63  KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
               W +P ++AD    ++   P WNPVLF  PS    LF K+G   + W G +  S + 
Sbjct: 83  DGNSWSAPRMVADGVQPDDSRHPCWNPVLFHAPSGVTWLFIKVGPSPKDWWGEVLFSDDG 142

Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
           G T+ +R +LP G+LGP + KP L+++G  LLCGSS E  + W    E     G++    
Sbjct: 143 GKTFRDRTRLPKGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201

Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                W +   I+  NE  + IQP     A+  LR L R+   +  +  ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGNE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
               T + NPNSG+D V  KDGR  L  N +         R  L +++S  DG ++ +  
Sbjct: 259 KPIATDMPNPNSGLDIVTAKDGRQWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 317

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            LE     EFSYPA+IQA DG +H+TYT+ R +IKHV++ P+
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIVDPS 359


>gi|440717920|ref|ZP_20898393.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
 gi|436436929|gb|ELP30617.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
          Length = 361

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 128/342 (37%), Positives = 180/342 (52%), Gaps = 31/342 (9%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           +D    G++ ++F +  + APF SCHASTI +   G    A+FGG  EGA DV IW+   
Sbjct: 26  QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82

Query: 63  KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
               W +P  +AD    +    P WNPVLF  PS    LF K+G   + W G +  S + 
Sbjct: 83  DGNSWSAPRKVADGVQSDGSRHPCWNPVLFHAPSGATWLFIKVGPSPKDWWGEVLFSDDS 142

Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
           G T+ +R +LP G+LGP + KP L+++G  LLCGSS E  + W    E     G++    
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDHGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201

Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                W +   I+   E  + IQP     A+  LR L R+   +  +  ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
               T + NPNSGID V  KDGR  L  N +         R  L +++S  DG ++ +  
Sbjct: 259 KPIATDMPNPNSGIDVVTAKDGRHWLISNPLPSKENGWGGRSRLTLSVST-DGKTYREVA 317

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            LE     EFSYPA+IQA DG +H+TYT+ R +IKHV+  PN
Sbjct: 318 VLENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPN 359


>gi|440716314|ref|ZP_20896825.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
 gi|436438660|gb|ELP32185.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SWK14]
          Length = 381

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 134/357 (37%), Positives = 177/357 (49%), Gaps = 45/357 (12%)

Query: 5   CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
            G   LV+ +  F     P K  HASTIVE   G  + A+F G+ E  PDV I +   ++
Sbjct: 28  AGEGALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W   +     +       P WNPVLF+     L+LFYK+G   ++W G +  S + G 
Sbjct: 87  GQWTDSVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146

Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
           TWS   +L        +LGP KNKPI L +G +LC SS E       + W    EVT D 
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPIELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206

Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
           G++W   GPI    E+   IQP      +  L+VL RS     R+  S S DGG TWS  
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDNRLQVLCRSKEK--RIVESWSKDGGKTWSTL 263

Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
             T L NPNSG D V L+DGR +L YN                R +L +A+S  DG +W 
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRREDAPDLKPRRILNLAISS-DGKTWK 322

Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LTLE                 E+SYPA+IQ SDG +++ YTYNR  +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDLDRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379


>gi|423213426|ref|ZP_17199955.1| hypothetical protein HMPREF1074_01487 [Bacteroides xylanisolvens
            CL03T12C04]
 gi|392693886|gb|EIY87116.1| hypothetical protein HMPREF1074_01487 [Bacteroides xylanisolvens
            CL03T12C04]
          Length = 1286

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 129/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+SKD G +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVSKD-GINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|32475400|ref|NP_868394.1| hypothetical protein RB8501 [Rhodopirellula baltica SH 1]
 gi|32445941|emb|CAD78672.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 381

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/357 (36%), Positives = 177/357 (49%), Gaps = 45/357 (12%)

Query: 5   CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
            G   LV+ +  F     P K  HASTIVE   G  + A+F G+ E  PDV I +   ++
Sbjct: 28  AGEDALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W   +     +       P WNPVLF+     L+LFYK+G   ++W G +  S + G 
Sbjct: 87  GQWTESVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146

Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
           TWS   +L        +LGP KNKP+ L +G +LC SS E       + W    EVT D 
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPVELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206

Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
           G++W   GPI    E+   IQP      +  L++L RS     R+  S S DGG TWS  
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDDRLQILCRSKEK--RIVESWSKDGGKTWSTL 263

Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
             T L NPNSG D V L+DGR +L YN                R +L +A+S  DG +W 
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRRKDAPDLKPRRILNLAISS-DGKTWK 322

Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LTLE                 E+SYPA+IQ SDG +++ YTYNR  +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDSDRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379


>gi|421609016|ref|ZP_16050221.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
 gi|408500212|gb|EKK04666.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
          Length = 381

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 133/357 (37%), Positives = 177/357 (49%), Gaps = 45/357 (12%)

Query: 5   CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
            G   LV+ +  F     P K  HASTIVE   G  + A+F G+ E  PDV I +   ++
Sbjct: 28  AGEGALVSSQQIFDLADKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRHEN 86

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W   +     +       P WNPVLF+     L+LFYK+G   ++W G +  S + G 
Sbjct: 87  GQWTESVEVVSGVQSSTLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146

Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
           TWS   +L        +LGP KNKPI L +G +LC SS E       + W    EVT D 
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPIELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206

Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
           G++W   GPI    E+   IQP      +  L++L RS     R+  S S DGG TWS  
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQPSILTYPDDRLQILCRSKEK--RIVESWSKDGGKTWSTL 263

Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWH 276
             T L NPNSG D V L+DGR +L YN                R +L +A+S  DG +W 
Sbjct: 264 TATDLPNPNSGTDAVTLQDGRQVLIYNHSEGLVRREDAPDLKPRRILNLAISS-DGKTWK 322

Query: 277 DALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             LTLE                 E+SYPA+IQ SDG +++ YTYNR  +KH V+ P+
Sbjct: 323 PVLTLEHETGPHPKDADRRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 379


>gi|298481333|ref|ZP_06999526.1| alpha-rhamnosidase [Bacteroides sp. D22]
 gi|298272537|gb|EFI14105.1| alpha-rhamnosidase [Bacteroides sp. D22]
          Length = 1286

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTAPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|237720806|ref|ZP_04551287.1| glycoside hydrolase family 78 protein [Bacteroides sp. 2_2_4]
 gi|229449641|gb|EEO55432.1| glycoside hydrolase family 78 protein [Bacteroides sp. 2_2_4]
          Length = 1286

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--GNASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGTKPIYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|383113010|ref|ZP_09933790.1| hypothetical protein BSGG_0147 [Bacteroides sp. D2]
 gi|382948894|gb|EFS29447.2| hypothetical protein BSGG_0147 [Bacteroides sp. D2]
          Length = 1290

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 130/381 (34%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 913  AILSDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 970

Query: 67   WQSPIIA-------------------------DEEPNV-----PMWNPVLFKLPSNGLLL 96
            W +P +A                         DE+  +       WNPVLF++P   L+L
Sbjct: 971  WTAPKLAADGVFSIKDSQAVLAGIDSTCTPVKDEKGKLIARRKACWNPVLFQIPGGDLIL 1030

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1031 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1089

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1090 NGWRVHFEISDDKGKTWKMIGPLDAELSVPTQNRKKGGMNVDDQEGGEAIRGEGAKPVYA 1149

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L+VL R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1150 IQPSILKHKDGRLQVLCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1207

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1208 GRHILIYNNFSTLPGTPKGPRTPLCVAIS-EDGINWKPVLTLEDSPISQYSYPSIIQGKD 1266

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1267 GKLHAIYTWRRQRIKYAEIDP 1287


>gi|160886119|ref|ZP_02067122.1| hypothetical protein BACOVA_04126 [Bacteroides ovatus ATCC 8483]
 gi|423286899|ref|ZP_17265750.1| hypothetical protein HMPREF1069_00793 [Bacteroides ovatus CL02T12C04]
 gi|156108932|gb|EDO10677.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides ovatus
            ATCC 8483]
 gi|392674437|gb|EIY67885.1| hypothetical protein HMPREF1069_00793 [Bacteroides ovatus CL02T12C04]
          Length = 1286

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|262408006|ref|ZP_06084554.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294808089|ref|ZP_06766862.1| BNR/Asp-box repeat protein [Bacteroides xylanisolvens SD CC 1b]
 gi|345511520|ref|ZP_08791060.1| glycoside hydrolase family 78 [Bacteroides sp. D1]
 gi|229444052|gb|EEO49843.1| glycoside hydrolase family 78 [Bacteroides sp. D1]
 gi|262354814|gb|EEZ03906.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294444730|gb|EFG13424.1| BNR/Asp-box repeat protein [Bacteroides xylanisolvens SD CC 1b]
          Length = 1286

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|293369107|ref|ZP_06615702.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CMC 3f]
 gi|292635819|gb|EFF54316.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CMC 3f]
          Length = 1286

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 967  WTAPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|294645187|ref|ZP_06722909.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CC 2a]
 gi|292639474|gb|EFF57770.1| BNR/Asp-box repeat protein [Bacteroides ovatus SD CC 2a]
          Length = 545

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 189/381 (49%), Gaps = 75/381 (19%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 168 AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 225

Query: 67  WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
           W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 226 WTTPKLAADGVFSLKDPQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 285

Query: 97  FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
           FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 286 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 344

Query: 157 NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
           N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 345 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIKGEGAKPVYA 404

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 405 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLNVPNNNSGTDAVTMKD 462

Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 463 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 521

Query: 300 GSVHITYTYNRTQIKHVVLLP 320
           G +H  YT+ R +IK+  + P
Sbjct: 522 GKLHAIYTWRRQRIKYAEIDP 542


>gi|421611775|ref|ZP_16052906.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
 gi|408497418|gb|EKK01946.1| BNR/Asp-box repeat domain protein [Rhodopirellula baltica SH28]
          Length = 361

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 180/342 (52%), Gaps = 31/342 (9%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           +D    G++ ++F +  + APF SCHASTI +   G    A+FGG  EGA DV IW+   
Sbjct: 26  QDDAPAGVLHQQFIY--DEAPFPSCHASTICQTSNG-LAAAWFGGEREGAKDVSIWVSDH 82

Query: 63  KDGRWQSP-IIAD----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK 117
               W +P ++AD    +    P WNPVLF  PS    LF K+G   + W G +  S + 
Sbjct: 83  DGNSWSAPRMVADGVQPDGSRHPCWNPVLFHAPSGVTWLFIKVGPSPKDWWGEVLFSDDS 142

Query: 118 GVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTVDAGRS---- 172
           G T+ +R +LP G+LGP + KP L++ G  LLCGSS E  + W    E     G++    
Sbjct: 143 GKTFRDRTRLPEGVLGPIRCKPELVDQGKTLLCGSSTEH-DGWRVHFERLTLTGKADSDA 201

Query: 173 -----WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
                W +   I+   E  + IQP     A+  LR L R+   +  +  ++S D G TW+
Sbjct: 202 VSASHWDRGEAIHDGKE-FAAIQPTILRLADGRLRALCRTMQSV--IVQTDSNDNGRTWT 258

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDAL 279
               T + NPNSGID V  KDGR  L  N +         R  L +++S  DG ++ +  
Sbjct: 259 KPIATDMPNPNSGIDIVTAKDGRQWLISNPLPSKENGWGRRSRLTLSVST-DGKTYREVA 317

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
            LE     EFSYPA+IQA DG +H+TYT+ R +IKHV+  P+
Sbjct: 318 ALENEPKGEFSYPAIIQADDGKLHMTYTWKRKKIKHVIFDPS 359


>gi|423298606|ref|ZP_17276662.1| hypothetical protein HMPREF1070_05327 [Bacteroides ovatus CL03T12C18]
 gi|392662349|gb|EIY55910.1| hypothetical protein HMPREF1070_05327 [Bacteroides ovatus CL03T12C18]
          Length = 1286

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 129/381 (33%), Positives = 191/381 (50%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR- 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+ +  KD + 
Sbjct: 909  AILSDEFLY--KKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKDSKE 966

Query: 67   WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
            W +P +A        D +  +                        WNPVLF++P   L+L
Sbjct: 967  WTAPQLAADGVFSLKDSQAALAGIDSTCTPVKNEKGKLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|336402501|ref|ZP_08583236.1| hypothetical protein HMPREF0127_00549 [Bacteroides sp. 1_1_30]
 gi|335948350|gb|EGN10064.1| hypothetical protein HMPREF0127_00549 [Bacteroides sp. 1_1_30]
          Length = 1286

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 127/381 (33%), Positives = 186/381 (48%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------------------------------EPNVPMWNPVLFKLPSNGLLL 96
            W +P +A +                                    WNPVLF++P   L+L
Sbjct: 967  WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|295084330|emb|CBK65853.1| Alpha-L-rhamnosidase N-terminal domain./BNR/Asp-box repeat./Bacterial
            alpha-L-rhamnosidase. [Bacteroides xylanisolvens XB1A]
          Length = 1286

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 127/381 (33%), Positives = 186/381 (48%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++ +EF +   +A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILTDEFLY--ENASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGSKE 966

Query: 67   WQSPIIADE------------------------------EPNVPMWNPVLFKLPSNGLLL 96
            W +P +A +                                    WNPVLF++P   L+L
Sbjct: 967  WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDAKGTLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREALPKGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGGVNVDDQEGGEAIEGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1263 GKLHAIYTWRRQRIKYAEIDP 1283


>gi|336416259|ref|ZP_08596595.1| hypothetical protein HMPREF1017_03703 [Bacteroides ovatus 3_8_47FAA]
 gi|335938990|gb|EGN00869.1| hypothetical protein HMPREF1017_03703 [Bacteroides ovatus 3_8_47FAA]
          Length = 1286

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 128/379 (33%), Positives = 188/379 (49%), Gaps = 75/379 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +   S P   CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 909  AILSDEFLYKKTSFP--ECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKLKGSKE 966

Query: 67   WQSPIIA-------------------------DEEPNV-----PMWNPVLFKLPSNGLLL 96
            W +P +A                         DE+  +       WNPVLF++P   L+L
Sbjct: 967  WTAPKLAADGVFSLKDSQAVLAGIDSTCTPVKDEQGKLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVL 318
            G +H  YT+ R +IK+  +
Sbjct: 1263 GKLHAIYTWRRQRIKYAEI 1281


>gi|312129645|ref|YP_003996985.1| neuraminidase [Leadbetterella byssophila DSM 17132]
 gi|311906191|gb|ADQ16632.1| putative neuraminidase [Leadbetterella byssophila DSM 17132]
          Length = 336

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 121/325 (37%), Positives = 177/325 (54%), Gaps = 28/325 (8%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
           V +E   PA   P  S HAST+VE+  G+ L A+FGG  EG P V I+L  F    W   
Sbjct: 21  VVQEHQIPA---PTASVHASTLVELQNGNLLAAWFGGIEEGDPSVGIYLSRFNGKVWSE- 76

Query: 71  IIADE--------EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
             A+E        +   P WNPVL +   N L L+YK+G   ++W   +K S ++G +WS
Sbjct: 77  --AEEVVLPSQAGDTTYPCWNPVLVRNKENTLFLYYKVGPNPREWWAQVKTSNDEGKSWS 134

Query: 123 EREQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIY 180
           E   LP G +GP + KP++L NG  L  SS E+   + W +  EV+ + G +W+    + 
Sbjct: 135 EARDLPEGHMGPVRVKPVMLANGDFLYPSSTETPETDYWSAHFEVSDNEGNNWQM---VP 191

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +  ++  +IQP     +   + +L RS +   RV  + S + G +W+    T L NPN+G
Sbjct: 192 MDCDTFQIIQPTILEFSEGKMLMLARSRH--NRVIAAYSHNYGNSWTKPFATNLPNPNAG 249

Query: 241 IDGVKLKDGRLLLAYNTVS------RGVLKVAL-SKDDGDSWHDALTLEENLAMEFSYPA 293
           ID VK+   R L+ YN +        G  K+ L S  DG  W + LTLE++   EFSYPA
Sbjct: 250 IDAVKVGKNRFLMVYNPLLAGKNWWEGRSKLVLGSSHDGLDWKEILTLEDHEKGEFSYPA 309

Query: 294 VIQASDGSVHITYTYNRTQIKHVVL 318
           +I+  DG +HI+YTY+R  IKH++L
Sbjct: 310 IIRTKDGKIHISYTYDRKFIKHLIL 334


>gi|299145502|ref|ZP_07038570.1| putative alpha-rhamnosidase [Bacteroides sp. 3_1_23]
 gi|298515993|gb|EFI39874.1| putative alpha-rhamnosidase [Bacteroides sp. 3_1_23]
          Length = 1286

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 128/379 (33%), Positives = 190/379 (50%), Gaps = 75/379 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGR- 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+ +  KD + 
Sbjct: 909  AILSDEFLY--KKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKDSKE 966

Query: 67   WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
            W +P +A        D +  +                        WNPVLF++P   L+L
Sbjct: 967  WTAPQLAADGVFSLKDSQAALAGIDSTCTPVKNEKGKLIARRKACWNPVLFQIPGGDLIL 1026

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1027 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSTEGS 1085

Query: 157  NSWGSWMEVTVDAGRSWRKYGPI----YIPNES------------------------LSV 188
            N W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1086 NGWRVHFEISDDKGKTWKMVGPLDAELSVPTQNRKKGDMNVDDQEGGEAIKGEGAKPVYA 1145

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   +V  + S D G TWS      + N NSG D V +KD
Sbjct: 1146 IQPSILKHKDGRLQILCRTRNA--QVATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1203

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1204 GRHILIYNNFSTLPGTPKGPRTPLCVAVS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 1262

Query: 300  GSVHITYTYNRTQIKHVVL 318
            G +H  YT+ R +IK+  +
Sbjct: 1263 GKLHAIYTWRRQRIKYAEI 1281


>gi|373853224|ref|ZP_09596023.1| BNR/Asp-box repeat domain protein [Opitutaceae bacterium TAV5]
 gi|372472751|gb|EHP32762.1| BNR/Asp-box repeat domain protein [Opitutaceae bacterium TAV5]
          Length = 337

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 131/330 (39%), Positives = 176/330 (53%), Gaps = 31/330 (9%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPI 71
           EF F   +AP  SCHAST+ E   G  + A+FGG  E  PDV IWL       G+W  P+
Sbjct: 13  EFIF--ETAPTPSCHASTLAETPHG-LVAAWFGGKRECFPDVGIWLARLDAATGQWTEPV 69

Query: 72  -IAD------EEPNV---PMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
            +A+      E+  V     WNPVLF+    P   L+LFYK+G+  Q W G M  S + G
Sbjct: 70  EVANGIQHTTEDGQVLRYACWNPVLFQPRHTPGAPLVLFYKVGKTPQTWWGMMTTSADGG 129

Query: 119 VTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKY 176
            TW+   +LP GI GP KNKP+ L +G  LC +S E      W    E T DAGR+W + 
Sbjct: 130 HTWASHRRLPEGICGPVKNKPVELPDGTWLCPASTEESVTTGWRVHFERTADAGRTWTRT 189

Query: 177 GPIYIPNESLS--VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
            P+   N+ +    IQP +  H   R L+ + R+   + R+  ++S D G TWS      
Sbjct: 190 APV---NDGIDYGAIQPSILRHDGGRRLQAIGRT-QLVKRLFSTDSLDAGRTWSEIAFLD 245

Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
           + NP+SG D V L DGR  L  N     R  L +ALS  DG  W   L LE     E+SY
Sbjct: 246 VPNPDSGTDAVTLADGRHALICNPTESGRSPLSLALSS-DGLRWTRVLDLETGPG-EYSY 303

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           PA+IQ +DG +H+T+T+ R +IKH  + P+
Sbjct: 304 PAIIQTADGHLHMTWTWKRERIKHAAIDPS 333


>gi|225165905|ref|ZP_03727673.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
 gi|224799856|gb|EEG18317.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
          Length = 381

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 172/323 (53%), Gaps = 34/323 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPI-IAD------EE 76
           SCHASTIVE  +G  + A+FGG  E  PDV IWL       GRW +P+ +A+      E+
Sbjct: 60  SCHASTIVETTQGELVAAWFGGKRECFPDVGIWLSRRDAITGRWSAPVEVANGIQHTTED 119

Query: 77  PNV---PMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
            N      WNPVLF+    P   L+LFYK+G+  Q W G M  S + G TW+   +LP G
Sbjct: 120 GNTLRYACWNPVLFQPRHTPGAPLVLFYKVGKTPQTWWGMMTTSTDGGRTWTAHRRLPEG 179

Query: 131 ILGPSKNKPILLENGLLLCGSSVES--WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
           I GP KNKPI L +G  LC +S E      W    E T DAGR+W +  P+   N+ +  
Sbjct: 180 ICGPVKNKPIELPDGTWLCPASTEESVTTGWRVHFERTDDAGRTWTRTAPV---NDGIDY 236

Query: 188 -VIQP-VPFHTANRTLRVLMRSFNGIGRVCMSE------SCDGGLTWSYAKPTQLLNPNS 239
             IQP +  H A  +     R    IGR  +++      S D G TWS      + NP+S
Sbjct: 237 GAIQPSILMHPAPASTTPPAR-LQAIGRTQLAKRLFSTTSLDAGRTWSEITFLDVPNPDS 295

Query: 240 GIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           G D V L+D R  L YN     R  L +ALS +DG  W   L LE     E+SYPA+IQ 
Sbjct: 296 GTDAVTLRDSRHALIYNPTESGRSPLSLALS-NDGLHWTRILDLETTPG-EYSYPAIIQT 353

Query: 298 SDGSVHITYTYNRTQIKHVVLLP 320
           +DG +H+T+T+ R +IKH  + P
Sbjct: 354 TDGLLHMTWTWRRERIKHASIDP 376


>gi|255691747|ref|ZP_05415422.1| putative alpha-rhamnosidase [Bacteroides finegoldii DSM 17565]
 gi|260622633|gb|EEX45504.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1289

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 125/381 (32%), Positives = 188/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 912  AILSDEFLY--EKAAFPECHGATIVEMKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 970  WSAPKLAADGVFLLKDPQAILAGIDSTCTPVKDAKGKLIARRKACWNPVLFQVPGGDLIL 1029

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  ++ G ++C SS E  
Sbjct: 1030 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPKGFLGPIKNKPEYID-GRIICPSSTEGS 1088

Query: 157  NSWGSWMEVTVDAGRSWRKYGPIY----IPNES------------------------LSV 188
              W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1089 KGWRVHFEISDDKGKTWKMVGPLEAELSVPTQNRKKGGINTDDQEGGEAIKGEGAKPIYA 1148

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   ++  + S D G TWS      + N NSG D V +KD
Sbjct: 1149 IQPSILKHKDGKLQILCRTRNA--QIATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1206

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG SW   LTLE++   ++SYP++I+  D
Sbjct: 1207 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGISWKPVLTLEDSPISQYSYPSIIEGKD 1265

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1266 GKLHAIYTWRRQRIKYAEIDP 1286


>gi|423299817|ref|ZP_17277842.1| hypothetical protein HMPREF1057_00983 [Bacteroides finegoldii
            CL09T03C10]
 gi|408473626|gb|EKJ92148.1| hypothetical protein HMPREF1057_00983 [Bacteroides finegoldii
            CL09T03C10]
          Length = 1289

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 125/381 (32%), Positives = 188/381 (49%), Gaps = 75/381 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 912  AILSDEFLY--EKAAFPECHGATIVEMKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +P   +                        WNPVLF++P   L+L
Sbjct: 970  WSAPKLAADGVFSLKDPQAILAGIDSTCTPVKDVKGKLIAHRKACWNPVLFQVPGGDLIL 1029

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG +V  W+G + RS + G TWS+RE LP G LGP KNKP  ++ G ++C SS E  
Sbjct: 1030 FYKIGLKVSDWTGWLVRSRDGGKTWSKREPLPKGFLGPIKNKPEYID-GRIICPSSTEGS 1088

Query: 157  NSWGSWMEVTVDAGRSWRKYGPIY----IPNES------------------------LSV 188
              W    E++ D G++W+  GP+     +P ++                        +  
Sbjct: 1089 KGWRVHFEISDDKGKTWKMVGPLEAELSVPTQNRKKGGINTDDQEGREAIKGEDAKPIYA 1148

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L++L R+ N   ++  + S D G TWS      + N NSG D V +KD
Sbjct: 1149 IQPSILKHKDGKLQILCRTRNA--QIATAWSSDNGDTWSKVTLLDVPNNNSGTDAVTMKD 1206

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG SW   LTLE++   ++SYP++I+  D
Sbjct: 1207 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGISWKPVLTLEDSPISQYSYPSIIEGKD 1265

Query: 300  GSVHITYTYNRTQIKHVVLLP 320
            G +H  YT+ R +IK+  + P
Sbjct: 1266 GKLHAIYTWRRQRIKYAGIDP 1286


>gi|386724698|ref|YP_006191024.1| hypothetical protein B2K_21575 [Paenibacillus mucilaginosus K02]
 gi|384091823|gb|AFH63259.1| hypothetical protein B2K_21575 [Paenibacillus mucilaginosus K02]
          Length = 332

 Score =  193 bits (490), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 122/321 (38%), Positives = 176/321 (54%), Gaps = 25/321 (7%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPI 71
           +E+ F  +  PF SCHASTI  +  G  + A+FGGS EGA DV IW+      G W +P 
Sbjct: 12  KEYIF-EDERPFASCHASTIELLPDGGAVAAWFGGSKEGAGDVAIWVAHRGAQGGWTAPR 70

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
            A   P  P WNPVLF+     LLL+YK G  ++ WS  +  S + G TWS   +L PG 
Sbjct: 71  QAAGYPATPHWNPVLFRSEDGTLLLYYKTGIRIEHWSTLVMESTDDGYTWSGPAELVPGD 130

Query: 132 ---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
               GP KNKP+ L +G +   +S+E   +W S+++++ D GR+W +   + +P +  + 
Sbjct: 131 RGGRGPVKNKPVRLADGTIAAPASLEP--AWDSFVDLSPDGGRTWNRSDTVPMPADRSAF 188

Query: 188 ----VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
               +IQP  + +A   + +L RS  G   +  S+S DGG TW  A  T L N NSG+D 
Sbjct: 189 LGKGIIQPSLWESAPGCVHMLTRSTAGA--IYRSDSADGGRTWCPAYATSLPNNNSGLDV 246

Query: 244 VKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
           VKL DG L L  N            R  L + LS+D+G SW+    ++E    ++SYPA+
Sbjct: 247 VKLADGTLALVCNPTRPEPGKLKGPRSPLVLLLSQDNGVSWYGETVIDEG-EKQYSYPAI 305

Query: 295 IQASDGSVHITYTYNRTQIKH 315
           + A  G VH+TYT+ R +I +
Sbjct: 306 V-AQGGRVHVTYTWKRERIAY 325


>gi|337748986|ref|YP_004643148.1| hypothetical protein KNP414_04748 [Paenibacillus mucilaginosus
           KNP414]
 gi|379721955|ref|YP_005314086.1| hypothetical protein PM3016_4159 [Paenibacillus mucilaginosus 3016]
 gi|336300175|gb|AEI43278.1| hypothetical protein KNP414_04748 [Paenibacillus mucilaginosus
           KNP414]
 gi|378570627|gb|AFC30937.1| hypothetical protein PM3016_4159 [Paenibacillus mucilaginosus 3016]
          Length = 332

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 121/321 (37%), Positives = 176/321 (54%), Gaps = 25/321 (7%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPI 71
           +E+ F  +  PF SCHASTI  +  G  + A+FGGS EGA DV IW+      G W +P 
Sbjct: 12  KEYIF-EDERPFASCHASTIELLPDGGAVAAWFGGSKEGAGDVAIWVAHRGAQGGWTAPR 70

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
           +A   P  P WNPVLF+     LLL+YK G  ++ WS  + +S + G TWS   +L PG 
Sbjct: 71  LAAGYPGTPHWNPVLFRREDGTLLLYYKTGTRIEHWSTLVMKSTDDGHTWSGPAELVPGD 130

Query: 132 ---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
               GP KNKP+ L +G +   +S+E   +W S+++++ D GR+W + G + +P +  + 
Sbjct: 131 RGGRGPVKNKPVRLADGTIAAPASLEP--AWDSFVDLSPDGGRTWTRSGDVPMPADRSAF 188

Query: 188 ----VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
               +IQP  + +    + +L RS  G   V  S+S DGG TW  A  T L N NSG+D 
Sbjct: 189 LGKGIIQPSLWESEPGCVHMLTRSTAGA--VYRSDSADGGRTWCPAYATSLPNNNSGLDV 246

Query: 244 VKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
            KL DG L L  N            R  L + LS+D+G SW+    ++E    ++SYPA+
Sbjct: 247 AKLGDGTLALVCNPTRPEPGKLKGPRSPLVLLLSQDNGVSWYGETVIDEG-EKQYSYPAI 305

Query: 295 IQASDGSVHITYTYNRTQIKH 315
           + A    VH+TYT+ R +I +
Sbjct: 306 V-ARGSRVHVTYTWKRERIAY 325


>gi|375254465|ref|YP_005013632.1| BNR/Asp-box repeat-containing protein [Tannerella forsythia ATCC
           43037]
 gi|363406763|gb|AEW20449.1| BNR/Asp-box repeat protein [Tannerella forsythia ATCC 43037]
          Length = 351

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 174/316 (55%), Gaps = 20/316 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI----- 71
           F   + PF+SCH+ST+VE++ G+ L A+FGG  EG+ DV IWL     G W SP      
Sbjct: 37  FVFENPPFQSCHSSTVVEMENGNVLCAWFGGKHEGSNDVSIWLSELSSGTWTSPTLVADG 96

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
           I ++      WNPVL++     L LFYK+G     W G  K S + G TWS+  +LP GI
Sbjct: 97  IQNDTTRYACWNPVLYR-DKTKLFLFYKVGTSPSTWWGEYKVSTDDGKTWSKVAKLPDGI 155

Query: 132 LGPSKNKPILLENGLLLCGSSVESWNS-WGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
           LGP KNK I +   +L   S+  + N+ W S++E++      W+   PI   + S  VIQ
Sbjct: 156 LGPIKNKAIRIGKTVLHPSSTETAGNTRWKSFIELSDRNLTQWKSV-PIDTAS-SYKVIQ 213

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
           P         ++ L+RS +    +  S S D G +W+    T +L+PNSGID V L +G 
Sbjct: 214 PTLLCYGENRIQALLRSDS--NEILQSWSNDNGQSWTPITGTGILHPNSGIDAVGLSNGV 271

Query: 251 LLLAYNTVSRG--------VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            LL YN   +G         L VA S  DG +W+D L LE++   EFSYPA+I+  +  +
Sbjct: 272 KLLVYNPAIKGENWWDGRSRLTVAESV-DGLTWNDILILEKHEKGEFSYPAIIRLKNDDI 330

Query: 303 HITYTYNRTQIKHVVL 318
            ITYTYNR +IK+V +
Sbjct: 331 LITYTYNRLKIKYVFI 346


>gi|150005175|ref|YP_001299919.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
 gi|149933599|gb|ABR40297.1| glycoside hydrolase family 78 [Bacteroides vulgatus ATCC 8482]
          Length = 1321

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 936  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 994  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315


>gi|423311882|ref|ZP_17289819.1| hypothetical protein HMPREF1058_00431 [Bacteroides vulgatus
            CL09T03C04]
 gi|392689261|gb|EIY82539.1| hypothetical protein HMPREF1058_00431 [Bacteroides vulgatus
            CL09T03C04]
          Length = 1324

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMNVPTALRKANAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|294777784|ref|ZP_06743230.1| BNR/Asp-box repeat protein [Bacteroides vulgatus PC510]
 gi|294448372|gb|EFG16926.1| BNR/Asp-box repeat protein [Bacteroides vulgatus PC510]
          Length = 1324

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKFRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKENAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|265750680|ref|ZP_06086743.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_33FAA]
 gi|263237576|gb|EEZ23026.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_33FAA]
          Length = 1321

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 936  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 994  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315


>gi|212694641|ref|ZP_03302769.1| hypothetical protein BACDOR_04172 [Bacteroides dorei DSM 17855]
 gi|212663142|gb|EEB23716.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides dorei DSM
            17855]
          Length = 1324

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|423239192|ref|ZP_17220308.1| hypothetical protein HMPREF1065_00931 [Bacteroides dorei CL03T12C01]
 gi|392646979|gb|EIY40685.1| hypothetical protein HMPREF1065_00931 [Bacteroides dorei CL03T12C01]
          Length = 1321

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 936  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 993

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 994  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1053

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1054 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1112

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1113 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1172

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1173 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1230

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1231 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1289

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1290 SIIQGKDGKLHAVYTWRRQRVAYKEL 1315


>gi|319642682|ref|ZP_07997328.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_40A]
 gi|345520277|ref|ZP_08799675.1| glycoside hydrolase family 78 [Bacteroides sp. 4_3_47FAA]
 gi|254836104|gb|EET16413.1| glycoside hydrolase family 78 [Bacteroides sp. 4_3_47FAA]
 gi|317385770|gb|EFV66703.1| glycoside hydrolase family 78 [Bacteroides sp. 3_1_40A]
          Length = 1324

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKENAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|380695289|ref|ZP_09860148.1| alpha-rhamnosidase [Bacteroides faecis MAJ27]
          Length = 1289

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 186/382 (48%), Gaps = 75/382 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 911  AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKAKGAKE 968

Query: 67   WQSPIIADE------EPNVPM------------------------WNPVLFKLPSNGLLL 96
            W +P +A +      +PN  +                        WNPVLF++P   L+L
Sbjct: 969  WTAPKLAADGVFALNDPNAEIAGIDTACTPVKDTKGKLTARRKACWNPVLFQIPGGDLIL 1028

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG  V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1029 FYKIGLSVADWTGWLVRSKDGGKTWSKREALPQGFLGPIKNKPEYI-NGRIICPSSREGK 1087

Query: 157  NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
              W    E + D G++W                RK G I + ++             +  
Sbjct: 1088 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGKGAQPILA 1147

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L+VL R+ N +  +  + S D G TW+    + + N NSG D V + D
Sbjct: 1148 IQPSILKHKDGRLQVLCRTRNAL--LATAWSSDNGETWNKVTLSNVPNNNSGTDAVTMSD 1205

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L VA+S +DG  W   LTLE++   ++SYP++IQ  D
Sbjct: 1206 GRHILIYNNFSTQPGTPKGPRTPLCVAIS-EDGIHWKPVLTLEDSPISQYSYPSIIQGKD 1264

Query: 300  GSVHITYTYNRTQIKHVVLLPN 321
            G +H  YT+ R +IK+  + P+
Sbjct: 1265 GKLHAIYTWRRQRIKYAEIDPS 1286


>gi|237711094|ref|ZP_04541575.1| glycoside hydrolase family 78 protein [Bacteroides sp. 9_1_42FAA]
 gi|229454938|gb|EEO60659.1| glycoside hydrolase family 78 protein [Bacteroides sp. 9_1_42FAA]
          Length = 1305

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 184/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 920  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 977

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 978  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1037

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  ++ G ++C
Sbjct: 1038 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVD-GRIIC 1096

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
             SS E    W    E+  D G++W+  GP+     +P                       
Sbjct: 1097 PSSTEGDGGWRIHFEILDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1156

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1157 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1214

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1215 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1273

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1274 SIIQGKDGKLHAVYTWRRQRVAYKEL 1299


>gi|187918984|ref|YP_001888015.1| hypothetical protein Bphyt_4268 [Burkholderia phytofirmans PsJN]
 gi|187717422|gb|ACD18645.1| conserved hypothetical protein [Burkholderia phytofirmans PsJN]
          Length = 366

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 175/325 (53%), Gaps = 22/325 (6%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            +V+ E   P  +  F  CHAST V   +GH + A+  GS E A DV +W+  F   +W 
Sbjct: 34  AIVSTEVVNPPQNIAF--CHAST-VAFSQGHLVAAWLAGSKEAANDVGVWVARFSGNQWS 90

Query: 69  SPI-IAD-EEPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
            P+ +AD   P+   + + NP+LF      L+LFY+ G+    W      S + G TW++
Sbjct: 91  PPVRVADGRSPDGEALTVINPILFSPKRGPLMLFYRRGKLPADWHPLRMTSLDGGATWTK 150

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPIYIP 182
              L PGI GP+K+KP+ L NG+++ GSS E ++ W    E ++D G +W   Y  + +P
Sbjct: 151 PVALDPGISGPAKDKPVELSNGVVIAGSSTE-YDGWKIHFERSMDGGNTWHVVYPAVGLP 209

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
             ++  IQP      +  L+ L+R+ +G   V  ++S D G TWS      + N NSG+D
Sbjct: 210 --TVQAIQPTILDHRHGQLQALVRTKSGF--VFSTKSSDWGKTWSALARLDIPNSNSGLD 265

Query: 243 GVKLKDGRLLLAYNTV-------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
            V L DGR L+  N +        R  L V +S D   ++ D L LE     EFSYPAVI
Sbjct: 266 AVTLTDGRDLIVTNPLPYVEGRWDRHKLSVLISADH-QTYRDVLDLENEAGQEFSYPAVI 324

Query: 296 QASDGSVHITYTYNRTQIKHVVLLP 320
           Q+ DG VHITYT+ +  IKHVVL P
Sbjct: 325 QSPDGMVHITYTWKKIYIKHVVLDP 349


>gi|345515271|ref|ZP_08794777.1| glycoside hydrolase family 78 protein [Bacteroides dorei 5_1_36/D4]
 gi|345455881|gb|EEO44380.2| glycoside hydrolase family 78 protein [Bacteroides dorei 5_1_36/D4]
          Length = 1324

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 126/386 (32%), Positives = 185/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE LP G LGP KNKP  +++  ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLPKGFLGPIKNKPEYVDD-RIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPI----YIPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|224535729|ref|ZP_03676268.1| hypothetical protein BACCELL_00593, partial [Bacteroides
           cellulosilyticus DSM 14838]
 gi|224522654|gb|EEF91759.1| hypothetical protein BACCELL_00593 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 1003

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 127/379 (33%), Positives = 184/379 (48%), Gaps = 75/379 (19%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 625 AILSDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 682

Query: 67  WQSPIIA--------DEEPNVP----------------------MWNPVLFKLPSNGLLL 96
           W SP +A        D +  +                        WNPVLF++P   L+L
Sbjct: 683 WTSPQLAADGVFSIKDAQATLAGIDSTCTPVTNARGKLVARRKACWNPVLFQIPGGDLIL 742

Query: 97  FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
           FYKIG  V  W+G + RS + G TWS+RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 743 FYKIGLNVGDWTGWLVRSRDGGKTWSKREPLPEGFLGPIKNKPEYI-NGRIICPSSREGG 801

Query: 157 NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
             W    E++ D G++W                RK G + + ++             +  
Sbjct: 802 KGWRIHFEISDDNGKTWKTTESLAAEFSVLTQHRKKGSVNVDDQEGGEAVKGEGPKPIYA 861

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           IQP      +  L+VL R+ N   R+  + S D G TWS      + N NSG D V L+D
Sbjct: 862 IQPSILMHKDGRLQVLCRTRN--ARIATAWSNDNGDTWSKVTLLDVPNNNSGTDAVTLQD 919

Query: 249 GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           GR +L YN  S         R  L VA+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 920 GRHVLIYNNFSTLPGTPKGPRTPLCVAIS-EDGINWQPVLTLEDSPISQYSYPSIIQGKD 978

Query: 300 GSVHITYTYNRTQIKHVVL 318
           G +H  YT+ R ++K+  +
Sbjct: 979 GRLHAVYTWRRQRVKYAEI 997


>gi|423228041|ref|ZP_17214447.1| hypothetical protein HMPREF1063_00267 [Bacteroides dorei CL02T00C15]
 gi|423243304|ref|ZP_17224380.1| hypothetical protein HMPREF1064_00586 [Bacteroides dorei CL02T12C06]
 gi|392637077|gb|EIY30952.1| hypothetical protein HMPREF1063_00267 [Bacteroides dorei CL02T00C15]
 gi|392645311|gb|EIY39039.1| hypothetical protein HMPREF1064_00586 [Bacteroides dorei CL02T12C06]
          Length = 1324

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/386 (32%), Positives = 184/386 (47%), Gaps = 82/386 (21%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            G+V ++F +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 939  GIVEDQFLY--EQASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGATE 996

Query: 67   WQSPIIADE------EPNVPM-------------------------------WNPVLFKL 89
            W +P +A +      +P   +                               WNPVLF++
Sbjct: 997  WSAPYLAADGVFSLDDPQAVLAGITAESTPADAGPVASTFKGDKSRARRKACWNPVLFQI 1056

Query: 90   PSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLC 149
            P   L+LFYKIG +V  WSG + RS + G TWS+RE L  G LGP KNKP  ++ G ++C
Sbjct: 1057 PGGDLILFYKIGLKVADWSGWLVRSKDGGKTWSQREPLLKGFLGPIKNKPEYVD-GRIIC 1115

Query: 150  GSSVESWNSWGSWMEVTVDAGRSWRKYGPIY----IPN---------------------- 183
             SS E    W    E++ D G++W+  GP+     +P                       
Sbjct: 1116 PSSTEGDGGWRIHFEISDDKGKTWKMVGPVEAEMSVPTALRKANAANVDDQEGGEAIKGE 1175

Query: 184  --ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              + +  IQP      +  L+VL R+ N   ++  S S D G TWS      + N NSG 
Sbjct: 1176 GEKPIYAIQPSILRHKDGRLQVLCRTRNA--QIATSWSSDNGETWSKVTLLDVPNNNSGT 1233

Query: 242  DGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
            D V +KDGR +L YN  S         R  L VA+S DDG  W + +TLE++   ++SYP
Sbjct: 1234 DAVTMKDGRHVLIYNDFSTLPGTPKGPRTPLCVAVS-DDGIHWKNVMTLEDSPISQYSYP 1292

Query: 293  AVIQASDGSVHITYTYNRTQIKHVVL 318
            ++IQ  DG +H  YT+ R ++ +  L
Sbjct: 1293 SIIQGKDGKLHAVYTWRRQRVAYKEL 1318


>gi|242241312|ref|YP_002989493.1| hypothetical protein Dd703_3918 [Dickeya dadantii Ech703]
 gi|242133369|gb|ACS87671.1| conserved hypothetical protein [Dickeya dadantii Ech703]
          Length = 350

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 173/345 (50%), Gaps = 43/345 (12%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
           V ++F  P +   F +CHASTI  +  G    A+F G  EG+ D  IWL  ++ G+W +P
Sbjct: 6   VKQQFVLPTDHRYFGNCHASTIAALPDGRLRAAWFAGEKEGSGDTAIWLANYQQGQWHAP 65

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
           +    E  +P WNPVL +     L LFYK+G +V  W   +  S + G++WS    L PG
Sbjct: 66  VRLAWEDGLPHWNPVLHR-QDGALWLFYKVGADVHHWQTRVMVSGDDGLSWSAPRMLVPG 124

Query: 131 I---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL- 186
                GP KNK +++ NG  L   SVE    W ++++++ D G  W+   PI + +++  
Sbjct: 125 DSAPRGPVKNKLLVMSNGEWLAPGSVEDDRDWDAFVDISGDRGEHWQAT-PIPLAHQAPG 183

Query: 187 ----------------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
                                        VIQP  + +A   + VLMRS  G   +  S+
Sbjct: 184 DGQHDALWQGLEHAALWENDLTRVFQWDGVIQPSAWESAPGQVHVLMRSTRGA--IYRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
           S D G  W  A  T L N NSGID V L  GRL+L YN V     SR  L  A S D+G+
Sbjct: 242 SPDYGRRWCDAYATALPNNNSGIDVVHLGAGRLVLVYNPVTGNWRSRYPLTAAYSADNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            W + + LE+    EFSYPA+I   D ++H+TYT+NR  I +  L
Sbjct: 302 HWENLIDLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCAL 344


>gi|440751400|ref|ZP_20930633.1| hypothetical protein C943_3326 [Mariniradius saccharolyticus AK6]
 gi|436480042|gb|ELP36307.1| hypothetical protein C943_3326 [Mariniradius saccharolyticus AK6]
          Length = 348

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 176/319 (55%), Gaps = 16/319 (5%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V  EF F  +  PF  CHASTI ++  G +++A+F GS E   DV IW    K G W +
Sbjct: 25  IVKCEFIF-GDERPFPQCHASTIEKLGNGEYILAWFAGSHEKNDDVAIWTAKGKPGAWTA 83

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P    +  N P WNPVLF  P   L+L++K+G+E+  W   +  S +KG TWS  ++L  
Sbjct: 84  PKKLAKVRNDPHWNPVLFSAPDGRLVLYFKVGKEIDDWETWVMESLDKGETWSAPKELVA 143

Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
           G     GP +N  ++L +G  L  +SVE    W ++++ + DAG+SW++   + +    +
Sbjct: 144 GDRGGRGPVRNHILVLSDGTWLAPASVEKNRVWNAFVDFSTDAGKSWQRTDTLALDRHLV 203

Query: 187 ---SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
               VIQP  + +    + +L+RS    G++  +ES D G TWS    T L N NSGID 
Sbjct: 204 LGEGVIQPALWESEPGHIHMLLRS--STGKIARAESKDFGRTWSPVYLTDLPNNNSGIDL 261

Query: 244 VKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQA 297
           VK+  G + L YN VS     R  +++ALS D+G +W     +E      EFSYP++I+ 
Sbjct: 262 VKMPSGVIALVYNPVSGNWGKRYPMQLALSLDNGVTWPIQQEIEAGSGEDEFSYPSIIE- 320

Query: 298 SDGSVHITYTYNRTQIKHV 316
            +G +   YT+NR +I+ V
Sbjct: 321 DEGDLVFCYTWNRQRIRFV 339


>gi|29346423|ref|NP_809926.1| alpha-rhamnosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29338319|gb|AAO76120.1| putative alpha-rhamnosidase [Bacteroides thetaiotaomicron VPI-5482]
          Length = 1290

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 182/379 (48%), Gaps = 75/379 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 912  AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969

Query: 67   WQSPIIADE-------------------EPNVPM-----------WNPVLFKLPSNGLLL 96
            W +P +A +                    P V             WNPVLF++P   L+L
Sbjct: 970  WTAPKLAADGVFSLKDSQAALAGIDSTCTPVVDAKGKLTARRKACWNPVLFQIPGGDLIL 1029

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG  V  W+G + RS + G TW +RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1030 FYKIGLNVGDWTGWLVRSKDGGKTWGKREALPEGFLGPIKNKPEYI-NGRIICPSSREGK 1088

Query: 157  NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
              W    E + D G++W                RK G I + ++             +  
Sbjct: 1089 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGEGAQPILA 1148

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L+VL R+ N   ++  S S D G TWS    + + N NSG D V + D
Sbjct: 1149 IQPSILKHKDGRLQVLCRTRN--AKIATSWSSDNGETWSKVTLSNVPNNNSGTDAVTMSD 1206

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L +A+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1207 GRHILIYNHFSTLPGTPKGPRTPLCIAIS-EDGINWKPILTLEDSPISQYSYPSIIQGKD 1265

Query: 300  GSVHITYTYNRTQIKHVVL 318
            G +H  YT+ R +IK+  +
Sbjct: 1266 GKLHAIYTWRRQRIKYTEI 1284


>gi|383122653|ref|ZP_09943345.1| hypothetical protein BSIG_0602 [Bacteroides sp. 1_1_6]
 gi|382984354|gb|EES70329.2| hypothetical protein BSIG_0602 [Bacteroides sp. 1_1_6]
          Length = 1290

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 182/379 (48%), Gaps = 75/379 (19%)

Query: 9    GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
             ++++EF +    A F  CH +TIVE+  G  + ++FGG+ E  PD  IW+     G   
Sbjct: 912  AIISDEFLY--EKASFPECHGATIVELKNGDLVASFFGGTKERNPDCCIWVCRKPKGAKE 969

Query: 67   WQSPIIADE-------------------EPNVPM-----------WNPVLFKLPSNGLLL 96
            W +P +A +                    P V             WNPVLF++P   L+L
Sbjct: 970  WTAPKLAADGVFSLKDSQAALAGIDSTCTPVVDAKGKLTARRKACWNPVLFQIPGGDLIL 1029

Query: 97   FYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
            FYKIG  V  W+G + RS + G TW +RE LP G LGP KNKP  + NG ++C SS E  
Sbjct: 1030 FYKIGLNVGDWTGWLVRSKDGGKTWGKREALPEGFLGPIKNKPEYI-NGRIICPSSREGK 1088

Query: 157  NSWGSWMEVTVDAGRSW----------------RKYGPIYIPNES------------LSV 188
              W    E + D G++W                RK G I + ++             +  
Sbjct: 1089 GGWRIHFEYSDDKGKTWKTTESVPAELSVPTQNRKKGGINVDDQEAGEAIQGEGAQPILA 1148

Query: 189  IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
            IQP      +  L+VL R+ N   ++  S S D G TWS    + + N NSG D V + D
Sbjct: 1149 IQPSILKHKDGRLQVLCRTRN--AKIATSWSSDNGETWSKVTLSNVPNNNSGTDAVTMSD 1206

Query: 249  GRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
            GR +L YN  S         R  L +A+S +DG +W   LTLE++   ++SYP++IQ  D
Sbjct: 1207 GRHILIYNHFSTLPGTPKGPRTPLCIAIS-EDGINWKPILTLEDSPISQYSYPSIIQGKD 1265

Query: 300  GSVHITYTYNRTQIKHVVL 318
            G +H  YT+ R +IK+  +
Sbjct: 1266 GKLHAIYTWRRQRIKYTEI 1284


>gi|404451851|ref|ZP_11016803.1| hypothetical protein A33Q_21047 [Indibacter alkaliphilus LW1]
 gi|403762440|gb|EJZ23505.1| hypothetical protein A33Q_21047 [Indibacter alkaliphilus LW1]
          Length = 347

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 112/319 (35%), Positives = 175/319 (54%), Gaps = 16/319 (5%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V+E F F  +  PF  CHAST+  +  G F++A+F G+ E A DV IW+   +DG W +
Sbjct: 26  VVSEGFVF-GDERPFDQCHASTLESLGDGQFMIAWFAGTEEKADDVGIWISKGRDGDWTA 84

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P +  +      WNPVLF  P   L L++K+G+E+  W   +  S + G TWSE  +L  
Sbjct: 85  PELVAKVREDAHWNPVLFNAPDGKLYLYFKVGKEIDHWETWVMESRDNGSTWSEPVELVE 144

Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
           G     GP +N  ++L +G  L  +S+E    W ++++ + D G++W     + +  E +
Sbjct: 145 GDRGGRGPVRNHVLVLFDGTWLAPASLEKNRVWNAFVDRSEDEGKTWVNSETLILDREKI 204

Query: 187 ---SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
               VIQP  + +A   + +L+R+    G++  S+S DGGLTWS  + T L N NSGID 
Sbjct: 205 PGEGVIQPALWESAALYVHMLLRT--SAGKIGRSDSHDGGLTWSPVELTDLPNNNSGIDI 262

Query: 244 VKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEE-NLAMEFSYPAVIQA 297
             +    L L YN V      R  +++A+S D+G +W     +EE N   EFSYP+++  
Sbjct: 263 AHISGDTLALIYNPVGENWGKRYPMELAVSYDNGKTWPLKQVIEEGNGQDEFSYPSILY- 321

Query: 298 SDGSVHITYTYNRTQIKHV 316
            +G +   YT+NR +IK V
Sbjct: 322 DNGELVFCYTWNREKIKFV 340


>gi|271502586|ref|YP_003335612.1| hypothetical protein Dd586_4080 [Dickeya dadantii Ech586]
 gi|270346141|gb|ACZ78906.1| conserved hypothetical protein [Dickeya dadantii Ech586]
          Length = 348

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 176/346 (50%), Gaps = 43/346 (12%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
              ++  P     F +CHAST+V +  G   VA+F G  EG+ D  IWL + + GRW  P
Sbjct: 6   TTRQYVLPEGHDYFGNCHASTVVALPGGRLRVAWFAGEKEGSGDTAIWLASHEQGRWSPP 65

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
           +    E  +  WNPVL    +  L LFYK+G +V  W   ++ S ++G +WS    L PG
Sbjct: 66  VRVAWEDGLAHWNPVL-HWQAGTLWLFYKVGADVHHWQTRVQLSTDEGASWSAPRPLVPG 124

Query: 131 I---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN---- 183
                GP KNK +++ NG  L  +SVE    W ++++++ D G+ W+   PI + +    
Sbjct: 125 DSAPRGPVKNKLLVMSNGEWLAPASVEDDRHWDAFVDLSADQGQRWQT-APIPLAHRLPG 183

Query: 184 ----ESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
               E+L                      VIQP  + ++   + VLMRS  G   +  S+
Sbjct: 184 EHGREALWSGLEQAALWENDLTRVFQWDGVIQPSAWESSPGRVHVLMRSTRGA--LYRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G  W  A    L N NSG+D V L  GRL+L YN V+     R  L VA S D+G+
Sbjct: 242 SDDYGRHWCEAYAVDLPNNNSGVDLVHLGAGRLVLVYNPVTGNWHRRYPLAVACSTDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
            W DAL LE+    EFSYPA+I   D ++H+TYT+NR  I +  L+
Sbjct: 302 HWQDALCLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCALI 345


>gi|420156344|ref|ZP_14663187.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
 gi|394757642|gb|EJF40659.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
          Length = 351

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 177/343 (51%), Gaps = 38/343 (11%)

Query: 1   MKKDCGIKGLVA--EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW 58
           MK D  I GL    E+ +      PF+SCHAS++  +  G F   +F GS EG  DV IW
Sbjct: 1   MKTD--IAGLFRQIEKESILGEQPPFESCHASSLAILPNGDFFCVWFAGSREGEDDVAIW 58

Query: 59  LQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKG 118
                 G WQ P I  ++  +P WNPVLF      + LF+KIG+E+  W      S++  
Sbjct: 59  GARRSGGSWQKPQIL-QQNGLPHWNPVLFVQKDGSVWLFFKIGREISSWRTWASVSHDGC 117

Query: 119 VTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
            TWS   +L PG +   GP +NK I+L +G  L  +S+E+   W S+ + + D G++W K
Sbjct: 118 HTWSPARELVPGDVGGRGPVRNKLIVLSSGRWLAPASLEN-GQWRSFADRSDDEGKTWSK 176

Query: 176 YGPIYIP-----NESLS---------------VIQPVPFHTANRTLRVLMRSFNGIGRVC 215
              ++IP      +SL+                IQP  + +A   + +LMR+  G   + 
Sbjct: 177 SSEVFIPLAGTGTKSLTYQEIPVSEQSFTGRGAIQPTLWESAPGKVHMLMRTSEGF--IA 234

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKD 270
            S+S DGG TW     T L N NSGID  +  DGRL L YN V      R  L +ALS+D
Sbjct: 235 RSDSQDGGETWCEGYATSLPNNNSGIDLDRTADGRLFLVYNPVGESWGPRFPLHLALSED 294

Query: 271 DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           +G +W D  TLEE    E+SYP+V   SD S+ +TYT  R  +
Sbjct: 295 NGQTWEDIYTLEEAEG-EYSYPSV-NYSDNSLFVTYTNCRKDV 335


>gi|307128843|ref|YP_003880859.1| hypothetical protein Dda3937_02530 [Dickeya dadantii 3937]
 gi|306526372|gb|ADM96302.1| Putative cytoplasmic protein [Dickeya dadantii 3937]
          Length = 374

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 122/347 (35%), Positives = 175/347 (50%), Gaps = 43/347 (12%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +  ++F  P     F +CHAST+  + +G   VA+F G  EG+ D  IWL   + GRW  
Sbjct: 31  MTTQQFVLPEGHDYFGNCHASTVAVLPEGRLRVAWFAGEKEGSGDTAIWLACAEQGRWLQ 90

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P+    E  V  WNPVL    +  L LFYK+G +V  W   +  S + G +WS    L P
Sbjct: 91  PVRVAWEDGVAHWNPVL-HWQAGTLWLFYKVGADVHHWQTRVAVSTDDGASWSAPRLLVP 149

Query: 130 GI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN--- 183
           G     GP KNK +++ NG  L  +SVE    W ++++++ D GRSW+   PI + +   
Sbjct: 150 GDSAPRGPVKNKLLVMSNGEWLAPASVEDDRHWDAFVDLSGDRGRSWQT-APIPLTHRTP 208

Query: 184 -----ESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
                E+L                      VIQP  + ++   + VLMRS  G   +  S
Sbjct: 209 GERDGEALWQGLEQAALWENDLTRVFQWDGVIQPSAWESSPGQVHVLMRSTRGA--LYRS 266

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G  W  A    L N NSG+D   L  GRL+L YN V+     R  L VA S D+G
Sbjct: 267 DSDDYGRRWREAYAIDLPNNNSGVDLAHLGAGRLVLVYNPVTGNWRHRYPLTVACSTDNG 326

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
           + W +++ LE+    EFSYPA+I   D ++H+TYT+NR  I +  L+
Sbjct: 327 EHWENSIDLEQEPG-EFSYPAIIADGD-TLHVTYTWNRKNIVYCALI 371


>gi|168467297|ref|ZP_02701139.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Newport str. SL317]
 gi|418761271|ref|ZP_13317416.1| hypothetical protein SEEN185_01701 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|418768828|ref|ZP_13324872.1| hypothetical protein SEEN199_18734 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|418769767|ref|ZP_13325794.1| hypothetical protein SEEN539_09873 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|418775994|ref|ZP_13331943.1| hypothetical protein SEEN953_12207 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|418780335|ref|ZP_13336224.1| hypothetical protein SEEN188_02327 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|418786235|ref|ZP_13342055.1| hypothetical protein SEEN559_06356 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|418802426|ref|ZP_13358053.1| hypothetical protein SEEN202_07479 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
 gi|419787803|ref|ZP_14313510.1| hypothetical protein SEENLE01_16150 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419792175|ref|ZP_14317818.1| hypothetical protein SEENLE15_23177 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|195630395|gb|EDX49021.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Newport str. SL317]
 gi|392619298|gb|EIX01683.1| hypothetical protein SEENLE01_16150 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392619559|gb|EIX01943.1| hypothetical protein SEENLE15_23177 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392730828|gb|EIZ88068.1| hypothetical protein SEEN199_18734 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|392739213|gb|EIZ96352.1| hypothetical protein SEEN539_09873 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|392740889|gb|EIZ98004.1| hypothetical protein SEEN185_01701 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|392746627|gb|EJA03633.1| hypothetical protein SEEN953_12207 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|392749249|gb|EJA06227.1| hypothetical protein SEEN559_06356 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|392749385|gb|EJA06362.1| hypothetical protein SEEN188_02327 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|392777439|gb|EJA34122.1| hypothetical protein SEEN202_07479 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
          Length = 347

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/346 (35%), Positives = 171/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G  V  W      S ++G TWS    L 
Sbjct: 64  TPQRVVAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            G +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I A  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344


>gi|406662497|ref|ZP_11070592.1| putative neuraminidase (sialidase) [Cecembia lonarensis LW9]
 gi|405553570|gb|EKB48775.1| putative neuraminidase (sialidase) [Cecembia lonarensis LW9]
          Length = 350

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 175/322 (54%), Gaps = 16/322 (4%)

Query: 7   IKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR 66
           + G+   EF F     PF+ CHAS+I  +  G +++ +F G+ E   DV IW+   + G 
Sbjct: 29  VPGIEHFEFVFEG-EMPFEQCHASSIEVLGDGRYMMVWFAGTHEKNDDVGIWMAQGRPGN 87

Query: 67  WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ 126
           W  P +  +  N   WNPVLF+ P+  L L++K+G+E+  W      S + G TWS  E+
Sbjct: 88  WSFPKLLVKVRNDAHWNPVLFQNPAGRLFLYFKVGKEIDDWETWYIHSDDLGNTWSNPEE 147

Query: 127 LPPGI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
           L PG     GP +N  ++L NG  L  +S+E    W ++++ + D G++W K   + +  
Sbjct: 148 LMPGDQGGRGPVRNHVLVLSNGTWLAPASIEKNRVWNAFVDASSDYGKTWVKSDTLELDR 207

Query: 184 ESLS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           + ++   VIQP  + +    + +L+R+    G++  S+S D G TWS    T L N NSG
Sbjct: 208 QLINGEGVIQPALWESCPGQVHMLLRT--STGKIGRSDSNDYGETWSPVYLTDLPNNNSG 265

Query: 241 IDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLE-ENLAMEFSYPAV 294
           ID   +    L LAYN V     SR  + +A+S+D+GDSW   + +E      EFSYP++
Sbjct: 266 IDVAHMGGKSLALAYNPVGKNWGSRYPMTLAISEDNGDSWPFKIDIEVGKEGNEFSYPSL 325

Query: 295 IQASDGSVHITYTYNRTQIKHV 316
           +  + G + + YT+NRT I+ V
Sbjct: 326 LYEA-GHLVLCYTWNRTNIRFV 346


>gi|167553718|ref|ZP_02347464.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Saintpaul str. SARA29]
 gi|418846517|ref|ZP_13401286.1| hypothetical protein SEEN443_16062 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|418858518|ref|ZP_13413132.1| hypothetical protein SEEN470_02240 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|418864008|ref|ZP_13418544.1| hypothetical protein SEEN536_15791 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|418868680|ref|ZP_13423121.1| hypothetical protein SEEN176_02779 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
 gi|205321879|gb|EDZ09718.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Saintpaul str. SARA29]
 gi|392810390|gb|EJA66410.1| hypothetical protein SEEN443_16062 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|392831857|gb|EJA87484.1| hypothetical protein SEEN536_15791 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|392832489|gb|EJA88109.1| hypothetical protein SEEN470_02240 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|392837370|gb|EJA92940.1| hypothetical protein SEEN176_02779 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
          Length = 347

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 171/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|161503647|ref|YP_001570759.1| hypothetical protein SARI_01730 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160864994|gb|ABX21617.1| hypothetical protein SARI_01730 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 347

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 124/346 (35%), Positives = 172/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   PA+ A    CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQKIIPASRATPFQCHASTLVCLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
            P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS   +L 
Sbjct: 64  KPQRVAACEGEAHWNPVLFS-PSDRLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPTELV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW-------RKYGP 178
            G +   GP+KNK +L  NG  +   S+E+   W ++++ + D G+ W         +  
Sbjct: 123 KGDILPRGPAKNKLLLASNGAWIAPGSIENAEHWEAFVDRSCDEGKHWDISFVPLEPHNM 182

Query: 179 IYIPNESL---------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           I   N +L                      VIQP  + ++   + +L+RS  G   V  S
Sbjct: 183 ISAKNVALWEGIKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--VFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S D+G
Sbjct: 241 DSTDYGATWSVARATALPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASCDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            SW   L LE N   E+SYPA+I   +G VHITYT+NR  I +  L
Sbjct: 301 TSWLPLLELESNRG-EYSYPAIIN-EEGVVHITYTWNRKNIVYCRL 344


>gi|168239713|ref|ZP_02664771.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. SL480]
 gi|194736503|ref|YP_002114266.1| hypothetical protein SeSA_A1340 [Salmonella enterica subsp.
           enterica serovar Schwarzengrund str. CVM19633]
 gi|194712005|gb|ACF91226.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|197287621|gb|EDY27013.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. SL480]
          Length = 347

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 173/346 (50%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +   F  CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI----Y 180
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W   + P+     
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADNA 182

Query: 181 IPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           IP  +++                       VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 IPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V + DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|29141600|ref|NP_804942.1| hypothetical protein t1130 [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213053004|ref|ZP_03345882.1| hypothetical protein Salmoneentericaenterica_08861 [Salmonella
           enterica subsp. enterica serovar Typhi str. E00-7866]
 gi|213421059|ref|ZP_03354125.1| hypothetical protein Salmonentericaenterica_26257 [Salmonella
           enterica subsp. enterica serovar Typhi str. E01-6750]
 gi|213622722|ref|ZP_03375505.1| hypothetical protein SentesTyp_36516 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-2068]
 gi|213646491|ref|ZP_03376544.1| hypothetical protein SentesTy_03656 [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|213855360|ref|ZP_03383600.1| hypothetical protein SentesT_15367 [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
 gi|289828543|ref|ZP_06546385.1| hypothetical protein Salmonellentericaenterica_18887 [Salmonella
           enterica subsp. enterica serovar Typhi str. E98-3139]
 gi|378959292|ref|YP_005216778.1| BNR/Asp-box repeat domain-containing protein [Salmonella enterica
           subsp. enterica serovar Typhi str. P-stx-12]
 gi|29137228|gb|AAO68791.1| hypothetical protein t1130 [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|374353164|gb|AEZ44925.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
          Length = 347

 Score =  174 bits (441), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G  V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 344


>gi|224584240|ref|YP_002638038.1| hypothetical protein SPC_2491 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|375114152|ref|ZP_09759322.1| Neuraminidase [Salmonella enterica subsp. enterica serovar
           Choleraesuis str. SCSA50]
 gi|224468767|gb|ACN46597.1| hypothetical protein SPC_2491 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|322714298|gb|EFZ05869.1| Neuraminidase [Salmonella enterica subsp. enterica serovar
           Choleraesuis str. SCSA50]
          Length = 347

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I A  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344


>gi|167771446|ref|ZP_02443499.1| hypothetical protein ANACOL_02812 [Anaerotruncus colihominis DSM
           17241]
 gi|167666086|gb|EDS10216.1| BNR/Asp-box repeat protein [Anaerotruncus colihominis DSM 17241]
          Length = 398

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 116/343 (33%), Positives = 172/343 (50%), Gaps = 49/343 (14%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
           + ++   P +   F +CH+STIV+V K  +L AY  G CEG PD+ IWL    DG W  P
Sbjct: 56  INQQLIMPEDQTWFGNCHSSTIVKVGKNEYLCAYMAGECEGKPDMAIWLSRCVDGIWLQP 115

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG 130
                    P WNPVL+      + L +K+G  V  W   + +S + G TWS   +  PG
Sbjct: 116 EKIMGVYRFPHWNPVLY-FDGTTVTLIFKVGPSVPLWYSMISQSKDLGKTWSVPREAVPG 174

Query: 131 ILGP---SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP-NESL 186
              P   S+NK I   NG L    S+E+   W S+++++ D GR+W K+    IP N  L
Sbjct: 175 DYQPRITSRNKIIRGSNGYLYGPCSIETEKYWDSYIDISKDNGRTWSKHT---IPFNHDL 231

Query: 187 ------------------------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
                                          +IQP  + ++      L+RS  G  R+  
Sbjct: 232 RQEDKNGGSWNGLAQGALWENDLATVLKWDGIIQPTIWESSAGIFHALLRSTRG--RIYR 289

Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
           ++S D G TW  A PT + N NSGID  K+ +G L+LAYN +S     R  + +++S+D+
Sbjct: 290 TDSTDYGDTWCEAYPTDIPNNNSGIDIAKMDNGTLVLAYNPISGNWSARSPISLSISEDN 349

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGS-VHITYTYNRTQI 313
           G+++   + LE     EFSYPA++  +DGS +++TYTY R  I
Sbjct: 350 GNTFSQPVHLETKDG-EFSYPAIL--ADGSHIYMTYTYKRKSI 389


>gi|417334319|ref|ZP_12117570.1| hypothetical protein LTSEALA_2795 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
 gi|353575510|gb|EHC38228.1| hypothetical protein LTSEALA_2795 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
          Length = 347

 Score =  173 bits (439), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRKVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAASEGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I A  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344


>gi|161614362|ref|YP_001588327.1| hypothetical protein SPAB_02110 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|161363726|gb|ABX67494.1| hypothetical protein SPAB_02110 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
          Length = 347

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G  V  W      S ++G TWS    L 
Sbjct: 64  TPQRVVAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            G +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKRVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|423139679|ref|ZP_17127317.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. houtenae
           str. ATCC BAA-1581]
 gi|379052233|gb|EHY70124.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. houtenae
           str. ATCC BAA-1581]
          Length = 347

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 175/346 (50%), Gaps = 43/346 (12%)

Query: 10  LVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           LV  +   P + + PF+ CHAST+V +  G  + A+F G  EG+ D  IWL  +    W 
Sbjct: 5   LVNRQVILPESGNEPFQ-CHASTLVRLPCGTLVAAWFAGLWEGSEDTAIWLSRYDHSIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
            P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS   +L 
Sbjct: 64  KPQRVTVREGKAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSPPTELV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            G +   GP KNK +L  NG+ +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGVWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWEGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G+TWS A+ T L N NSGID V ++DG L+LA N V+     R  L + +S ++G
Sbjct: 241 DSVDYGVTWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGRRYPLSLIVSHNNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 KSWLPLLDLESDRG-EYSYPAII-SEGGIVHITYTWNRKNIVYCRL 344


>gi|62179834|ref|YP_216251.1| hypothetical protein SC1264 [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
 gi|62127467|gb|AAX65170.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
          Length = 347

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 169/345 (48%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W  +++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRVFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SETNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I A  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAIISAG-GVVHITYTWNRKNIVYCRL 344


>gi|200391019|ref|ZP_03217630.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
 gi|199603464|gb|EDZ02010.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
          Length = 347

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGKLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLVRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|168264223|ref|ZP_02686196.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Hadar str. RI_05P066]
 gi|205347291|gb|EDZ33922.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Hadar str. RI_05P066]
          Length = 347

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLITSQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|168244341|ref|ZP_02669273.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL486]
 gi|194449596|ref|YP_002045253.1| hypothetical protein SeHA_C1371 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. SL476]
 gi|386591110|ref|YP_006087510.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|419728984|ref|ZP_14255945.1| hypothetical protein SEEH1579_06336 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419734605|ref|ZP_14261494.1| hypothetical protein SEEH1563_08194 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419738478|ref|ZP_14265240.1| hypothetical protein SEEH1573_12664 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419744710|ref|ZP_14271363.1| hypothetical protein SEEH1566_17357 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419747768|ref|ZP_14274271.1| hypothetical protein SEEH1565_01244 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|421570695|ref|ZP_16016380.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00322]
 gi|421575919|ref|ZP_16021525.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00325]
 gi|421580796|ref|ZP_16026350.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00326]
 gi|421586419|ref|ZP_16031900.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00328]
 gi|194407900|gb|ACF68119.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|205336718|gb|EDZ23482.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL486]
 gi|381297283|gb|EIC38377.1| hypothetical protein SEEH1563_08194 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381297687|gb|EIC38773.1| hypothetical protein SEEH1579_06336 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381302276|gb|EIC43320.1| hypothetical protein SEEH1573_12664 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381307908|gb|EIC48756.1| hypothetical protein SEEH1566_17357 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|381316479|gb|EIC57228.1| hypothetical protein SEEH1565_01244 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|383798154|gb|AFH45236.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|402519291|gb|EJW26654.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00326]
 gi|402519872|gb|EJW27227.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00325]
 gi|402523275|gb|EJW30593.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00322]
 gi|402527818|gb|EJW35076.1| Neuraminidase virulence factor [Salmonella enterica subsp. enterica
           serovar Heidelberg str. CFSAN00328]
          Length = 347

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|207857245|ref|YP_002243896.1| hypothetical protein SEN1799 [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|421358257|ref|ZP_15808555.1| hypothetical protein SEEE3139_09409 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421364478|ref|ZP_15814710.1| hypothetical protein SEEE0166_17747 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421366733|ref|ZP_15816935.1| hypothetical protein SEEE0631_06073 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421373445|ref|ZP_15823585.1| hypothetical protein SEEE0424_17144 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421377170|ref|ZP_15827269.1| hypothetical protein SEEE3076_13088 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421381669|ref|ZP_15831724.1| hypothetical protein SEEE4917_12838 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421385349|ref|ZP_15835371.1| hypothetical protein SEEE6622_08654 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421390323|ref|ZP_15840298.1| hypothetical protein SEEE6670_10927 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421393583|ref|ZP_15843527.1| hypothetical protein SEEE6426_04619 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421398371|ref|ZP_15848279.1| hypothetical protein SEEE6437_06551 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421404183|ref|ZP_15854027.1| hypothetical protein SEEE7246_13025 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421409694|ref|ZP_15859484.1| hypothetical protein SEEE7250_18127 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421413417|ref|ZP_15863171.1| hypothetical protein SEEE1427_14046 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418729|ref|ZP_15868430.1| hypothetical protein SEEE2659_18131 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421422405|ref|ZP_15872073.1| hypothetical protein SEEE1757_13914 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421426358|ref|ZP_15875986.1| hypothetical protein SEEE5101_11107 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421432891|ref|ZP_15882459.1| hypothetical protein SEEE8B1_21287 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421434896|ref|ZP_15884442.1| hypothetical protein SEEE5518_08095 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421442212|ref|ZP_15891672.1| hypothetical protein SEEE1618_22209 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421444503|ref|ZP_15893933.1| hypothetical protein SEEE3079_10672 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|421448192|ref|ZP_15897587.1| hypothetical protein SEEE6482_06556 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|436615807|ref|ZP_20514272.1| hypothetical protein SEE22704_12964 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436661040|ref|ZP_20517144.1| hypothetical protein SEE30663_03338 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436802173|ref|ZP_20525236.1| hypothetical protein SEECHS44_18315 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436809155|ref|ZP_20528535.1| hypothetical protein SEEE1882_12004 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436815089|ref|ZP_20532640.1| hypothetical protein SEEE1884_09883 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436844512|ref|ZP_20538270.1| hypothetical protein SEEE1594_15593 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436854157|ref|ZP_20543791.1| hypothetical protein SEEE1566_20694 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436857445|ref|ZP_20545965.1| hypothetical protein SEEE1580_09000 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436864618|ref|ZP_20550585.1| hypothetical protein SEEE1543_09785 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436873818|ref|ZP_20556542.1| hypothetical protein SEEE1441_17432 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436877984|ref|ZP_20558839.1| hypothetical protein SEEE1810_06327 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436888475|ref|ZP_20564804.1| hypothetical protein SEEE1558_13714 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436895742|ref|ZP_20568498.1| hypothetical protein SEEE1018_09447 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436901624|ref|ZP_20572534.1| hypothetical protein SEEE1010_07259 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436912337|ref|ZP_20578166.1| hypothetical protein SEEE1729_13185 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436922269|ref|ZP_20584494.1| hypothetical protein SEEE0895_22380 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436926994|ref|ZP_20586820.1| hypothetical protein SEEE0899_11154 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436936288|ref|ZP_20591728.1| hypothetical protein SEEE1457_13246 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436943478|ref|ZP_20596424.1| hypothetical protein SEEE1747_14387 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436951034|ref|ZP_20600089.1| hypothetical protein SEEE0968_10029 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436961641|ref|ZP_20605015.1| hypothetical protein SEEE1444_12060 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436970765|ref|ZP_20609158.1| hypothetical protein SEEE1445_10221 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436982073|ref|ZP_20613569.1| hypothetical protein SEEE1559_09954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436994712|ref|ZP_20618980.1| hypothetical protein SEEE1565_14509 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437001793|ref|ZP_20621072.1| hypothetical protein SEEE1808_02374 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437020015|ref|ZP_20627166.1| hypothetical protein SEEE1811_10403 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437033926|ref|ZP_20632810.1| hypothetical protein SEEE0956_16111 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437045625|ref|ZP_20637923.1| hypothetical protein SEEE1455_19166 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437053838|ref|ZP_20642637.1| hypothetical protein SEEE1575_20376 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437058808|ref|ZP_20645655.1| hypothetical protein SEEE1725_13019 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437070369|ref|ZP_20651547.1| hypothetical protein SEEE1745_20038 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437076498|ref|ZP_20654861.1| hypothetical protein SEEE1791_13902 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437081140|ref|ZP_20657592.1| hypothetical protein SEEE1795_05026 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437091697|ref|ZP_20663297.1| hypothetical protein SEEE6709_11337 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437101910|ref|ZP_20666359.1| hypothetical protein SEEE9058_03884 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437121633|ref|ZP_20671827.1| hypothetical protein SEEE0816_08842 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437131102|ref|ZP_20677232.1| hypothetical protein SEEE0819_13345 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437138854|ref|ZP_20681336.1| hypothetical protein SEEE3072_11262 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437145709|ref|ZP_20685616.1| hypothetical protein SEEE3089_10037 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437156786|ref|ZP_20692322.1| hypothetical protein SEEE9163_21197 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437159253|ref|ZP_20693767.1| hypothetical protein SEEE151_05614 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437166082|ref|ZP_20697867.1| hypothetical protein SEEEN202_03741 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437178723|ref|ZP_20704841.1| hypothetical protein SEEE3991_16458 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437187795|ref|ZP_20710076.1| hypothetical protein SEEE3618_20478 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437198442|ref|ZP_20711381.1| hypothetical protein SEEE1831_04112 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437258728|ref|ZP_20716648.1| hypothetical protein SEEE2490_04544 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437268297|ref|ZP_20721767.1| hypothetical protein SEEEL909_07913 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437277135|ref|ZP_20726654.1| hypothetical protein SEEEL913_09775 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437293445|ref|ZP_20732160.1| hypothetical protein SEEE4941_15102 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437312214|ref|ZP_20736322.1| hypothetical protein SEEE7015_13535 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437328108|ref|ZP_20740890.1| hypothetical protein SEEE7927_13718 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437343667|ref|ZP_20745835.1| hypothetical protein SEEECHS4_15995 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437373562|ref|ZP_20749657.1| hypothetical protein SEEE2558_14484 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22558]
 gi|437404159|ref|ZP_20752017.1| hypothetical protein SEEE2217_01700 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437443993|ref|ZP_20758159.1| hypothetical protein SEEE4018_10092 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437460590|ref|ZP_20761544.1| hypothetical protein SEEE6211_04232 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437481393|ref|ZP_20768925.1| hypothetical protein SEEE4441_18971 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437494631|ref|ZP_20772574.1| hypothetical protein SEEE4647_14758 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437503932|ref|ZP_20774946.1| hypothetical protein SEEE9845_04038 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437540090|ref|ZP_20782264.1| hypothetical protein SEEE9317_18535 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437560982|ref|ZP_20786266.1| hypothetical protein SEEE0116_15785 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437577678|ref|ZP_20791027.1| hypothetical protein SEEE1117_16834 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437586346|ref|ZP_20793307.1| hypothetical protein SEEE1392_05599 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437601775|ref|ZP_20798007.1| hypothetical protein SEEE0268_06583 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437612714|ref|ZP_20801340.1| hypothetical protein SEEE0316_00512 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437632540|ref|ZP_20806510.1| hypothetical protein SEEE0436_03904 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437658095|ref|ZP_20811426.1| hypothetical protein SEEE1319_05243 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437669255|ref|ZP_20815457.1| hypothetical protein SEEE4481_02719 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437697048|ref|ZP_20822711.1| hypothetical protein SEEE6297_16475 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437709448|ref|ZP_20826116.1| hypothetical protein SEEE4220_10960 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437815168|ref|ZP_20842438.1| hypothetical protein SEEE3944_23353 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|437869314|ref|ZP_20848301.1| hypothetical protein SEEE5621_06740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|437982824|ref|ZP_20853362.1| hypothetical protein SEEE5646_05468 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|438087086|ref|ZP_20859233.1| hypothetical protein SEEE2625_08452 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438099815|ref|ZP_20863559.1| hypothetical protein SEEE1976_07464 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438110647|ref|ZP_20868045.1| hypothetical protein SEEE3407_07431 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|445169197|ref|ZP_21395136.1| hypothetical protein SEE8A_002845 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445198050|ref|ZP_21400936.1| hypothetical protein SE20037_15191 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445314862|ref|ZP_21411932.1| hypothetical protein SEE436_024495 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
 gi|445328324|ref|ZP_21412930.1| hypothetical protein SEE18569_010557 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445353163|ref|ZP_21421055.1| hypothetical protein SEE13_020140 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445363842|ref|ZP_21424765.1| hypothetical protein SEE23_013035 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|206709048|emb|CAR33379.1| hypothetical protein SEN1799 [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|395983967|gb|EJH93157.1| hypothetical protein SEEE0166_17747 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395988561|gb|EJH97717.1| hypothetical protein SEEE3139_09409 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395989388|gb|EJH98522.1| hypothetical protein SEEE0631_06073 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395996564|gb|EJI05609.1| hypothetical protein SEEE0424_17144 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|396000792|gb|EJI09806.1| hypothetical protein SEEE3076_13088 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396001632|gb|EJI10644.1| hypothetical protein SEEE4917_12838 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396014133|gb|EJI23019.1| hypothetical protein SEEE6670_10927 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396016786|gb|EJI25653.1| hypothetical protein SEEE6622_08654 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396017466|gb|EJI26331.1| hypothetical protein SEEE6426_04619 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396024991|gb|EJI33775.1| hypothetical protein SEEE7250_18127 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396027263|gb|EJI36027.1| hypothetical protein SEEE7246_13025 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396031444|gb|EJI40171.1| hypothetical protein SEEE6437_06551 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396038007|gb|EJI46651.1| hypothetical protein SEEE2659_18131 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396040505|gb|EJI49129.1| hypothetical protein SEEE1427_14046 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396041720|gb|EJI50343.1| hypothetical protein SEEE1757_13914 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396049107|gb|EJI57650.1| hypothetical protein SEEE8B1_21287 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396053865|gb|EJI62358.1| hypothetical protein SEEE5101_11107 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396059277|gb|EJI67732.1| hypothetical protein SEEE5518_08095 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396062889|gb|EJI71300.1| hypothetical protein SEEE1618_22209 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396066934|gb|EJI75294.1| hypothetical protein SEEE3079_10672 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|396073792|gb|EJI82092.1| hypothetical protein SEEE6482_06556 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|434957577|gb|ELL51206.1| hypothetical protein SEECHS44_18315 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434966972|gb|ELL59807.1| hypothetical protein SEEE1882_12004 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434967625|gb|ELL60431.1| hypothetical protein SEE22704_12964 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434973205|gb|ELL65593.1| hypothetical protein SEEE1884_09883 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434979098|gb|ELL71090.1| hypothetical protein SEEE1594_15593 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434982960|gb|ELL74768.1| hypothetical protein SEEE1566_20694 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434989597|gb|ELL81147.1| hypothetical protein SEEE1580_09000 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434995653|gb|ELL86969.1| hypothetical protein SEEE1543_09785 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|434998575|gb|ELL89796.1| hypothetical protein SEEE1441_17432 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435007921|gb|ELL98748.1| hypothetical protein SEEE1810_06327 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435010185|gb|ELM00971.1| hypothetical protein SEEE1558_13714 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435015631|gb|ELM06157.1| hypothetical protein SEEE1018_09447 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435020063|gb|ELM10488.1| hypothetical protein SEE30663_03338 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435021058|gb|ELM11447.1| hypothetical protein SEEE1010_07259 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435024587|gb|ELM14793.1| hypothetical protein SEEE0895_22380 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435026582|gb|ELM16713.1| hypothetical protein SEEE1729_13185 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435036835|gb|ELM26654.1| hypothetical protein SEEE0899_11154 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435039126|gb|ELM28907.1| hypothetical protein SEEE1457_13246 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435043677|gb|ELM33394.1| hypothetical protein SEEE1747_14387 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435050780|gb|ELM40284.1| hypothetical protein SEEE1444_12060 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435051501|gb|ELM41003.1| hypothetical protein SEEE0968_10029 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435057054|gb|ELM46423.1| hypothetical protein SEEE1445_10221 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435063943|gb|ELM53090.1| hypothetical protein SEEE1565_14509 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435066327|gb|ELM55415.1| hypothetical protein SEEE1559_09954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435074664|gb|ELM63488.1| hypothetical protein SEEE1808_02374 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076816|gb|ELM65598.1| hypothetical protein SEEE0956_16111 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435079652|gb|ELM68347.1| hypothetical protein SEEE1811_10403 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435079913|gb|ELM68607.1| hypothetical protein SEEE1455_19166 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435088852|gb|ELM77307.1| hypothetical protein SEEE1575_20376 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435090340|gb|ELM78742.1| hypothetical protein SEEE1745_20038 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435094621|gb|ELM82960.1| hypothetical protein SEEE1725_13019 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435105795|gb|ELM93832.1| hypothetical protein SEEE1791_13902 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435111759|gb|ELM99647.1| hypothetical protein SEEE1795_05026 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435112603|gb|ELN00468.1| hypothetical protein SEEE6709_11337 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435123889|gb|ELN11380.1| hypothetical protein SEEE9058_03884 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435125076|gb|ELN12532.1| hypothetical protein SEEE0819_13345 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435125720|gb|ELN13157.1| hypothetical protein SEEE0816_08842 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435132376|gb|ELN19574.1| hypothetical protein SEEE3072_11262 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435135393|gb|ELN22502.1| hypothetical protein SEEE9163_21197 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435137170|gb|ELN24241.1| hypothetical protein SEEE3089_10037 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435150186|gb|ELN36870.1| hypothetical protein SEEE151_05614 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435152140|gb|ELN38770.1| hypothetical protein SEEE3991_16458 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435153439|gb|ELN40047.1| hypothetical protein SEEEN202_03741 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435156727|gb|ELN43215.1| hypothetical protein SEEE3618_20478 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435165975|gb|ELN51977.1| hypothetical protein SEEE2490_04544 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435173321|gb|ELN58831.1| hypothetical protein SEEEL913_09775 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435174476|gb|ELN59918.1| hypothetical protein SEEEL909_07913 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435180884|gb|ELN65989.1| hypothetical protein SEEE4941_15102 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435183346|gb|ELN68321.1| hypothetical protein SEEE7015_13535 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435191342|gb|ELN75908.1| hypothetical protein SEEE7927_13718 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435191444|gb|ELN76001.1| hypothetical protein SEEECHS4_15995 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435205577|gb|ELN89166.1| hypothetical protein SEEE2217_01700 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435205632|gb|ELN89218.1| hypothetical protein SEEE2558_14484 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22558]
 gi|435210693|gb|ELN93931.1| hypothetical protein SEEE4018_10092 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435217808|gb|ELO00223.1| hypothetical protein SEEE4441_18971 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435220882|gb|ELO03156.1| hypothetical protein SEEE6211_04232 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435222329|gb|ELO04443.1| hypothetical protein SEEE1831_04112 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435227148|gb|ELO08671.1| hypothetical protein SEEE4647_14758 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435236459|gb|ELO17194.1| hypothetical protein SEEE9845_04038 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435237453|gb|ELO18132.1| hypothetical protein SEEE9317_18535 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435238308|gb|ELO18957.1| hypothetical protein SEEE0116_15785 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435242620|gb|ELO22924.1| hypothetical protein SEEE1117_16834 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435260304|gb|ELO39515.1| hypothetical protein SEEE1392_05599 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435260787|gb|ELO39977.1| hypothetical protein SEEE0268_06583 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435265287|gb|ELO44168.1| hypothetical protein SEEE0316_00512 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435270790|gb|ELO49275.1| hypothetical protein SEEE1319_05243 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435276636|gb|ELO54638.1| hypothetical protein SEEE6297_16475 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435282233|gb|ELO59857.1| hypothetical protein SEEE4481_02719 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435282893|gb|ELO60492.1| hypothetical protein SEEE0436_03904 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435289569|gb|ELO66522.1| hypothetical protein SEEE4220_10960 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435295413|gb|ELO71875.1| hypothetical protein SEEE3944_23353 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435318294|gb|ELO91235.1| hypothetical protein SEEE2625_08452 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435325413|gb|ELO97278.1| hypothetical protein SEEE1976_07464 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435331854|gb|ELP02952.1| hypothetical protein SEEE3407_07431 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|435335251|gb|ELP05536.1| hypothetical protein SEEE5621_06740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|435337895|gb|ELP07341.1| hypothetical protein SEEE5646_05468 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|444863100|gb|ELX87933.1| hypothetical protein SE20037_15191 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444863705|gb|ELX88522.1| hypothetical protein SEE8A_002845 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444871931|gb|ELX96317.1| hypothetical protein SEE436_024495 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
 gi|444873340|gb|ELX97641.1| hypothetical protein SEE13_020140 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444880101|gb|ELY04185.1| hypothetical protein SEE18569_010557 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444883553|gb|ELY07432.1| hypothetical protein SEE23_013035 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
          Length = 347

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|205353019|ref|YP_002226820.1| hypothetical protein SG1872 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|375123847|ref|ZP_09769011.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|445131150|ref|ZP_21381649.1| hypothetical protein SEEG9184_020752 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|205272800|emb|CAR37723.1| hypothetical protein SG1872 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|326628097|gb|EGE34440.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Gallinarum str. SG9]
 gi|444850990|gb|ELX76086.1| hypothetical protein SEEG9184_020752 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
          Length = 347

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTKSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|421884817|ref|ZP_16316022.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Senftenberg str. SS209]
 gi|379985531|emb|CCF88295.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Senftenberg str. SS209]
          Length = 347

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|417518819|ref|ZP_12181108.1| hypothetical protein LTSEUGA_2789 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
 gi|353648337|gb|EHC91256.1| hypothetical protein LTSEUGA_2789 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
          Length = 347

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS    G +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRS--TCGAIFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLITSQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|437725350|ref|ZP_20829766.1| hypothetical protein SEEE1616_06119, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
 gi|435292569|gb|ELO69326.1| hypothetical protein SEEE1616_06119, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
          Length = 351

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 9   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 68

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 69  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 127

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 128 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 187

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 188 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 245

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 246 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 305

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 306 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 348


>gi|197251200|ref|YP_002146789.1| hypothetical protein SeAg_B1929 [Salmonella enterica subsp.
           enterica serovar Agona str. SL483]
 gi|440761730|ref|ZP_20940799.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH11G1113]
 gi|440767598|ref|ZP_20946574.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH08SF124]
 gi|440774047|ref|ZP_20952935.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH10GFN094]
 gi|197214903|gb|ACH52300.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|436413565|gb|ELP11498.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH10GFN094]
 gi|436419507|gb|ELP17382.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH08SF124]
 gi|436424510|gb|ELP22281.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Agona str. SH11G1113]
          Length = 347

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|194444968|ref|YP_002040506.1| hypothetical protein SNSL254_A1356 [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
 gi|417366046|ref|ZP_12138473.1| hypothetical protein LTSEHVI_2715 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
 gi|418795705|ref|ZP_13351406.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|418798443|ref|ZP_13354120.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|418808791|ref|ZP_13364344.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|418812947|ref|ZP_13368468.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|418816972|ref|ZP_13372460.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|418820413|ref|ZP_13375846.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|418824215|ref|ZP_13379586.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
 gi|418832840|ref|ZP_13387774.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|418835003|ref|ZP_13389902.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|418839690|ref|ZP_13394522.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|418852273|ref|ZP_13406977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|418855503|ref|ZP_13410159.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19593]
 gi|194403631|gb|ACF63853.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|353592952|gb|EHC50836.1| hypothetical protein LTSEHVI_2715 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
 gi|392758233|gb|EJA15108.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|392767100|gb|EJA23872.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|392774173|gb|EJA30868.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|392775474|gb|EJA32166.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|392789140|gb|EJA45660.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 21538]
 gi|392792682|gb|EJA49136.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Newport str. CVM 22425]
 gi|392796910|gb|EJA53238.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|392804284|gb|EJA60449.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|392812134|gb|EJA68129.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|392815413|gb|EJA71355.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|392821561|gb|EJA77385.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19593]
 gi|392824194|gb|EJA79982.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
          Length = 347

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|437781881|ref|ZP_20836562.1| hypothetical protein SEEE2651_18221, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 76-2651]
 gi|435299905|gb|ELO76014.1| hypothetical protein SEEE2651_18221, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 76-2651]
          Length = 341

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 118/342 (34%), Positives = 169/342 (49%), Gaps = 41/342 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNLVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVY 341


>gi|16760641|ref|NP_456258.1| hypothetical protein STY1869 [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|25512485|pir||AE0716 hypothetical protein STY1869 [imported] - Salmonella enterica
           subsp. enterica serovar Typhi (strain CT18)
 gi|16502937|emb|CAD02102.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhi]
          Length = 347

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 169/346 (48%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G  V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +    P KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRSPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 344


>gi|16764604|ref|NP_460219.1| hypothetical protein STM1252 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|167991947|ref|ZP_02573046.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|197265228|ref|ZP_03165302.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|374980252|ref|ZP_09721582.1| hypothetical protein SEE_02317 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|378444682|ref|YP_005232314.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. D23580]
 gi|378449728|ref|YP_005237087.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|378699142|ref|YP_005181099.1| hypothetical protein SL1344_1188 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|378983810|ref|YP_005246965.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|378988594|ref|YP_005251758.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|379700422|ref|YP_005242150.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|383495967|ref|YP_005396656.1| hypothetical protein UMN798_1299 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|422025401|ref|ZP_16371834.1| hypothetical protein B571_06201 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422030405|ref|ZP_16376607.1| hypothetical protein B572_06153 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427548970|ref|ZP_18927144.1| hypothetical protein B576_06301 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427564625|ref|ZP_18931847.1| hypothetical protein B577_05655 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427584472|ref|ZP_18936644.1| hypothetical protein B573_05696 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427606886|ref|ZP_18941458.1| hypothetical protein B574_05724 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427632073|ref|ZP_18946406.1| hypothetical protein B575_06292 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427655242|ref|ZP_18951163.1| hypothetical protein B578_05907 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427660578|ref|ZP_18956069.1| hypothetical protein B579_06527 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427666430|ref|ZP_18960841.1| hypothetical protein B580_06089 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|16419768|gb|AAL20178.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|197243483|gb|EDY26103.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|205329805|gb|EDZ16569.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|261246461|emb|CBG24271.1| hypothetical protein STMMW_12591 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. D23580]
 gi|267993106|gb|ACY87991.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. 14028S]
 gi|301157790|emb|CBW17283.1| hypothetical protein SL1344_1188 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|312912238|dbj|BAJ36212.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. T000240]
 gi|321223872|gb|EFX48935.1| hypothetical protein SEE_02317 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|323129521|gb|ADX16951.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|332988141|gb|AEF07124.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. UK-1]
 gi|380462788|gb|AFD58191.1| hypothetical protein UMN798_1299 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|414020699|gb|EKT04275.1| hypothetical protein B571_06201 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414020749|gb|EKT04322.1| hypothetical protein B576_06301 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414022231|gb|EKT05722.1| hypothetical protein B572_06153 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414034715|gb|EKT17633.1| hypothetical protein B577_05655 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414035915|gb|EKT18761.1| hypothetical protein B573_05696 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414039437|gb|EKT22108.1| hypothetical protein B574_05724 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414049051|gb|EKT31277.1| hypothetical protein B578_05907 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414050524|gb|EKT32695.1| hypothetical protein B575_06292 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414055087|gb|EKT37007.1| hypothetical protein B579_06527 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414060598|gb|EKT42107.1| hypothetical protein B580_06089 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
          Length = 347

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGIVHITYTWNRKNIVYCRL 344


>gi|375001658|ref|ZP_09725998.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. enterica
           serovar Infantis str. SARB27]
 gi|353076346|gb|EHB42106.1| BNR/Asp-box repeat protein [Salmonella enterica subsp. enterica
           serovar Infantis str. SARB27]
          Length = 347

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 174/347 (50%), Gaps = 43/347 (12%)

Query: 9   GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
            LV  +   P + + PF+ CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W
Sbjct: 4   ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62

Query: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
            +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS+   L
Sbjct: 63  TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSKPAPL 121

Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
               +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ 
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDN 181

Query: 185 SLS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
           ++S                            VIQP  + ++   + +L+RS  G   +  
Sbjct: 182 AISGINVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239

Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
           S+S D G TWS A+ T L N NSGID V + DG L+LA N V+     R  L +  S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSLIASQDN 299

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 300 GESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|416422214|ref|ZP_11690118.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 315996572]
 gi|416431233|ref|ZP_11695456.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-1]
 gi|416441288|ref|ZP_11701500.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-3]
 gi|416446574|ref|ZP_11705164.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-4]
 gi|416451993|ref|ZP_11708660.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 515920-1]
 gi|416458994|ref|ZP_11713503.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 515920-2]
 gi|416467786|ref|ZP_11717635.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 531954]
 gi|416479548|ref|ZP_11722357.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. NC_MB110209-0054]
 gi|416489424|ref|ZP_11726188.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|416497443|ref|ZP_11729711.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|416507413|ref|ZP_11735361.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. SARB31]
 gi|416523558|ref|ZP_11741193.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. ATCC BAA710]
 gi|416528495|ref|ZP_11743945.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. LQC 10]
 gi|416535613|ref|ZP_11747867.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. SARB30]
 gi|416542799|ref|ZP_11751799.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 19N]
 gi|416553920|ref|ZP_11757948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 29N]
 gi|416558037|ref|ZP_11759980.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 42N]
 gi|416571596|ref|ZP_11766830.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 4441 H]
 gi|416576252|ref|ZP_11768939.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 81038-01]
 gi|416585953|ref|ZP_11775253.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|416590965|ref|ZP_11778140.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 414877]
 gi|416599002|ref|ZP_11783353.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 366867]
 gi|416607919|ref|ZP_11788913.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 413180]
 gi|416611186|ref|ZP_11790616.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 446600]
 gi|416624269|ref|ZP_11797925.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 609458-1]
 gi|416629135|ref|ZP_11799899.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 556150-1]
 gi|416639489|ref|ZP_11804558.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 609460]
 gi|416650788|ref|ZP_11810553.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 507440-20]
 gi|416653554|ref|ZP_11811923.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 556152]
 gi|416665961|ref|ZP_11817112.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB101509-0077]
 gi|416685717|ref|ZP_11825075.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|416697216|ref|ZP_11828141.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB110209-0055]
 gi|416707208|ref|ZP_11832306.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB111609-0052]
 gi|416714504|ref|ZP_11837822.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 2009083312]
 gi|416717061|ref|ZP_11839342.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 2009085258]
 gi|416725187|ref|ZP_11845557.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 315731156]
 gi|416731619|ref|ZP_11849475.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2009159199]
 gi|416736115|ref|ZP_11851832.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008282]
 gi|416748761|ref|ZP_11858977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008283]
 gi|416759859|ref|ZP_11864667.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|416761919|ref|ZP_11865969.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008285]
 gi|416767302|ref|ZP_11869799.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008287]
 gi|417463557|ref|ZP_12164709.1| hypothetical protein LTSEMON_2682 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|418485908|ref|ZP_13054890.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 80959-06]
 gi|418486692|ref|ZP_13055640.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035278]
 gi|418495457|ref|ZP_13061899.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035318]
 gi|418499249|ref|ZP_13065658.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035320]
 gi|418503128|ref|ZP_13069497.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035321]
 gi|418506461|ref|ZP_13072793.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035327]
 gi|418527050|ref|ZP_13093007.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008286]
 gi|322616641|gb|EFY13550.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 315996572]
 gi|322619844|gb|EFY16718.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-1]
 gi|322622412|gb|EFY19257.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-3]
 gi|322627936|gb|EFY24726.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 495297-4]
 gi|322632966|gb|EFY29709.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 515920-1]
 gi|322636788|gb|EFY33491.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 515920-2]
 gi|322641401|gb|EFY38040.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 531954]
 gi|322645176|gb|EFY41705.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. NC_MB110209-0054]
 gi|322650117|gb|EFY46531.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|322655691|gb|EFY51993.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|322660015|gb|EFY56254.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 19N]
 gi|322665417|gb|EFY61605.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 81038-01]
 gi|322667605|gb|EFY63766.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|322673601|gb|EFY69703.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 414877]
 gi|322677527|gb|EFY73591.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 366867]
 gi|322679808|gb|EFY75847.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 413180]
 gi|322687281|gb|EFY83253.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 446600]
 gi|323192398|gb|EFZ77628.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 609458-1]
 gi|323199476|gb|EFZ84569.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 556150-1]
 gi|323203664|gb|EFZ88686.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 609460]
 gi|323214133|gb|EFZ98893.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 556152]
 gi|323217769|gb|EGA02484.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB101509-0077]
 gi|323218920|gb|EGA03431.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|323224991|gb|EGA09247.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB110209-0055]
 gi|323229572|gb|EGA13695.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB111609-0052]
 gi|323232795|gb|EGA16891.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 2009083312]
 gi|323240167|gb|EGA24211.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 2009085258]
 gi|323242846|gb|EGA26867.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 315731156]
 gi|323247778|gb|EGA31717.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2009159199]
 gi|323254338|gb|EGA38155.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008282]
 gi|323256116|gb|EGA39853.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008283]
 gi|323259438|gb|EGA43074.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|323267034|gb|EGA50519.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008285]
 gi|323272138|gb|EGA55552.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008287]
 gi|353631044|gb|EHC78435.1| hypothetical protein LTSEMON_2682 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|363549711|gb|EHL34044.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. ATCC BAA710]
 gi|363553423|gb|EHL37671.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. SARB31]
 gi|363553812|gb|EHL38058.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. LQC 10]
 gi|363562106|gb|EHL46212.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 29N]
 gi|363565821|gb|EHL49845.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. SARB30]
 gi|363574126|gb|EHL57999.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 4441 H]
 gi|363577604|gb|EHL61424.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 42N]
 gi|366055798|gb|EHN20133.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 80959-06]
 gi|366059313|gb|EHN23587.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035318]
 gi|366071784|gb|EHN35878.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035320]
 gi|366072817|gb|EHN36900.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035278]
 gi|366074852|gb|EHN38914.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035321]
 gi|366083181|gb|EHN47108.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. CT_02035327]
 gi|366827670|gb|EHN54568.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. 507440-20]
 gi|372204519|gb|EHP18046.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008286]
          Length = 347

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 170/346 (49%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|168821866|ref|ZP_02833866.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Weltevreden str. HI_N05-537]
 gi|409250454|ref|YP_006886265.1| hypothetical protein SENTW_1976 [Salmonella enterica subsp.
           enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|205341660|gb|EDZ28424.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Weltevreden str. HI_N05-537]
 gi|320086282|emb|CBY96056.1| hypothetical protein SENTW_1976 [Salmonella enterica subsp.
           enterica serovar Weltevreden str. 2007-60-3289-1]
          Length = 347

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 41/345 (11%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTGSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ ++
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAI 183

Query: 187 S----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
           S                            VIQP  + ++   + +L+RS  G   +  S+
Sbjct: 184 SGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSD 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGD 273
           S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+
Sbjct: 242 SIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGE 301

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 302 SWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|204930845|ref|ZP_03221718.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
 gi|452120576|ref|YP_007470824.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Javiana str. CFSAN001992]
 gi|204320304|gb|EDZ05508.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
 gi|451909580|gb|AGF81386.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Javiana str. CFSAN001992]
          Length = 347

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 122/347 (35%), Positives = 177/347 (51%), Gaps = 43/347 (12%)

Query: 9   GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
            LV  +   P + + PF+ CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W
Sbjct: 4   ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62

Query: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
            +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L
Sbjct: 63  TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPL 121

Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI---- 179
               +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W   + P+    
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADN 181

Query: 180 YIPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
            IP  +++                       VIQP  + ++   + +L+RS  G   +  
Sbjct: 182 AIPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239

Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
           S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDN 299

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 300 GESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|418512678|ref|ZP_13078918.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Pomona str. ATCC 10729]
 gi|366083506|gb|EHN47427.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Pomona str. ATCC 10729]
          Length = 347

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 174/347 (50%), Gaps = 43/347 (12%)

Query: 9   GLVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67
            LV  +   P + + PF+ CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W
Sbjct: 4   ALVNRQVILPESGTEPFQ-CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIW 62

Query: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
            +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L
Sbjct: 63  TTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPL 121

Query: 128 PPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
               +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ 
Sbjct: 122 VNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDN 181

Query: 185 SLS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
           ++S                            VIQP  + ++   + +L+RS  G   +  
Sbjct: 182 AISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFR 239

Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDD 271
           S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+
Sbjct: 240 SDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDN 299

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 300 GESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|437817680|ref|ZP_20842764.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SARB17]
 gi|435308158|gb|ELO83156.1| cytoplasmic protein [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SARB17]
          Length = 347

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 168/346 (48%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV      P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRHVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVTLWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASHDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|343084744|ref|YP_004774039.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342353278|gb|AEL25808.1| hypothetical protein Cycma_2061 [Cyclobacterium marinum DSM 745]
          Length = 368

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 165/317 (52%), Gaps = 16/317 (5%)

Query: 10  LVAEEFTFP-ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           L  EE+        PF  CHASTI  +  G++L A+F GS E   DV IW+   K G W 
Sbjct: 45  LTTEEYEHIFGEDRPFAQCHASTITGLGDGNYLAAWFAGSHEKHNDVGIWVSKGKAGDWS 104

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
            P +  +  N P WNPVLF  P   + L++K+G+E+  W   ++ + + G TWSE  +L 
Sbjct: 105 EPELLVKVRNEPHWNPVLFNAPDGKVYLYFKVGKEIDYWETWVQYTEDGGETWSEARELV 164

Query: 129 PGI---LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
           PG     GP +N  ++L +G  L  +S E    W  +++ + D G++W     + +    
Sbjct: 165 PGDKGGRGPVRNHMLVLSDGTWLAPASDEKNKVWTVFVDRSEDGGKTWTATEKLDMDRSV 224

Query: 186 LS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           ++   VIQP  + +    + +LMRS +  G +C S+S D G TWS     +L N NSGID
Sbjct: 225 ITGEGVIQPALWESKPGHVHMLMRSSS--GNICRSDSEDYGKTWSKVTEIELPNNNSGID 282

Query: 243 GVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLA-MEFSYPAVIQ 296
              ++  ++ L YN V+     R  + +A+S+D+G +W     +E+     E SYPA+  
Sbjct: 283 VAHIEGEKIALIYNPVAENWGDRFPISIAVSEDNGKTWPLKFEIEKGEGDDELSYPAMFY 342

Query: 297 ASDGSVHITYTYNRTQI 313
             DG +   YT+NR  +
Sbjct: 343 -EDGHLVACYTWNRETV 358


>gi|238910930|ref|ZP_04654767.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Tennessee str. CDC07-0191]
          Length = 347

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 172/347 (49%), Gaps = 43/347 (12%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNLWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISETNVALWDGIKKGVLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V +++G L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQNGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I  S+GS VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDRG-EYSYPAII--SEGSVVHITYTWNRKNIVYCRL 344


>gi|412993057|emb|CCO16590.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
          Length = 490

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 131/398 (32%), Positives = 195/398 (48%), Gaps = 88/398 (22%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVD---------KGHFLVAYFGGSCEGAPDVKIWL 59
            +++ +F FP  + P  S H++TI+ +D         +   L ++F G+ E APDV I+ 
Sbjct: 94  AIISSQFIFPVGTYP--SSHSATIIALDNEEEEEEGGRYRLLASWFAGTEENAPDVAIYT 151

Query: 60  QTF--KDGRWQS-PIIA-------DEEPNV------------PMWNPVLFKL-PSNGLLL 96
            T+  +   WQS P +A       D    V             +WNPVL K   +N + L
Sbjct: 152 STYDSRTSSWQSIPDVAVSAFRPDDSSCAVISGGEGCKLRENSLWNPVLVKENENNHVAL 211

Query: 97  FYKIGQEVQKWSGCMKRSYNKGVTWSER-EQLPPGILGPSKNKPILLENGLLLCGSSVES 155
           FYK G+    W G +KRS + G TW ++ E+LP  I+GP+KNK + + + L L  SS E+
Sbjct: 212 FYKAGKHPSIWEGYVKRSKDDGKTWEQQAEKLPRNIIGPAKNKILEVNDRLWLAPSSREA 271

Query: 156 -----WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRS--- 207
                  +W   +E + D G+SW     + +P +  + IQP  +   +  LR++ R+   
Sbjct: 272 NAKDGSRNWKLIVERSRDGGKSWDPEENVEVPFDG-NAIQPSMWRGDDGYLRMVARTATD 330

Query: 208 FNGIGR------------------VCMSESCDG-GLTWSYAKPTQLLNPNSGIDGVKLKD 248
           ++  GR                  +  S+S D  G  W  AKPT L  PNSGID VKL+D
Sbjct: 331 YDENGRRADQVKVKKQTFIRGKKYIIYSKSLDKEGEKWMKAKPTSLPGPNSGIDAVKLRD 390

Query: 249 GRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTL------------------EEN 284
           GR+++ YN      ++ RG L VA+S DDG +W  + T+                   + 
Sbjct: 391 GRVVIIYNHSWGLKSLGRGKLNVAISHDDGLTWTPSSTVVLESFDDGAISDVNVDKKNQK 450

Query: 285 LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVVLLPN 321
             +E SYPAVIQ      +HI YTY+R  IKHVV+ PN
Sbjct: 451 KPIEMSYPAVIQDPKTDLIHIVYTYDRQTIKHVVINPN 488


>gi|417326873|ref|ZP_12112443.1| hypothetical protein LTSEADE_2802 [Salmonella enterica subsp.
           enterica serovar Adelaide str. A4-669]
 gi|353572055|gb|EHC35813.1| hypothetical protein LTSEADE_2802 [Salmonella enterica subsp.
           enterica serovar Adelaide str. A4-669]
          Length = 360

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 171/359 (47%), Gaps = 54/359 (15%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTSIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            G +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NGDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+                  
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYRYPLSLIASQK 300

Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 RYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 357


>gi|417475643|ref|ZP_12170405.1| hypothetical protein LTSERUB_3011 [Salmonella enterica subsp.
           enterica serovar Rubislaw str. A4-653]
 gi|353643625|gb|EHC87773.1| hypothetical protein LTSERUB_3011 [Salmonella enterica subsp.
           enterica serovar Rubislaw str. A4-653]
          Length = 360

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 173/359 (48%), Gaps = 54/359 (15%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +   F  CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPI----Y 180
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W   + P+     
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEADNA 182

Query: 181 IPNESLS-----------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           IP  +++                       VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 IPGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
           +S D G TWS A+ T L N NSGID V + DG L+LA N V+                  
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPLSTATGESAK 300

Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 RYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 357


>gi|402306159|ref|ZP_10825210.1| bacterial alpha-L-rhamnosidase [Prevotella sp. MSX73]
 gi|400379926|gb|EJP32755.1| bacterial alpha-L-rhamnosidase [Prevotella sp. MSX73]
          Length = 1338

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 102/267 (38%), Positives = 142/267 (53%), Gaps = 38/267 (14%)

Query: 82   WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL 141
            WNPVLF +P   L LFYK+G  V  W+G + +S + G TWS+RE LP G +GP KNKP L
Sbjct: 1068 WNPVLFTMPDGELWLFYKVGATVADWTGWLVKSRDGGRTWSDREPLPKGFIGPVKNKPEL 1127

Query: 142  LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---------------- 185
            ++ G L+CGSS E+ + W   +E+       W+  GP+   N +                
Sbjct: 1128 VD-GRLVCGSSTEN-DGWRFHVEILDLKTGRWKYVGPVEAENRARTDDVEPGCDMVAPVY 1185

Query: 186  --------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
                    +  IQP      +  L+VLMR+ N   ++  S S DGG TWS    T++ N 
Sbjct: 1186 KQGEGPRPIYSIQPSILRLKDGRLQVLMRTHN--AKLATSFSADGGDTWSPVTLTEVPNN 1243

Query: 238  NSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAME 288
             SG D V L+DGR +L YN            R  + +ALS DDG  WH ALTLE++   +
Sbjct: 1244 QSGTDAVTLRDGRHVLIYNDFETLPGTKKGPRTPISIALS-DDGTHWHHALTLEDSPINQ 1302

Query: 289  FSYPAVIQASDGSVHITYTYNRTQIKH 315
            +SYPA+I+  DG +H  YT+ R +I +
Sbjct: 1303 YSYPAIIEGRDGKLHCVYTWRRQRIAY 1329


>gi|168233143|ref|ZP_02658201.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Kentucky str. CDC 191]
 gi|194471425|ref|ZP_03077409.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Kentucky str. CVM29188]
 gi|194457789|gb|EDX46628.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Kentucky str. CVM29188]
 gi|205332708|gb|EDZ19472.1| BNR/Asp-box repeat domain protein [Salmonella enterica subsp.
           enterica serovar Kentucky str. CDC 191]
          Length = 347

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/346 (34%), Positives = 169/346 (48%), Gaps = 41/346 (11%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESRTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+   L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARAIFLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 ESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 344


>gi|410031410|ref|ZP_11281240.1| hypothetical protein MaAK2_19509 [Marinilabilia sp. AK2]
          Length = 351

 Score =  167 bits (424), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 106/315 (33%), Positives = 169/315 (53%), Gaps = 16/315 (5%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIA 73
           EF F     PF+ CHAS+I  +  G +++ +F G+ E   DV IW+     G W  P + 
Sbjct: 36  EFVFQG-KMPFEQCHASSIESLGNGQYMMVWFAGTHEKNDDVGIWMAKGSPGSWSVPKLL 94

Query: 74  DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI-- 131
            +  +   WNPVLF+  +  L+LF+K+G+E+  W      S + G +WS  E+L  G   
Sbjct: 95  VKVRDDAHWNPVLFQTSAGRLILFFKVGKEIDDWETWWMYSEDAGNSWSNPEELVEGDRG 154

Query: 132 -LGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL---S 187
             GP +N  ++L NG  L  +S+E    W ++++ + D G++W K   + +  + +    
Sbjct: 155 GRGPVRNHMLVLSNGNWLAPASIEKNRVWNAFVDASKDKGKTWVKSDTLQLDRQLVVGEG 214

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+R+    G++  S+S D G TWS    T L N NSGID   L 
Sbjct: 215 VIQPALWESSPGQVHMLLRT--SAGKIGRSDSNDYGETWSPMLLTDLPNNNSGIDVAHLG 272

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEE-NLAMEFSYPAVIQASDGS 301
              L LAYN V      R  + +A+S+D+G SW   + +E+     EFSYP++I   +G 
Sbjct: 273 GRTLALAYNPVGKNWGPRYPMTLAISEDNGHSWPIKIDIEKGEEGNEFSYPSLIH-EEGY 331

Query: 302 VHITYTYNRTQIKHV 316
           + + YT+NRT I+ V
Sbjct: 332 LVLCYTWNRTNIRFV 346


>gi|315607322|ref|ZP_07882322.1| alpha-rhamnosidase [Prevotella buccae ATCC 33574]
 gi|315251025|gb|EFU31014.1| alpha-rhamnosidase [Prevotella buccae ATCC 33574]
          Length = 1338

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 101/267 (37%), Positives = 141/267 (52%), Gaps = 38/267 (14%)

Query: 82   WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL 141
            WNPVLF +P   L LFYK+G  V  W+G + +S + G TWS+RE LP G +GP KNKP L
Sbjct: 1068 WNPVLFTMPDGELWLFYKVGAIVADWTGWLVKSRDGGRTWSDREPLPKGFIGPVKNKPEL 1127

Query: 142  LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---------------- 185
            ++ G L+CGSS E+ + W   +E+       W+  GP+     +                
Sbjct: 1128 VD-GRLVCGSSTEN-DGWRFHVEILDLKTGRWKYVGPVEAEGRARTDDVEPGCDMVAPVY 1185

Query: 186  --------LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
                    +  IQP      +  L+VLMR+ N   ++  S S DGG TWS    T++ N 
Sbjct: 1186 KQGEGPRPIYSIQPSILRLKDGRLQVLMRTHN--AKLATSFSADGGDTWSPVTLTEVPNN 1243

Query: 238  NSGIDGVKLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAME 288
             SG D V L+DGR +L YN            R  + +ALS DDG  WH ALTLE++   +
Sbjct: 1244 QSGTDAVTLRDGRHVLIYNDFETLPGTKKGPRTPISIALS-DDGTHWHHALTLEDSPINQ 1302

Query: 289  FSYPAVIQASDGSVHITYTYNRTQIKH 315
            +SYPA+I+  DG +H  YT+ R +I +
Sbjct: 1303 YSYPAIIEGRDGKLHCVYTWRRQRIAY 1329


>gi|323141219|ref|ZP_08076120.1| BNR/Asp-box repeat protein [Phascolarctobacterium succinatutens YIT
           12067]
 gi|322414362|gb|EFY05180.1| BNR/Asp-box repeat protein [Phascolarctobacterium succinatutens YIT
           12067]
          Length = 338

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/328 (35%), Positives = 164/328 (50%), Gaps = 40/328 (12%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP--IIADEEPNVP 80
           P   CHAST++ +  G  L  +FGG+ EG  DV I++    D  W  P  ++     N  
Sbjct: 11  PTAFCHASTVLALPDGSLLCCWFGGTHEGESDVGIYISRRTDAGWSEPKLLVNGAAAN-- 68

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL----GPSK 136
            WNPVLF      LLLFYK GQ++  W   + +S + G TWS  ++L PG +    GP +
Sbjct: 69  -WNPVLFAGADGRLLLFYKQGQQIADWQTWLLQSTDNGETWSGPQELVPGDVSGGRGPVR 127

Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI-------------PN 183
           NKP+ L +G +L G+S E    W ++ +++ D G SW K   + I              N
Sbjct: 128 NKPLRLASGRILAGASTEH-GIWKAFADISDDDGASWHKSSAVQIEGLQYQTGEKTADSN 186

Query: 184 ESLS--------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
            ++S        VIQP  + +A+ ++ +L+RS  G   V  S+S D G TW  A    L 
Sbjct: 187 IAVSQQSFYGRGVIQPSLWQSADGSVHMLLRSSEGF--VYRSDSADDGETWCSAYALSLP 244

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           N NSG+D V+   G L L  N V+     R  L +  S D+G SW     LE   A EFS
Sbjct: 245 NNNSGLDLVRTAGGVLYLVCNPVAANWGQRSPLSLFKSTDEGASWQKLFDLETEPA-EFS 303

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YPA+I A    + +TYTY R  I  V L
Sbjct: 304 YPAII-ADGEDLMLTYTYKRCNIACVRL 330


>gi|417358590|ref|ZP_12133452.1| hypothetical protein LTSEGIV_2690 [Salmonella enterica subsp.
           enterica serovar Give str. S5-487]
 gi|353590824|gb|EHC49250.1| hypothetical protein LTSEGIV_2690 [Salmonella enterica subsp.
           enterica serovar Give str. S5-487]
          Length = 361

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 169/360 (46%), Gaps = 55/360 (15%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
           +S D G TWS A+ T L N NSGID V + DG L+LA N V+                  
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMPDGTLILALNPVNGNWGKRYPQRYPLSLIAG 300

Query: 260 -RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 301 KRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 358


>gi|323144495|ref|ZP_08079096.1| hypothetical protein HMPREF9444_01764 [Succinatimonas hippei YIT
           12066]
 gi|322415708|gb|EFY06441.1| hypothetical protein HMPREF9444_01764 [Succinatimonas hippei YIT
           12066]
          Length = 332

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/317 (34%), Positives = 172/317 (54%), Gaps = 37/317 (11%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI-IADEEPNVPMW 82
           F  CHAS ++++D G  +  +F GS EG  D  I++   K G    P+ IA  +     W
Sbjct: 18  FNMCHASNLIKLDDGTIITVFFAGSHEGNSDTSIYMCRAKAGESSEPVKIAGSDE--AHW 75

Query: 83  NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKP 139
           NPVLF +    L+LFYK+G  +  W   + RSY++G+TWSE E+L PG +   GP +NKP
Sbjct: 76  NPVLFAVSDEELVLFYKVGNIIATWRTMIVRSYDRGLTWSEPEELVPGDIGGRGPVRNKP 135

Query: 140 ILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK-----------YGP----IYIPNE 184
           + + +G +LC  S+E+   W ++++++ D  R+  K           + P    I +  +
Sbjct: 136 VRISSGAILCPGSLEN-GPWRAFLDISYDNLRTLEKSAEISYQEEGNFSPLNKGIEVSEQ 194

Query: 185 SLS---VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
           S S   VIQP  +   ++ + VL+RS    G+V  ++S DGG T+    P  + N NSG+
Sbjct: 195 SFSGKGVIQPTIWED-DKGVHVLLRS--TYGKVIRADSIDGGRTFLKPYPVNMDNNNSGL 251

Query: 242 DGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
           D V + +G+L L  N V      R  L +  S +DG ++ +   L +    EFSYP  I+
Sbjct: 252 DAVYV-NGKLYLVCNPVGGNWADRTPLTL-FSSEDGINFKEEAVLADGPG-EFSYPC-IR 307

Query: 297 ASDGSVHITYTYNRTQI 313
           A D +++I+YTY+R  I
Sbjct: 308 AYDNALYISYTYSRKNI 324


>gi|218129082|ref|ZP_03457886.1| hypothetical protein BACEGG_00656 [Bacteroides eggerthii DSM 20697]
 gi|217988717|gb|EEC55036.1| Alpha-L-rhamnosidase N-terminal domain protein [Bacteroides eggerthii
            DSM 20697]
          Length = 1345

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 159/323 (49%), Gaps = 58/323 (17%)

Query: 45   FGGSCEGAPDVKIWLQTFKDGR----WQSPI------IADEEPNVPMWNPVLFKLPSNGL 94
             GG+ E AP V +      DG+    W +P       + DE    P +NPVLF++P   L
Sbjct: 1029 MGGAREAAPAVPM-----GDGKNYPWWMAPAKNSGKPLVDETWRKPCYNPVLFQIPGGDL 1083

Query: 95   LLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
            +L+YKIG  V  W+G   RS N G TW + E L  G LGP KNKP+ + NG ++  SS E
Sbjct: 1084 VLYYKIGYGVGDWTGWQVRSKNGGKTWGKPEALAKGFLGPIKNKPVYI-NGRIIAPSSTE 1142

Query: 155  SWNSWGSWMEVTVDAGRSWRKYGPI----YIPN--------------------------E 184
              N W    E++ D G++W+  GP+     +P                           +
Sbjct: 1143 G-NGWKFHFEISDDKGKTWKYVGPVDAEYSLPTALRKAGIKKETGDDLEAGEVLTEKGAQ 1201

Query: 185  SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +  IQP      +  L  + R+ N   ++ ++ S D G TWS    + L N NSG D V
Sbjct: 1202 PILCIQPSILQLKDGRLMAIGRTRN--AKLAVTYSSDCGDTWSKVVLSDLPNNNSGTDAV 1259

Query: 245  KLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
             L DGR ++ +N  +         R  + +A+S+DDG +W +A+ LE++   ++SYP++I
Sbjct: 1260 TLADGRQVVVFNDFATLPGTPKGVRTPVSIAVSEDDGKTWKNAVVLEDSPISQYSYPSII 1319

Query: 296  QASDGSVHITYTYNRTQIKHVVL 318
            Q  DG +H  YT+ R +I +  +
Sbjct: 1320 QGKDGKLHCVYTWRRKRIAYKAI 1342



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDG 65
           K +V +EF +    A F  CH +T+VE++ G  + A+FGG+ E  PD  IW   + +   
Sbjct: 910 KCIVTDEFLY--EDAGFPQCHGATVVELENGDLVAAFFGGTREKNPDCNIWTCRKPYGSK 967

Query: 66  RWQSPIIA 73
            W  P  A
Sbjct: 968 EWTKPYKA 975


>gi|317474864|ref|ZP_07934134.1| bacterial alpha-L-rhamnosidase [Bacteroides eggerthii 1_2_48FAA]
 gi|316909002|gb|EFV30686.1| bacterial alpha-L-rhamnosidase [Bacteroides eggerthii 1_2_48FAA]
          Length = 1315

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 104/323 (32%), Positives = 159/323 (49%), Gaps = 58/323 (17%)

Query: 45   FGGSCEGAPDVKIWLQTFKDGR----WQSPI------IADEEPNVPMWNPVLFKLPSNGL 94
             GG+ E AP V +      DG+    W +P       + DE    P +NPVLF++P   L
Sbjct: 999  MGGAREAAPAVPM-----GDGKNYPWWMAPAKNSGKPLVDETWRKPCYNPVLFQIPGGDL 1053

Query: 95   LLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
            +L+YKIG  V  W+G   RS N G TW + E L  G LGP KNKP+ + NG ++  SS E
Sbjct: 1054 VLYYKIGYGVGDWTGWQVRSKNGGKTWGKPEALAKGFLGPIKNKPVYI-NGRIIAPSSTE 1112

Query: 155  SWNSWGSWMEVTVDAGRSWRKYGPI----YIPN--------------------------E 184
              N W    E++ D G++W+  GP+     +P                           +
Sbjct: 1113 G-NGWKFHFEISDDKGKTWKYVGPVDAEYSLPTALRKSGIKKETGDDLEAGEVLTEKGAQ 1171

Query: 185  SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +  IQP      +  L  + R+ N   ++ ++ S D G TWS    + L N NSG D V
Sbjct: 1172 PILCIQPSILQLKDGRLMAIGRTRN--AKLAVTYSSDCGDTWSKVVLSDLPNNNSGTDAV 1229

Query: 245  KLKDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
             L DGR ++ +N  +         R  + +A+S+DDG +W +A+ LE++   ++SYP++I
Sbjct: 1230 TLADGRQVVVFNDFATLPGTPKGVRTPVSIAVSEDDGKTWKNAVVLEDSPISQYSYPSII 1289

Query: 296  QASDGSVHITYTYNRTQIKHVVL 318
            Q  DG +H  YT+ R +I +  +
Sbjct: 1290 QGKDGKLHCVYTWRRKRIAYKAI 1312



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDG 65
           K +V +EF +    A F  CH +T+VE++ G  + A+FGG+ E  PD  IW   + +   
Sbjct: 880 KCIVTDEFLY--EDAGFPQCHGATVVELENGDLVAAFFGGTREKNPDCNIWTCRKPYGSK 937

Query: 66  RWQSPIIA 73
            W  P  A
Sbjct: 938 EWTKPYKA 945


>gi|197286767|ref|YP_002152639.1| hypothetical protein PMI2938 [Proteus mirabilis HI4320]
 gi|227355042|ref|ZP_03839453.1| BNR repeat-containing glycosyl hydrolase [Proteus mirabilis ATCC
           29906]
 gi|194684254|emb|CAR45786.1| conserved hypothetical protein [Proteus mirabilis HI4320]
 gi|227164829|gb|EEI49676.1| BNR repeat-containing glycosyl hydrolase [Proteus mirabilis ATCC
           29906]
          Length = 350

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 115/343 (33%), Positives = 169/343 (49%), Gaps = 41/343 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           +F      + F  CHASTIV V +    L A+F G  EG+ +  IWL   +   WQ    
Sbjct: 9   QFVLDNAHSLFNHCHASTIVRVPNSERLLTAFFAGDKEGSGNTAIWLAIKEGNDWQPAQP 68

Query: 73  ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
             + P V  WNPVL   PS G + LFYK G +V  W+     S + G +WS   +L  G 
Sbjct: 69  VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128

Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
           +   GP KNK +++ NG  L   SVE    W ++++++ D G+ W++   PI        
Sbjct: 129 VTPRGPVKNKVLVMSNGEWLAPGSVEDDRYWDAFVDISSDNGQHWQRVDIPIAHHQGGQA 188

Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
                  + +++L             VIQP  + +    +  +MRS    G++  S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWGSQPGHVHAMMRSTR--GKIYRSDSTD 246

Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
            G +W  A  T L N NSGID V   DG L L YN  S     R  + V+LS D+G +W 
Sbjct: 247 YGRSWCPAYATTLPNNNSGIDVVSFADGLLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
           +   L +    EFSYPA+I A + ++H+T+T+NR  I +  L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLI 347


>gi|226327532|ref|ZP_03803050.1| hypothetical protein PROPEN_01403 [Proteus penneri ATCC 35198]
 gi|225204058|gb|EEG86412.1| BNR/Asp-box repeat protein [Proteus penneri ATCC 35198]
          Length = 350

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/343 (33%), Positives = 168/343 (48%), Gaps = 41/343 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           +F      + F  CHASTIV V +    L A+F G  EG+ +  IWL   +   WQ    
Sbjct: 9   QFVLDNAHSLFNHCHASTIVRVPNSERLLTAFFAGDKEGSGNTAIWLVMKEGDHWQHAQP 68

Query: 73  ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
             + P V  WNPVL   PS G + LFYK G +V  W+     S + G +WS   +L  G 
Sbjct: 69  VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128

Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
           +   GP KNK +++ NG  L   SVE    W ++++++ D G+ W +   PI        
Sbjct: 129 VAPRGPVKNKVLVMSNGEWLAPGSVEDDQFWDAFVDISSDNGQHWLRVDIPIEHHKGGQT 188

Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
                  + +++L             VIQP  + +    +  +MRS  G   +  S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWESQPGHVHAMMRSTRGT--IYRSDSTD 246

Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
            G +W  A  T L N NSGID V   DG+L L YN  S     R  + V+LS D+G +W 
Sbjct: 247 YGRSWCPAYATILPNNNSGIDVVSFADGQLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
           +   L +    EFSYPA+I A + ++H+T+T+NR  I +  L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLM 347


>gi|425069501|ref|ZP_18472616.1| hypothetical protein HMPREF1311_02686 [Proteus mirabilis WGLW6]
 gi|425070956|ref|ZP_18474062.1| hypothetical protein HMPREF1310_00352 [Proteus mirabilis WGLW4]
 gi|404596927|gb|EKA97434.1| hypothetical protein HMPREF1311_02686 [Proteus mirabilis WGLW6]
 gi|404599781|gb|EKB00234.1| hypothetical protein HMPREF1310_00352 [Proteus mirabilis WGLW4]
          Length = 350

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 168/343 (48%), Gaps = 41/343 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           +F      + F  CHASTI  V +    L A+F G  EG+ +  IWL   +   WQ    
Sbjct: 9   QFVLDNAHSLFNHCHASTIARVPNSERLLTAFFAGDKEGSGNTAIWLAIKEGNDWQPAQP 68

Query: 73  ADEEPNVPMWNPVLFKLPSNG-LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
             + P V  WNPVL   PS G + LFYK G +V  W+     S + G +WS   +L  G 
Sbjct: 69  VVKNPGVAHWNPVLHVDPSTGNIWLFYKTGPDVHSWTTQYVISKDGGNSWSLPSELVSGD 128

Query: 132 L---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG-PIY------- 180
           +   GP KNK +++ NG  L   SVE    W ++++++ D G+ W++   PI        
Sbjct: 129 VTPRGPVKNKVLVMSNGEWLAPGSVEDDRYWDAFVDISSDNGQHWQRVDIPIAHHQGGQA 188

Query: 181 -------IPNESL------------SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
                  + +++L             VIQP  + +    +  +MRS    G++  S+S D
Sbjct: 189 EHEIWQGLKDDALWETDLQRVFQWDGVIQPTLWGSQPGHVHAMMRSTR--GKIYRSDSTD 246

Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWH 276
            G +W  A  T L N NSGID V   DG L L YN  S     R  + V+LS D+G +W 
Sbjct: 247 YGRSWCPAYATTLPNNNSGIDVVSFADGLLALVYNPNSGNWSRRYPISVSLSSDNGSTWS 306

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
           +   L +    EFSYPA+I A + ++H+T+T+NR  I +  L+
Sbjct: 307 EPFDLLDGEG-EFSYPAII-AENNTLHVTFTWNRKNIVYQQLI 347


>gi|417421639|ref|ZP_12160118.1| hypothetical protein LTSEMIS_3387 [Salmonella enterica subsp.
           enterica serovar Mississippi str. A4-633]
 gi|353618915|gb|EHC69466.1| hypothetical protein LTSEMIS_3387 [Salmonella enterica subsp.
           enterica serovar Mississippi str. A4-633]
          Length = 326

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/328 (34%), Positives = 160/328 (48%), Gaps = 55/328 (16%)

Query: 41  LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKI 100
           + A+F G CEG+ D  IWL  ++   W +P           WNPVLF  PS+ L LFYK+
Sbjct: 1   MAAWFAGLCEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKV 59

Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWN 157
           G  V  W      S ++G TWS    L  G +   GP KNK +L  NG  +   S+ES  
Sbjct: 60  GSGVHVWKTWFITSSDRGFTWSTPAPLVNGDILPRGPVKNKLLLASNGAWIAPGSIESPE 119

Query: 158 SWGSWMEVTVDAGRSWR-KYGPI----YIPNESLS-----------------------VI 189
            W ++++ + D G+ W   + P+     IP  +++                       VI
Sbjct: 120 RWRAFVDRSSDEGKHWNISFVPLEPDNAIPGTNVALWDGVKKGMLWECCLENLLRWDGVI 179

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++DG
Sbjct: 180 QPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDG 237

Query: 250 RLLLAYNTVS-------------------RGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
            L+LA N V+                   R  L +  S+D+G+SW   L LE +   E+S
Sbjct: 238 TLILALNPVNGNWGKRYPLSTATGESARKRYPLSLIASQDNGESWLPLLDLESDRG-EYS 296

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YPA+I +  G VHITYT+NR  I +  L
Sbjct: 297 YPAII-SEGGVVHITYTWNRKNIVYCRL 323


>gi|56413761|ref|YP_150836.1| hypothetical protein SPA1594 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|56128018|gb|AAV77524.1| hypothetical protein SPA1594 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
          Length = 320

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 108/310 (34%), Positives = 153/310 (49%), Gaps = 41/310 (13%)

Query: 45  FGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEV 104
           F G  EG+ D  IWL  ++   W +P           WNPVLF  PS+ L LFYK+G +V
Sbjct: 13  FAGLREGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDV 71

Query: 105 QKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGS 161
             W      S ++G TWS    L    +   GP KNK +L  NG  +   S+ES   W +
Sbjct: 72  HVWKTWFIISSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRA 131

Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLS----------------------------VIQPVP 193
           +++ + D G+ W        P+ ++S                            VIQP  
Sbjct: 132 FVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTL 191

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
           + +++  + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++DG L+L
Sbjct: 192 WESSSGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLIL 249

Query: 254 AYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
           A N V+     R  L +  S D+G+SW   L LE +   E+SYPA+I +  G VHITYT+
Sbjct: 250 ALNPVNGNWGKRYPLSLIASHDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTW 307

Query: 309 NRTQIKHVVL 318
           NR  I +  L
Sbjct: 308 NRKNIVYCRL 317


>gi|289810295|ref|ZP_06540924.1| hypothetical protein Salmonellaentericaenterica_40322 [Salmonella
           enterica subsp. enterica serovar Typhi str. AG3]
          Length = 305

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/306 (34%), Positives = 151/306 (49%), Gaps = 41/306 (13%)

Query: 49  CEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWS 108
           CEG+ D  IWL  ++   W +P           WNPVLF  PS+ L LFYK+G  V  W 
Sbjct: 2   CEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWK 60

Query: 109 GCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEV 165
                S ++G TWS    L    +   GP KNK +L  NG  +   S+ES   W ++++ 
Sbjct: 61  TWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDR 120

Query: 166 TVDAGRSWRKYGPIYIPNESLS----------------------------VIQPVPFHTA 197
           + D G+ W        P+ ++S                            VIQP  + ++
Sbjct: 121 SSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESS 180

Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
              + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++DG L+LA N 
Sbjct: 181 PGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATFLPNNNSGIDLVSMQDGTLILALNP 238

Query: 258 VS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
           V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  
Sbjct: 239 VNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SKGGVVHITYTWNRKN 296

Query: 313 IKHVVL 318
           I +  L
Sbjct: 297 IVYCRL 302


>gi|197362683|ref|YP_002142320.1| hypothetical protein SSPA1481 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|197094160|emb|CAR59661.1| hypothetical protein SSPA1481 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
          Length = 305

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 106/305 (34%), Positives = 151/305 (49%), Gaps = 41/305 (13%)

Query: 50  EGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG 109
           EG+ D  IWL  ++   W +P           WNPVLF  PS+ L LFYK+G +V  W  
Sbjct: 3   EGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKT 61

Query: 110 CMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVT 166
               S ++G TWS    L    +   GP KNK +L  NG  +   S+ES   W ++++ +
Sbjct: 62  WFIISSDRGFTWSTPAPLVNDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRS 121

Query: 167 VDAGRSWRKYGPIYIPNESLS----------------------------VIQPVPFHTAN 198
            D G+ W        P+ ++S                            VIQP  + +++
Sbjct: 122 SDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSS 181

Query: 199 RTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV 258
             + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++DG L+LA N V
Sbjct: 182 GHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPV 239

Query: 259 S-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           +     R  L +  S D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I
Sbjct: 240 NGNWGKRYPLSLIASHDNGESWLPLLDLESDHG-EYSYPAII-SEGGVVHITYTWNRKNI 297

Query: 314 KHVVL 318
            +  L
Sbjct: 298 VYCRL 302


>gi|406836101|ref|ZP_11095695.1| hypothetical protein SpalD1_30819 [Schlesneria paludicola DSM
           18645]
          Length = 543

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 110/305 (36%), Positives = 151/305 (49%), Gaps = 24/305 (7%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           H STI E   G  L  ++GGS E A D  ++L   + G   WQ P    + P+    N V
Sbjct: 237 HCSTIAEAANGDLLCLWYGGSYESADDQTLFLARRRAGERAWQEPQAILKGPDPLPGNGV 296

Query: 86  LFKLPSNGLLLFYKIGQEV------QKWSGC--MKR-SYNKGVTWS-EREQLPPGILGPS 135
           +F      + + +   + V      Q W  C  M R S + G+TWS + E L   +L   
Sbjct: 297 IFVDGHKRVWIVWCRMESVRPIGRGQGWDKCRLMARVSTDHGLTWSSDTEFLDQKLLAVP 356

Query: 136 KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
           +N PILL +G L+           GS   +  D G+ WR+ G  + P  S    QP    
Sbjct: 357 RNPPILLSSGNLVLPLEAIVDGVEGSIFLIGADGGKQWRRGG--FTPGGS----QPAVIQ 410

Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
             +++L  LMR      R+   ES DGG  WS A P+ L NP+SGI   +L +G  ++ +
Sbjct: 411 RRDKSLFALMRK---APRLTQIESRDGGDNWSQATPSSLRNPDSGISMTQLANGHAVVVF 467

Query: 256 N--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           N   ++R  L +A S D+G +W   L LE N   E+SYP VIQ SDG +HITYTY R  I
Sbjct: 468 NDTELNRTPLSIARSIDEGHTWETPLHLESNPG-EYSYPCVIQTSDGKIHITYTYRRYSI 526

Query: 314 KHVVL 318
           KHV L
Sbjct: 527 KHVEL 531


>gi|227538528|ref|ZP_03968577.1| cytoplasmic protein [Sphingobacterium spiritivorum ATCC 33300]
 gi|227241447|gb|EEI91462.1| cytoplasmic protein [Sphingobacterium spiritivorum ATCC 33300]
          Length = 309

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 91/225 (40%), Positives = 121/225 (53%), Gaps = 12/225 (5%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           +V ++F F  N   F  CHAST+ E   G    ++FGG+ EG  DV IW   F   +W +
Sbjct: 1   MVKKDFLFEENRY-FAQCHASTLAETASGEIRASWFGGTHEGNKDVVIWSAAFDGQKWSA 59

Query: 70  PI-----IADEEPNVPMWNPVLFKLPS-NGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           P+     I+ +      WNPVLFK    N L L+YK+G   ++W G +K S + G TWS 
Sbjct: 60  PVSVADGISHDTVRYACWNPVLFKTKEDNTLYLYYKVGPNPREWWGAVKTSADNGKTWSS 119

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
              LP GILGP KNKP+ L NG++L  SSVE + + W   +E + D  +SW  Y PI   
Sbjct: 120 ASLLPKGILGPVKNKPLELANGIILSPSSVEVTEDRWMGHVERSDDQQKSWTSY-PI-DH 177

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           N   ++IQP     A+  L+VL RS  G   V  S S D G  WS
Sbjct: 178 NSPFNIIQPSILQHADGRLQVLCRSKEGF--VMTSWSSDEGKNWS 220


>gi|427752421|ref|ZP_18965946.1| hypothetical protein B581_07445, partial [Salmonella enterica
           subsp. enterica serovar Typhimurium str. STm5]
 gi|414066191|gb|EKT46800.1| hypothetical protein B581_07445, partial [Salmonella enterica
           subsp. enterica serovar Typhimurium str. STm5]
          Length = 305

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 99/303 (32%), Positives = 144/303 (47%), Gaps = 39/303 (12%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDG 272
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+     R  L +  S+D+G
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNG 300

Query: 273 DSW 275
           +SW
Sbjct: 301 ESW 303


>gi|157803702|ref|YP_001492251.1| hypothetical protein A1E_02610 [Rickettsia canadensis str. McKiel]
 gi|157784965|gb|ABV73466.1| hypothetical protein A1E_02610 [Rickettsia canadensis str. McKiel]
          Length = 363

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/362 (32%), Positives = 186/362 (51%), Gaps = 62/362 (17%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
           EEF    N++  K  HA+TI   D    ++++FG     S +   +  IW+   FK    
Sbjct: 9   EEFIL--NNSDLKDVHAATIFAFDDQKLIISFFGKTQSESGKSEENNSIWISLGFKKKNH 66

Query: 66  -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
            +W +P  II+ +   + ++P+          NPV+    +N LL+F KIG   + WSG 
Sbjct: 67  YQWTTPERIISPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           + RSY++G+TW ++ +L  GILGP++NKP++ EN  +L   S ES     S++E + D  
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYENN-ILSPCSRESCIDDFSYIEYSSDL- 182

Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
            +W    PI   N  +        IQP    ++N   +++M    R  + +    +  S 
Sbjct: 183 HTWNLSNPILPFNPQIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDSLISTYIHRSI 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
           S + G+ WS  +P  LLNP+S ID + L    LLLAYN V     SR +L VA S D+G 
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDTINLSSDVLLLAYNRVINNHKSRNILSVATSYDEGL 301

Query: 274 SWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVV--L 318
           +W   +T+  +            L+ E+SYPA+I + D + +H+ YT+NR  +KH V  L
Sbjct: 302 NWR-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHVIYTFNRINLKHKVFQL 360

Query: 319 LP 320
           LP
Sbjct: 361 LP 362


>gi|379022881|ref|YP_005299542.1| hypothetical protein RCA_02435 [Rickettsia canadensis str. CA410]
 gi|376323819|gb|AFB21060.1| hypothetical protein RCA_02435 [Rickettsia canadensis str. CA410]
          Length = 363

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 118/362 (32%), Positives = 185/362 (51%), Gaps = 62/362 (17%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
           EEF    N++  K  HA+TI   D    + ++FG     S +   +  IW+   FK    
Sbjct: 9   EEFIL--NNSDLKDVHAATIFAFDDQKLITSFFGKTQSESGKSEENNSIWISLGFKKNNH 66

Query: 66  -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
            +W +P  II+ +   + ++P+          NPV+    +N LL+F KIG   + WSG 
Sbjct: 67  YQWTTPERIISPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           + RSY++G+TW ++ +L  GILGP++NKP++ EN  +L   S ES     S++E + D  
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYENN-ILSPCSRESCIDDFSYIEYSSDL- 182

Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
            +W    PI   N  +        IQP    ++N   +++M    R  + +    +  S 
Sbjct: 183 HTWNLSNPILPFNPQIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDSLISTYIHRSI 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
           S + G+ WS  +P  LLNP+S ID + L    LLLAYN V     SR +L +A S D+G 
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDTINLSSDVLLLAYNRVINNHKSRNILSLATSYDEGL 301

Query: 274 SWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKHVV--L 318
           +W   +T+  +            L+ E+SYPA+I + D + +H+ YT+NR  +KH V  L
Sbjct: 302 NWR-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHVIYTFNRINLKHKVFQL 360

Query: 319 LP 320
           LP
Sbjct: 361 LP 362


>gi|412987550|emb|CCO20385.1| putative cytoplasmic protein [Bathycoccus prasinos]
          Length = 562

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 186/379 (49%), Gaps = 75/379 (19%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEV-----DKGHFLVAYFGGSCEGAPDVKIWL--QT 61
           G    +      ++P   CHA+T + +     DK   LVA+FGGS EGA DV IW   + 
Sbjct: 163 GATRNQAKLNERTSPTFGCHATTSMHIKNPDTDKLEVLVAWFGGSYEGAEDVGIWTARRD 222

Query: 62  FKDGRWQSPIIADE---------------------EPNV----PMWNPVLF---KLPSNG 93
            + G W  P +A +                     EP      P WNPVLF      S+G
Sbjct: 223 EETGMWTKPKLAAKKFRNKPYEGKNLKHNSPYNKPEPERKGGEPHWNPVLFCAGAGESDG 282

Query: 94  -----------LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKP 139
                      ++L++K+G ++  W     RS + G TWSE ++      G  GP KNKP
Sbjct: 283 DGASLGSCEKEIVLYFKLGWKIWFWETWEVRSRDGGNTWSEPKETVKNDKGGRGPVKNKP 342

Query: 140 ILLENGLLLCGSSVE------SWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-LSVIQPV 192
           I+L +G  L  +S+E          W ++ ++++D G++W K   I +P+E+    IQP 
Sbjct: 343 IVLSDGTWLAPASLEGPMPGQKRKQWRAFADISIDLGKTW-KPSSIILPSENGYGNIQPT 401

Query: 193 PFHTANRTLRVLMRSFNGIGRVCM--SESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDG 249
            + +      +L+RS  G   + +  S+S DGG TW+ ++ T+L N NSG+D   L K  
Sbjct: 402 VWESEPGHAHMLLRSSRGRKGMWIWRSDSKDGGKTWTQSRKTKLPNNNSGLDVAYLPKSK 461

Query: 250 RLLLAYNTV----SRGVLKVALSKDDGDSWHDALTLE------ENLAMEFSYPAVI---- 295
            L+LAYN V    SR  +++A+SKD+G++W   + L+       N   E+SYP+ +    
Sbjct: 462 VLVLAYNHVMNVDSRSPIRLAISKDNGENWELQVELDNEKGSVHNAGHEYSYPSCVPWPA 521

Query: 296 -QASDGSVHITYTYNRTQI 313
            +  +  V +TYT++R ++
Sbjct: 522 KEYGEEGVSVTYTWHRRRV 540


>gi|383312842|ref|YP_005365643.1| hypothetical protein MCE_05845 [Candidatus Rickettsia amblyommii
           str. GAT-30V]
 gi|378931502|gb|AFC70011.1| hypothetical protein MCE_05845 [Candidatus Rickettsia amblyommii
           str. GAT-30V]
          Length = 364

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/362 (31%), Positives = 183/362 (50%), Gaps = 61/362 (16%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG-- 65
           EEF    N++  K  HA+TI   D    + ++FG     S +   +  IW+   FK    
Sbjct: 9   EEFIL--NNSDLKDVHAATIFAFDDKKLITSFFGKTRSESGKSEENNSIWISLGFKSNNH 66

Query: 66  -RWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGC 110
            +W +P  I++ +   + ++P+          NPV+    +N LL+F KIG   + WSG 
Sbjct: 67  YQWTTPERIVSPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGI 125

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           + RSY++G+TW ++ +L  GILGP++NKP++ +N +L    S ES     S++E + D  
Sbjct: 126 LSRSYDRGLTW-QKPELLHGILGPARNKPLIYKNNIL-SPCSRESCIDDFSYIEYSSDL- 182

Query: 171 RSWRKYGPIYIPNESL------SVIQPVPFHTANRTLRVLM----RSFNGI--GRVCMSE 218
            +W    PI   N  +        IQP    ++N   +++M    R  + +    +  S 
Sbjct: 183 HTWNLSNPILPSNPKIFQKGYRGFIQPT-LVSSNLPNKIIMLVRPRKTDALMSTYIHRSI 241

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGD 273
           S + G+ WS  +P  LLNP+S ID + L D  LLLAYN V     SR +L +A+S D+G 
Sbjct: 242 SINKGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLAYNRVINHLKSRNILSLAISYDEGL 301

Query: 274 SWHDALTLEENL-----------AMEFSYPAVIQAS--DGSVHITYTYNRTQIKHVV--L 318
           +W+  +                 + E+SYPA+I  S  +  +H+ YT+NR  +KH V  L
Sbjct: 302 NWYPIVIKNSTYPEGDIEYSNITSEEYSYPAIIIMSPDNNEIHVIYTFNRINLKHKVFQL 361

Query: 319 LP 320
           LP
Sbjct: 362 LP 363


>gi|417389583|ref|ZP_12153340.1| hypothetical protein LTSEMIN_1521, partial [Salmonella enterica
           subsp. enterica serovar Minnesota str. A4-603]
 gi|353621404|gb|EHC71229.1| hypothetical protein LTSEMIN_1521, partial [Salmonella enterica
           subsp. enterica serovar Minnesota str. A4-603]
          Length = 299

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 96/295 (32%), Positives = 137/295 (46%), Gaps = 47/295 (15%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +   F  CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS------ 122
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS      
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122

Query: 123 EREQLPPGIL----------GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS 172
             + LP G L          GP KNK +L  NG  +   S+ES   W ++++ + D G+ 
Sbjct: 123 NDDILPRGPLVNDDILPLPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKH 182

Query: 173 WRKYGPIYIPNESLS----------------------------VIQPVPFHTANRTLRVL 204
           W        P+ ++S                            VIQP  + ++   + +L
Sbjct: 183 WNISFVPLEPDNAISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHML 242

Query: 205 MRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
           +RS  G   +  S+S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 243 LRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 295


>gi|417539863|ref|ZP_12192042.1| hypothetical protein LTSEWAN_3128 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
 gi|353663716|gb|EHD02332.1| hypothetical protein LTSEWAN_3128 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
          Length = 305

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 133/282 (47%), Gaps = 34/282 (12%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGVKKGMLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 282


>gi|417339308|ref|ZP_12120895.1| hypothetical protein LTSEBAI_0535 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
 gi|357960520|gb|EHJ84337.1| hypothetical protein LTSEBAI_0535 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
          Length = 295

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 133/282 (47%), Gaps = 34/282 (12%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W        P+ +
Sbjct: 123 NDDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHWSISFVPLEPDNA 182

Query: 186 LS----------------------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +S                            VIQP  + ++   + +L+RS  G   +  S
Sbjct: 183 ISGTNVALWDGIKKGRLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRS 240

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS 259
           +S D G TWS A+ T L N NSGID V ++DG L+LA N V+
Sbjct: 241 DSIDYGATWSVARATSLPNNNSGIDLVSMQDGTLILALNPVN 282


>gi|417373939|ref|ZP_12143845.1| hypothetical protein LTSEINV_2977 [Salmonella enterica subsp.
           enterica serovar Inverness str. R8-3668]
 gi|353601102|gb|EHC56810.1| hypothetical protein LTSEINV_2977 [Salmonella enterica subsp.
           enterica serovar Inverness str. R8-3668]
          Length = 321

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 128/264 (48%), Gaps = 34/264 (12%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVL 86
           CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +P           WNPVL
Sbjct: 32  CHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPVL 91

Query: 87  FKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILLE 143
           F  PS+ L LFYK+G +V  W      S ++G TWS    L    +   GP KNK +L  
Sbjct: 92  F-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLAS 150

Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS---------------- 187
           NG  +   S+ES   W ++++ + D G+ W        P+ ++S                
Sbjct: 151 NGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKGMLW 210

Query: 188 ------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
                       VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L 
Sbjct: 211 ECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLP 268

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS 259
           N NSGID V ++DG L+LA N V+
Sbjct: 269 NNNSGIDLVSMQDGTLILALNPVN 292


>gi|58267692|ref|XP_571002.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134112085|ref|XP_775578.1| hypothetical protein CNBE2920 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50258237|gb|EAL20931.1| hypothetical protein CNBE2920 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57227236|gb|AAW43695.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 372

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 108/343 (31%), Positives = 155/343 (45%), Gaps = 58/343 (16%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLF 87
           H STI +   G   V +FGG+ EG  DVKIW     +G W  P +     ++  WNPV F
Sbjct: 22  HCSTI-QAACGVEYVTWFGGTKEGTSDVKIWFSKCTNGCWTPPRVIAGSGDIVHWNPVSF 80

Query: 88  KLPSNG----LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI----LGPSKNKP 139
                G    L +FYK G  +  W   +  + + G TW+E  +L PG      GP KN P
Sbjct: 81  LPDPKGAPGHLYVFYKTGTPIPTWKTYVIETIDGGNTWTEPRELVPGDHSGGRGPQKNPP 140

Query: 140 ILLENGLLLCGSSVE-----SWNSWGSWMEV--------TVDAGRSWRKYGPIYIPNE-- 184
           I+L NG  L G+S E         W ++ ++        T   G  W K   I +P +  
Sbjct: 141 IVLSNGDWLSGASKEVTLEGGKGLWDAFADIAPTPKEGDTFKQGERWVKAEFIKLPADRG 200

Query: 185 -------SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
                     +IQP  + +A++   V M   + IGR+  ++S D G TW+    T L N 
Sbjct: 201 VEGGSFPGEGIIQPSLWESADKPGHVHMMCRSSIGRIVRADSEDYGRTWTPGYATDLPNN 260

Query: 238 NSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEEN-------- 284
           NSG    +L+DGRL+ A N V      R  L + +S DDG++W    TLE+         
Sbjct: 261 NSGQCVTRLRDGRLVAAVNDVYKNWGPRTPLVLKVSFDDGETWSPWCTLEDQAPPASFQR 320

Query: 285 -LAM----------EFSYPAVIQASDGS---VHITYTYNRTQI 313
            +A+          EFSYP V   ++     V ++YT+ R  I
Sbjct: 321 VIALETGIVNDGKSEFSYPTVTPTAETDEIGVWVSYTWQRRGI 363


>gi|239947969|ref|ZP_04699722.1| BNR/Asp-box repeat domain protein [Rickettsia endosymbiont of
           Ixodes scapularis]
 gi|239922245|gb|EER22269.1| BNR/Asp-box repeat domain protein [Rickettsia endosymbiont of
           Ixodes scapularis]
          Length = 375

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 179/371 (48%), Gaps = 58/371 (15%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK---- 56
           M+    I     +EF    +    K  HA  ++ +++   +V +FG        ++    
Sbjct: 1   MQNKIPILETTIDEFIL--HDLDLKDVHAVNLLVLNEQELIVTFFGKRKSEPKGLEENNS 58

Query: 57  IW----LQTFKDGRWQSP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFY 98
           IW    L+  +  +W  P  I++ +   + ++P+          NPVL  L +N LL+F 
Sbjct: 59  IWVCFGLKENRRYQWSIPQRIVSPQYFQDHHIPLKKDNGVISCGNPVL-TLFNNQLLIFS 117

Query: 99  KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNS 158
           KIG   + WSG + RSY+ GVTW E E L  GILGP++NKP++L++ ++L  SS ES   
Sbjct: 118 KIGPYPRTWSGILSRSYDHGVTWQEPEILH-GILGPTRNKPLILQDNIMLSPSSRESCID 176

Query: 159 WGSWMEVTVDAGRSWRKYGPIYIPNESL------SVIQP--VPF-HTANRTLRVLMRSFN 209
              ++E + D  R W    PI   N  +        IQP  VP   TA   + +L+R  N
Sbjct: 177 DFPYVESSSDL-RIWHTSSPILTHNPKIFQEGYRGFIQPTLVPLTDTAGHKIIMLVRPRN 235

Query: 210 -----GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----- 259
                    +  S S + G  WS  KP  LLNP+S ID V L +  LLLAYN V      
Sbjct: 236 TNPSYSKLHIHRSVSVNKGKDWSNLKPIDLLNPDSAIDAVNLGNNLLLLAYNRVVHYSKF 295

Query: 260 RGVLKVALSKDDGDSWHDA-----------LTLEENLAMEFSYPAVIQASDGS-VHITYT 307
           R +L +A+S ++G +W+             +   +  + E+SYPA++  ++   +H+ YT
Sbjct: 296 RNILSLAVSFNEGLNWYPIKIKYSTFPEGDIEYSKITSQEYSYPAILMDTNNQEIHVVYT 355

Query: 308 YNRTQIKHVVL 318
           +NR   KH V 
Sbjct: 356 FNRVNFKHKVF 366


>gi|414590411|tpg|DAA40982.1| TPA: hypothetical protein ZEAMMB73_041549 [Zea mays]
          Length = 99

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 61/94 (64%), Positives = 74/94 (78%)

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV 262
           +L+RS   IG VCM++S DGGL WS+A+ + L NPNSGIDG+K+KDGR+ LAYNTVSRG 
Sbjct: 1   MLLRSHQTIGSVCMADSYDGGLRWSFARKSVLPNPNSGIDGIKMKDGRVALAYNTVSRGT 60

Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
           LKVA+S DDG SW + +TLE     EFSYPAVIQ
Sbjct: 61  LKVAVSTDDGLSWQEVVTLENAEGWEFSYPAVIQ 94


>gi|404399265|ref|ZP_10990849.1| hypothetical protein PfusU_05869 [Pseudomonas fuscovaginae UPB0736]
          Length = 372

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 162/363 (44%), Gaps = 77/363 (21%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-QTFKDGRWQSPIIADEEPNVPM 81
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L +   +G W  P+   E+P    
Sbjct: 14  PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRKSPEGEWSEPVKLSEDPTRSE 73

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKN 137
            NP+LF+ P   L L +       + +  ++R  S++ G+TW + E L   PGI    + 
Sbjct: 74  QNPILFQPPQGPLWLIWTAQVSGNQETAIVRRRLSHDDGLTWGQIETLFDEPGIF--VRQ 131

Query: 138 KPILLENG---------LLLCGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
            P++L NG         + L G   E W  +   S + ++ DAG SW+++    +P ES 
Sbjct: 132 PPVILRNGDWALPVWYCIALPG---EKWVGSHDVSAVMISSDAGHSWQRHD---VP-EST 184

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
            ++       A+ +L  L RS      +  S S D G TWS   PT+L N NS I  V+L
Sbjct: 185 GLVHMNIQQMADASLLGLFRS-RWADHIHESRSTDNGRTWSTPVPTELANNNSSIQFVRL 243

Query: 247 KDGRLLLAYNTVS---------------------------------------RGVLKVAL 267
             G L L YN V+                                       R  + +A+
Sbjct: 244 ASGELALVYNPVNAEGITERRTSLYDEIEDEGDDRVVPTAKAGGRAAVWGIPRAPMSLAI 303

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S+D+G SW   + LE            E L  E+SYP++IQ  +GS+ + +TY R +IKH
Sbjct: 304 SRDEGRSWPVRIDLELGDGFCLTNNSQEKLNREYSYPSIIQTRNGSLQVAFTYFRQRIKH 363

Query: 316 VVL 318
           V L
Sbjct: 364 VHL 366


>gi|383764448|ref|YP_005443430.1| hypothetical protein CLDAP_34930 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381384716|dbj|BAM01533.1| hypothetical protein CLDAP_34930 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 316

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/310 (31%), Positives = 150/310 (48%), Gaps = 19/310 (6%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIAD 74
           FP  S  + + H +TI E+  G  + A+FGGS E APD  +    +  +  RW +P    
Sbjct: 9   FP-RSDDYPTVHCATIAELPNGDLIAAWFGGSYETAPDQILLTARYDLRQERWSAPRPLA 67

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQL--PP 129
                 +  PV +      + LF+   QE + W   +   +RS ++G TWS  ++L   P
Sbjct: 68  VMEGKALGQPVFWVRKDGDIWLFFDAIQE-RDWRSALPFVQRSADEGRTWSAPQRLFDYP 126

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           G++  S+   +    G ++  +  E  N W S M ++ D G+SWR   PI  P  +   I
Sbjct: 127 GLMFRSR---LCFLPGRVIVPAYDE--NRWESLMMISDDEGKSWRLTAPIITPQGN---I 178

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
            P   H     L   +R+    G +  + S DGG  W   +PT L NPNSGID ++L+  
Sbjct: 179 HPCLVHLQGGKLLAYLRTGGHGGVIWRTLSFDGGECWEKPEPTALPNPNSGIDLLRLRSD 238

Query: 250 RLLLAYNTVSRGVLKVALSKDD-GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
           RLLLAYN  +     +AL+     + W     LE++   E+SYP ++Q  +  +H+ YTY
Sbjct: 239 RLLLAYNPSAERRTPLALAVASLQEDWRPVRILEQD-DHEYSYPTLLQTQNSDIHLVYTY 297

Query: 309 NRTQIKHVVL 318
            R  I+H   
Sbjct: 298 RRQHIQHACF 307


>gi|390455324|ref|ZP_10240852.1| hypothetical protein PpeoK3_15016 [Paenibacillus peoriae KCTC 3763]
          Length = 395

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 157/360 (43%), Gaps = 77/360 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  L  +F G+ EG PD+ I+L     G   W  PI   ++P     NPV
Sbjct: 36  HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWTEPIKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  ++   S ++G TW +   L   PG       +PI 
Sbjct: 96  LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C ++  E W  N   S ++++ D GR+W +     +P +SL  +   
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTEAS---VP-DSLGCVHMN 208

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L  L RS      +  SES D G TWS  +PT L N NS I   KL +G L 
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSSPQPTSLPNNNSSIQFTKLDNGHLA 267

Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
           L +N +S                                          R  + VA+SKD
Sbjct: 268 LVFNNISANESMERRASLYDEIEDEEEEAVVEPVTVSAAEERQAFWGAPRAPMTVAISKD 327

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +G +W +   +E            E L  E+SYP++ Q+ DG +HI +TY R  IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSREKLNREYSYPSIKQSKDGKIHIAFTYFRQAIKYVVI 387


>gi|375306733|ref|ZP_09772026.1| hypothetical protein WG8_0550 [Paenibacillus sp. Aloe-11]
 gi|375081120|gb|EHS59335.1| hypothetical protein WG8_0550 [Paenibacillus sp. Aloe-11]
          Length = 395

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 158/360 (43%), Gaps = 77/360 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  L  +F G+ EG PD+ I+L     G   W  P+   ++P     NPV
Sbjct: 36  HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWTEPVKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  ++   S ++G TW +   L   PG       +PI 
Sbjct: 96  LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C ++  E W  N   S ++++ D GR+W +   + +P +SL  +   
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTE---VSVP-DSLGCVHMN 208

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L  L RS      +  SES D G TWS  +PT L N NS I   KL +G L 
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSSPQPTSLPNNNSSIQFTKLDNGHLA 267

Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
           L +N +S                                          R  + VA+S+D
Sbjct: 268 LVFNNISANESMERRASLYDEIEDEEEEAVVEPVTAPAAEERQAFWGAPRAPMTVAISQD 327

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +G +W +   +E            E L  E+SYP++ Q+ DG +HI +TY R  IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSREKLNREYSYPSIKQSKDGKIHIAFTYFRQAIKYVVI 387


>gi|420156329|ref|ZP_14663172.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
 gi|394757627|gb|EJF40644.1| BNR repeat-like domain protein [Clostridium sp. MSTE9]
          Length = 387

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 102/336 (30%), Positives = 155/336 (46%), Gaps = 47/336 (13%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
            PF +CH+  ++E+  G  L  +F G+ EG  DV I       G  RW  P+    +P  
Sbjct: 31  GPFATCHSPAMLELPGGDMLCCWFAGTYEGDADVHIVCARLPKGADRWTEPVAVSGDPRR 90

Query: 80  PMWNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGI 131
              NP LF  P   +   Y        G++  +++  ++  +S++ G+TWS+ E + P  
Sbjct: 91  SEQNPSLFYGPDGAVWCMYTAQLDRVAGKDNMQFTSQIRCQKSFDGGLTWSDYETVFP-R 149

Query: 132 LGPSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            G    +PI +L+NG       LC  S E  +   S  +++ D G++WR    + +PN S
Sbjct: 150 EGSFCRQPIQVLKNGRWIFGNWLCTDSEEGLSGDPSVFQLSDDQGKTWRM---VEMPN-S 205

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P      N  L   MRS      V  SES D G TWS  K T L N NS I  ++
Sbjct: 206 RGRVHPTVVELDNGHLVAFMRSREA-DNVYRSESLDWGETWSEPKSTPLPNNNSSISAIR 264

Query: 246 LKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEE 283
            K GR+ +AYN                 R  + VALS+D G ++         +    EE
Sbjct: 265 TKSGRVAVAYNPTCTPSPVPGKAAWPGLRCPVAVALSEDGGLTFPLIRWMERGEGYIGEE 324

Query: 284 NLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
           N     ++ YP ++Q++DG +H+TY  Y R  IK+V
Sbjct: 325 NKTNNKQYEYPYLMQSTDGRLHLTYAAYTRKCIKYV 360


>gi|333898278|ref|YP_004472151.1| hypothetical protein Psefu_0070 [Pseudomonas fulva 12-X]
 gi|333113543|gb|AEF20057.1| BNR/Asp-box repeat-containing protein [Pseudomonas fulva 12-X]
          Length = 374

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 155/357 (43%), Gaps = 68/357 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L       G W  P    E+    
Sbjct: 15  PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDATRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NP+LF+ P+  L L +       + +  ++R  S + G+TW   E L   +    ++ 
Sbjct: 75  EQNPILFQAPNGPLWLIWTAQISGNQETAIVRRRLSEDGGLTWGPIETLFDEVGTFVRHP 134

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
           P +L NG     +  C +   E W  N   S + ++ D GR+W ++    +P  + +V  
Sbjct: 135 PAVLSNGDWVFPVWYCITQPGEKWIGNHDVSAVMISCDQGRTWSRHD---VPQSTGAVHM 191

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
            V    A+ TL  L RS      +  S S D G TWS   PT+L N NS I  ++L  G 
Sbjct: 192 NV-HQLADGTLLGLFRS-RWADHIYASRSTDLGRTWSVPTPTELPNNNSSIQYIRLASGE 249

Query: 251 LLLAYNTVS---------------------------------------RGVLKVALSKDD 271
           L L YN ++                                       R  + +A+S+D+
Sbjct: 250 LALVYNPINAEGIEQRRASLYDEIEDEGDERVTPTVNADGRGAVWGIPRAPMSLAISRDE 309

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           G SW   L +E            E L  EFSYP++IQA DGS+H+ +TY R +IKHV
Sbjct: 310 GRSWPLRLDIELGDGFCMTNNSQEKLNREFSYPSIIQARDGSLHVAFTYFRQKIKHV 366


>gi|374321886|ref|YP_005075015.1| hypothetical protein HPL003_10155 [Paenibacillus terrae HPL-003]
 gi|357200895|gb|AET58792.1| hypothetical protein HPL003_10155 [Paenibacillus terrae HPL-003]
          Length = 395

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 156/360 (43%), Gaps = 77/360 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  L  +F G+ EG PD+ I+L     G   W  P+   ++P     NPV
Sbjct: 36  HAANLLPLDNGDLLCVWFAGTQEGIPDISIYLSRLNKGAEEWMEPVKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  ++   S ++G TW +   L   PG       +PI 
Sbjct: 96  LFSAPDGKLWLLYTAQKSGNQDTAIIRYRISEDEGYTWGDIGTLFDTPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C ++  E W  N   S ++++ D GR+W +   + +P  SL  +   
Sbjct: 153 VLDNGDWLLPIWYCYTNPGEKWTGNKDVSAVKISSDQGRTWTE---VSVPG-SLGCVHMN 208

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L  L RS      +  SES D G TWS   PT L N NS I   KL +G L 
Sbjct: 209 IEKLDDGRLIALFRS-RWADYIYQSESVDHGRTWSVPIPTSLPNNNSSIQFTKLSNGHLA 267

Query: 253 LAYNTVS------------------------------------------RGVLKVALSKD 270
           L +N +S                                          R  + VA+SKD
Sbjct: 268 LVFNNISANESMDRRASLYDEIEDEEEETVVESAAPPAVEERQAFWGAPRAPMTVAISKD 327

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +G +W +   +E            E L  E+SYP++ Q+ DG +HI +TY R  IK+VV+
Sbjct: 328 NGLTWSEVKNIEVGDGYAMTNNSKEKLNREYSYPSIKQSRDGKIHIAFTYFRQAIKYVVI 387


>gi|298157616|gb|EFH98696.1| glycosyl hydrolase, family 43 [Pseudomonas savastanoi pv.
           savastanoi NCPPB 3335]
          Length = 374

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+ +  
Sbjct: 15  PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDASRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWSRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYRSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|71734525|ref|YP_275595.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           phaseolicola 1448A]
 gi|71555078|gb|AAZ34289.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. phaseolicola
           1448A]
          Length = 395

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+ +  
Sbjct: 36  PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 95

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 96  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDKPGTF--VR 153

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 154 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 210

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 211 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 268

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 269 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 328

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 329 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 388

Query: 318 L 318
           L
Sbjct: 389 L 389


>gi|289627728|ref|ZP_06460682.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. NCPPB 3681]
 gi|289647409|ref|ZP_06478752.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 2250]
 gi|422581538|ref|ZP_16656680.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 0893_23]
 gi|422596133|ref|ZP_16670416.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           lachrymans str. M301315]
 gi|330866387|gb|EGH01096.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 0893_23]
 gi|330986433|gb|EGH84536.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 374

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+ +  
Sbjct: 15  PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|422608475|ref|ZP_16680456.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           mori str. 301020]
 gi|330892098|gb|EGH24759.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           mori str. 301020]
          Length = 374

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+ +  
Sbjct: 15  PYAQNHAANLHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDSSRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSSSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAPRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVP 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|422681666|ref|ZP_16739934.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           tabaci str. ATCC 11528]
 gi|331011008|gb|EGH91064.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           tabaci str. ATCC 11528]
          Length = 374

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 153/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+    
Sbjct: 15  PYAQNHAANMHELADGTLLCTWFAGTQEGMADIFVLLSRRDPATGVWSEPQKMSEDATRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSIDRGRSWSAPTPTELPNNNSSIQFVKLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|416015500|ref|ZP_11563074.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. B076]
 gi|320325058|gb|EFW81127.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. B076]
          Length = 374

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L  +    G W  P    E+ +  
Sbjct: 15  PYAQNHAANLHELADGTLLCTWFAGTQEGMVDIFVLLSRRDPATGVWSEPQKMSEDSSRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPGGPLWLIWTAQISGNQETAIIRRRLSLDNGQSWGPIDTLFDKPGTF--VR 132

Query: 137 NKPILLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG  L           E W  N   S + ++ D G SW ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWLLPVWYCITGPGEKWVGNHDVSAVMISTDQGHSWTRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V     + +L  L RS      +  S S D G +WS   PT+L N NS I  VKL  
Sbjct: 190 HMNV-HQLPDGSLLGLFRS-RWADFIYSSRSTDRGRSWSAPTPTELPNNNSSIQFVKLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSAEGHAQRRASLYDEIEDEGDDRVTPGARQDGKAAVWGIPRAPMSLAISR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKHV 
Sbjct: 308 DGGHSWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|116669604|ref|YP_830537.1| hypothetical protein Arth_1043 [Arthrobacter sp. FB24]
 gi|116609713|gb|ABK02437.1| conserved hypothetical protein [Arthrobacter sp. FB24]
          Length = 414

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 155/380 (40%), Gaps = 91/380 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ ++ +  G     +FGG+ EG PD+ IW    + G  +W  P     +   
Sbjct: 31  APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSSQWSEPQQLSNDSTR 90

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPP-----GIL 132
              NP+LF  P+  L L Y   +   + +  ++R  S + G TW E E L P     G+ 
Sbjct: 91  SEQNPILFTAPNGALWLLYTAQKAGNQDTAEVRRRTSMDGGRTWGEVETLFPANETGGVF 150

Query: 133 GPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
              +  P++L +G L+        +  E W  NS  S + ++ DAG +W ++    +P  
Sbjct: 151 --VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAVMISDDAGATWTEH---VLPGS 205

Query: 185 SLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              V   IQPV    A+ TL  L RS      +  S S D G TWS   PT+L N NS I
Sbjct: 206 LGCVHMNIQPV----ADGTLLALFRS-RWADSIYESRSTDDGSTWSEPVPTELPNNNSSI 260

Query: 242 DGVKLKDGRLLLAYNT-------------------------------------------- 257
               L DGRL L YN                                             
Sbjct: 261 QFTALADGRLALVYNHSRAEASTERRLSLYDEIDDDGLAEEQGQVAEPDASAFSEDDGSR 320

Query: 258 -----VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDG 300
                  R  + +A+S+D G SW         D   L  N    L  E+SYP++ Q  DG
Sbjct: 321 KAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLSNNSRDGLNREYSYPSIHQGPDG 380

Query: 301 SVHITYTYNRTQIKHVVLLP 320
           S++I YTY R  IK V + P
Sbjct: 381 SLNIAYTYFRQAIKFVRVDP 400


>gi|440746234|ref|ZP_20925519.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae
           BRIP39023]
 gi|440371361|gb|ELQ08204.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae
           BRIP39023]
          Length = 374

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L       G W  P    E+    
Sbjct: 15  PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPVTGVWSEPRKMSEDSTRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPDGPLWLIWTAQISGNQETAIIRRRLSLDHGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG     +  C +   E W  N   S + ++ D G +W ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWVLPVWYCITGPGEKWVGNHDVSAVMISTDQGHNWSRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V    A+ +L  L RS      +  S S D G TWS   PT+L N NS I  V+L  
Sbjct: 190 HMNV-HQLADGSLLGLFRS-RWADFIYSSRSSDRGRTWSVPAPTELPNNNSSIQFVRLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSADGHAQRRGSLYDEIEDEGDERITPGPREDGKAAVWGIPRAPMSLAVSR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G +W   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKH+ 
Sbjct: 308 DGGGTWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHIH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|422641302|ref|ZP_16704726.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae Cit 7]
 gi|330953690|gb|EGH53950.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae Cit 7]
          Length = 374

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 72/361 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVP 80
           P+   HA+ + E+  G  L  +F G+ EG  D+ + L       G W  P    E+    
Sbjct: 15  PYAQNHAANLHELADGTVLCTWFAGTQEGMADIFVLLSRRDPVTGVWSEPRKMSEDSTRS 74

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSK 136
             NP+LF+ P   L L +       + +  ++R  S + G +W   + L   PG     +
Sbjct: 75  EQNPILFQAPDGPLWLIWTAQISGNQETAIIRRRLSLDHGQSWGPIDTLFDEPGTF--VR 132

Query: 137 NKPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             P++L+NG     +  C +   E W  N   S + ++ D G +W ++    +P+ + SV
Sbjct: 133 QPPVVLDNGDWVLPVWYCITGPGEKWVGNHDVSAVMISTDQGHNWSRHD---VPDSTGSV 189

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V    A+ +L  L RS      +  S S D G TWS   PT+L N NS I  V+L  
Sbjct: 190 HMNV-HQLADGSLLGLFRS-RWADFIYSSRSSDRGRTWSVPAPTELPNNNSSIQFVRLAT 247

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
             L L YN VS                                       R  + +A+S+
Sbjct: 248 DELALVYNPVSADGHAQRRASLYDEIEDEGDERITPGPREDGKAAVWGIPRAPMSLAVSR 307

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G +W   L LE            E L  EFSYP++IQA+DGS+H+ +TY R +IKH+ 
Sbjct: 308 DGGGTWPTRLDLELGDGFCLTNNSQEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHIH 367

Query: 318 L 318
           L
Sbjct: 368 L 368


>gi|359776238|ref|ZP_09279555.1| hypothetical protein ARGLB_037_02350 [Arthrobacter globiformis NBRC
           12137]
 gi|359306678|dbj|GAB13384.1| hypothetical protein ARGLB_037_02350 [Arthrobacter globiformis NBRC
           12137]
          Length = 415

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 94/402 (23%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           +  D  +K     +F +    AP    HA+ ++ +  G     +FGG+ EG PD+ IW  
Sbjct: 11  ISPDGAVKRADGADFAYLP--APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFS 68

Query: 61  TFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYN 116
           T + G  +W       ++P     NP+LF  P+  L L Y   +   + +  ++R  S +
Sbjct: 69  TLEPGSDQWSPAQQLSDDPTRSEQNPILFAAPNGALWLLYTAQKAGNQDTAEVRRRTSTD 128

Query: 117 KGVTWSEREQLPP-----GILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWM 163
            G TW   E L P     G+    +  P+LL +G L+        +  E W  NS  S +
Sbjct: 129 SGRTWGPVETLFPANETGGVF--VRQLPVLLPSGRLIIPIFRCITTPGEKWVGNSDDSAV 186

Query: 164 EVTVDAGRSWRKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
            ++ D G +W ++    +P     V   IQPV    A+ +L  L RS      +  S S 
Sbjct: 187 MISDDGGATWSEH---ILPGSLGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRST 238

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------------------------ 256
           D G TWS   PT+L N NS I    L DGRL L YN                        
Sbjct: 239 DDGTTWSEPVPTELPNNNSSIQFAALADGRLALVYNHSRAEENTERRLSLYDEIDDDGLA 298

Query: 257 --------------------------TVSRGVLKVALSKDDGDSW--------HDALTLE 282
                                        R  + +A+S+D G SW         D   L 
Sbjct: 299 AEQGQLTEPDPAAGAGTDDGGRRAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLS 358

Query: 283 EN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            N    L  E+SYP+V Q  DG+++I YTY R  IK+V + P
Sbjct: 359 NNSRDGLNREYSYPSVHQGPDGALNIAYTYFRQAIKYVRVDP 400


>gi|302877447|ref|YP_003846011.1| BNR/Asp-box repeat-containing protein [Gallionella
           capsiferriformans ES-2]
 gi|302580236|gb|ADL54247.1| BNR/Asp-box repeat protein [Gallionella capsiferriformans ES-2]
          Length = 432

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 159/390 (40%), Gaps = 84/390 (21%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           M+    +K   A   T  A+S  F   HA++ +E+  G     +F GS EGA DV I   
Sbjct: 34  MQATVPVKVTSARYETHFASSKLFTQVHAASSIELKDGRIRAFWFSGSREGAKDVAIHSA 93

Query: 61  TF--KDGRWQSPIIADEEPNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG 109
            F    G+W + +IA                + NP++ ++ S+G L  Y +   +  W+G
Sbjct: 94  VFDPATGQWSAELIAATREQTQSTLHRYIAKLGNPIVGRM-SDGRLRMYYVAVSLGGWAG 152

Query: 110 ---CMKRSYNKGVTWSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSW 162
                  S ++G TW+   +L       I    K  P L  +G L      E  + +G  
Sbjct: 153 SSITTMTSSDEGTTWTTPRRLITSPFMNISTLVKGAPFLYSDGTLGLPVYHEFISKFGEM 212

Query: 163 MEVTVDAGRSWRKYGPIYIPNESLS-----VIQPVPFHTANRTLRVLMR--SFNGIGRVC 215
           + ++ +          + +  + L+      +QPV       + +VLMR  S +G  RV 
Sbjct: 213 LHLSQEG---------VVLDKQRLAAGGQGTLQPVVLMQNEHSAKVLMRYSSASGPHRVV 263

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGD 273
              + D GL WS  + T L NP++ + GV L DGR+L   N   + R  L + LS+D G 
Sbjct: 264 AVSTQDAGLHWSAPEKTALRNPDAAVTGVTLPDGRMLAVLNDQELGRDTLSLVLSQDGGV 323

Query: 274 SWHDALTLEE---------------NLAM------------------------------- 287
           +W     LE+               N A                                
Sbjct: 324 TWRVVQLLEDQQQASSQPDEANFIKNAAQMIESSDAISTSRLEPAVESARRTVCHDGHCR 383

Query: 288 -EFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            EFSYP +IQAS G  H+ YT+NRT IKHV
Sbjct: 384 YEFSYPYLIQASGGDFHLVYTWNRTFIKHV 413


>gi|50122687|ref|YP_051854.1| hypothetical protein ECA3765 [Pectobacterium atrosepticum SCRI1043]
 gi|49613213|emb|CAG76664.1| conserved hypothetical protein [Pectobacterium atrosepticum
           SCRI1043]
          Length = 395

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 153/361 (42%), Gaps = 79/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KD-GRWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++    KD G W       E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKDSGSWSKAAKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
           +LENG     +  C     E W  N   S ++++ D G++WR+Y    +PN +  V +  
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWREYP---VPNSTGCVHMNI 209

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
            P      TL  L RS      +  S S DGG TWS    T L N NS I    LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSEPVATDLPNNNSSIQVTTLKNGHL 266

Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
            L +N +S                                          R  + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKHVV
Sbjct: 327 DGGKSWPYQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHVAFTYFRQAIKHVV 386

Query: 318 L 318
           +
Sbjct: 387 V 387


>gi|392379261|ref|YP_004986420.1| glycosyl hydrolase (modular protein) [Azospirillum brasilense
           Sp245]
 gi|356881628|emb|CCD02617.1| glycosyl hydrolase (modular protein) [Azospirillum brasilense
           Sp245]
          Length = 724

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 156/376 (41%), Gaps = 84/376 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G  L A+FGG+ EG PDV I +     G  RW++P+   ++P  
Sbjct: 342 SPCVQNHAANLALLPNGDLLCAWFGGTQEGIPDVSIHVARLPAGGDRWEAPVKLSDDPGR 401

Query: 80  PMWNPVLFKLPSNGLLLFYKI---GQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGP 134
              NPVLF  P   L L Y     G +      C + S ++G +W     L    G  G 
Sbjct: 402 SEQNPVLFPAPDGTLWLMYTAQVSGHQDTAIVRC-RVSADQGRSWGPVRTLFEAEGHTGV 460

Query: 135 SKNKPIL-LENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
              +P++ L+NG       LC G+   +W  +   S + ++ D GR+W ++    +P+ +
Sbjct: 461 FIRQPVVVLDNGDWLLPTFLCHGTPGRAWVGDHDTSAVRISSDGGRTWTEHA---VPDST 517

Query: 186 LSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             V +  VP   A+  L    RS      V  S S DGG +W+  +PT+L N NS I   
Sbjct: 518 GCVHMNIVPL--ADGRLAAFYRS-RWADHVYRSLSSDGGRSWTAPEPTELPNNNSSIQVT 574

Query: 245 KLKDGRLLLAYNTVS--------------------------------------------- 259
            L DGRL + YN  S                                             
Sbjct: 575 ALSDGRLAIVYNHASAADATGRRLGLYDDIEEAGETANPAAQPAAEPAVEPPAEGRTAFW 634

Query: 260 ---RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHI 304
              R  L +ALS D G +W     LE            E L  EFSYPA++Q  DG + I
Sbjct: 635 GAPRAPLTLALSSDGGVTWPLRRDLEVGDGYCMTNNSKEKLNREFSYPAILQTPDGRLRI 694

Query: 305 TYTYNRTQIKHVVLLP 320
            +TY R  IK+  L P
Sbjct: 695 AFTYFRQAIKYTELPP 710


>gi|302882514|ref|XP_003040166.1| hypothetical protein NECHADRAFT_44471 [Nectria haematococca mpVI
           77-13-4]
 gi|256721036|gb|EEU34453.1| hypothetical protein NECHADRAFT_44471 [Nectria haematococca mpVI
           77-13-4]
          Length = 379

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 102/359 (28%), Positives = 158/359 (44%), Gaps = 74/359 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++++     L A+FGGS EG PD+ IWL   + G   W SP     E N    NPV
Sbjct: 18  HASNLLQLPDKTLLCAWFGGSQEGLPDISIWLSRLEPGSSSWASPKKISYEENRSCQNPV 77

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGILGPSKNKPI-L 141
           LF++P +G +      Q+     G    +++S ++G+TWSE  QL PG  G    +PI +
Sbjct: 78  LFRVPDSGEIWLLHTSQDAGNQDGAYVLVRKSSDQGITWSEANQLLPGKTGIFTRQPIVI 137

Query: 142 LENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L++G     +  C G+    W  N   S +  + D G++W +      P+ S +V   + 
Sbjct: 138 LKDGTWVLPVFYCRGTPGHRWIGNDDISGVFYSKDGGKTWNEK---QFPDSSGAVHMNIV 194

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
              + ++L V +        V  S S + GL W+  +   L NPN GI   +L+ G+L +
Sbjct: 195 PPASEKSLWVALYRSRWADNVYRSTSTN-GLDWAKPEALTLPNPNRGICAARLRSGKLAI 253

Query: 254 AYNTVS--------------------------------------RGVLKVALSKDDGDSW 275
            +N  +                                      R  L V +S DDG +W
Sbjct: 254 VFNRSAASPDSLRREGLYDDITPEDDKRPNQVAVGQKSAIWGTPRKALTVGVSDDDGLTW 313

Query: 276 HDAL-------------TLEENLAMEFSYPAV-IQASDGS--VHITYTYNRTQIKHVVL 318
            + +             T +EN   E SYP++ IQ  DG    HI +T++R  IKHV +
Sbjct: 314 TERVLEDGDGFCGTNSSTGKEN--RELSYPSILIQDGDGPEVTHIAFTFHRQFIKHVRI 370


>gi|337748984|ref|YP_004643146.1| hypothetical protein KNP414_04746 [Paenibacillus mucilaginosus
           KNP414]
 gi|379721953|ref|YP_005314084.1| hypothetical protein PM3016_4157 [Paenibacillus mucilaginosus 3016]
 gi|386724696|ref|YP_006191022.1| hypothetical protein B2K_21565 [Paenibacillus mucilaginosus K02]
 gi|336300173|gb|AEI43276.1| hypothetical protein KNP414_04746 [Paenibacillus mucilaginosus
           KNP414]
 gi|378570625|gb|AFC30935.1| hypothetical protein PM3016_4157 [Paenibacillus mucilaginosus 3016]
 gi|384091821|gb|AFH63257.1| hypothetical protein B2K_21565 [Paenibacillus mucilaginosus K02]
          Length = 354

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 94/328 (28%), Positives = 149/328 (45%), Gaps = 47/328 (14%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
            HA+ ++ +D G  L  +F GS EG PD  + L     G  RW +P     +P     NP
Sbjct: 30  SHAANLLSLDNGDLLCTWFTGSGEGNPDTNVVLSRLPAGSDRWTTPEQLSSDPERSEQNP 89

Query: 85  VLFKLPSNGLLLFYKIGQ-EVQKWSGCMKR-SYNKGVTWSEREQLP--PGILGPSKNKPI 140
           VLF+ P   L L +   +   QK S  + R S ++G+TW   E +   PGI    ++ P+
Sbjct: 90  VLFQAPDGRLWLLHTSNEPHNQKTSRIVTRLSEDRGLTWGPAEVITDRPGIF--LRHPPV 147

Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRT 200
           +  +G  +  +     +   S +  + D GR+W +     +P  S+  +Q      ++ +
Sbjct: 148 VTGSGDWVLPAYYCRLDGHYSVVLRSTDQGRTWTE---TEVPG-SIHRVQMNIVPRSDGS 203

Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS- 259
           L  + RS     R+ MSES D G TWS    + L N NS     +L DGR+ L YN  + 
Sbjct: 204 LLAMFRSRQA-DRIYMSESVDAGRTWSPPVKSPLANNNSSTQAARLTDGRIALIYNDSTM 262

Query: 260 -------------------RGVLKVALSKDDGDSWHDALTLEENLAME----------FS 290
                              R  L +++S+D+G +W     L  N+ M           +S
Sbjct: 263 ERDQFRWVQRKGETRKKPLRTPLTLSISEDEGRTW----PLTRNVQMADLEYKESEVGYS 318

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YP++I   DG +HI ++Y R  IK+V +
Sbjct: 319 YPSIIAPGDGRLHIAFSYLRKGIKYVCV 346


>gi|405120802|gb|AFR95572.1| BNR/Asp-box repeat family protein [Cryptococcus neoformans var.
           grubii H99]
          Length = 354

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 94/307 (30%), Positives = 136/307 (44%), Gaps = 57/307 (18%)

Query: 64  DGRWQSPIIADEEPNVPMWNPVLFKLPSNG----LLLFYKIGQEVQKWSGCMKRSYNKGV 119
           DGRW  P +     ++  WNPV F     G    L +FYK G  +  W   +  + + G 
Sbjct: 39  DGRWTPPRVIAGRGDIVHWNPVSFLPDPKGAPGHLYVFYKTGTPIPTWKTYVIETVDGGT 98

Query: 120 TWSEREQLPPGI----LGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEV----- 165
           TW+E  +L PG      GP KN PI+L NG  L G+S E         W ++ ++     
Sbjct: 99  TWTEPRELVPGDHSGGRGPQKNPPIVLSNGDWLSGASKEVTLGGGKGLWDAFADIAPSPK 158

Query: 166 ---TVDAGRSWRKYGPIYIPNE---------SLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
              T   G  W K   I +P +            +IQP  + +A +   V M   + IGR
Sbjct: 159 EGDTFKQGERWVKAEFIKLPADRGVEGGSFPGEGIIQPSLWESAEKPGHVHMMCRSSIGR 218

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALS 268
           +  ++S D G TW+    T L N NSG    +L+DGRL++A N V      R  L + +S
Sbjct: 219 IVRADSEDYGRTWTPGYATDLPNNNSGQCVTRLRDGRLVIAVNDVYTNWGPRTPLVLKVS 278

Query: 269 KDDGDSWHDALTLEEN---------LAM----------EFSYPAVIQASDGS---VHITY 306
            DDG +W    TLE+          +A+          EFSYP V   ++     V ++Y
Sbjct: 279 SDDGKTWTSWCTLEDQAPPASFQKVIALETGIVNDGKSEFSYPTVTPTAETDEIGVWVSY 338

Query: 307 TYNRTQI 313
           T+ R  I
Sbjct: 339 TWQRRGI 345


>gi|253689902|ref|YP_003019092.1| hypothetical protein PC1_3540 [Pectobacterium carotovorum subsp.
           carotovorum PC1]
 gi|251756480|gb|ACT14556.1| conserved hypothetical protein [Pectobacterium carotovorum subsp.
           carotovorum PC1]
          Length = 395

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W   +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDNWSKAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G +W E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDFGASWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
           +L NG     +  C     E W  N+  S ++++ D G++WR+Y    +PN +  V +  
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDQGKTWREYP---VPNSTGCVHMNI 209

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
            P      TL  L RS      +  S S DGG TWS   PT L N NS I    LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTDLPNNNSSIQVTTLKNGHL 266

Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
            L +N +S                                          R  + +A+S+
Sbjct: 267 ALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGSRSAFWGAPRAPMTLAISE 326

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKHVV
Sbjct: 327 DGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKHVV 386

Query: 318 L 318
           +
Sbjct: 387 V 387


>gi|255954997|ref|XP_002568251.1| Pc21g12200 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211589962|emb|CAP96117.1| Pc21g12200 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 392

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 147/357 (41%), Gaps = 73/357 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
           CHAS ++ +  G  L  +FGG+ EG PD+ I+L     G   W   +    +      NP
Sbjct: 37  CHASNLLRLPNGDLLCTWFGGNMEGKPDISIYLSRLPSGGQTWGEAVKVTHDDGRSEQNP 96

Query: 85  VLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI 140
           VLF+ PS  L L Y   Q   + S  +KR  S + G TWS    L   PG     +   I
Sbjct: 97  VLFRHPSGELWLLYTSQQGGNQDSAVVKRVISSDNGATWSGPTILFDDPGTF--IRQPVI 154

Query: 141 LLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +LENG+ +       G     W  N   S +  + D GR+W +   + +P ES   +   
Sbjct: 155 ILENGVFVVPTFKCRGEPGAKWIGNDDISVIRTSADQGRTWSE---VAVP-ESTGCVHME 210

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N +   L RS      + +S S D GL WS  +PT L NPN+GI    L  GR++
Sbjct: 211 IQRLKNGSYLALYRS-RWADYIHLSTSSD-GLDWSAPQPTSLPNPNAGICFDVLSSGRVV 268

Query: 253 LAYNTVS----------------------------------------RGVLKVALSKDDG 272
             YN  S                                        R  L VA S DDG
Sbjct: 269 AVYNHSSRKNAEARREGLYDEITEEGQDTRPNQKDRTDGKEAFWGAPRAPLCVAWSDDDG 328

Query: 273 DSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            SW        D   L    E+    E SYP+++   DG++HI +T+ R  IK+V +
Sbjct: 329 KSWQHRTLEEGDGFCLTNNSEQKRNRELSYPSMVVGEDGTIHIAFTFWRQTIKYVQI 385


>gi|290509605|ref|ZP_06548976.1| glycosyl hydrolase, BNR repeat [Klebsiella sp. 1_1_55]
 gi|289778999|gb|EFD86996.1| glycosyl hydrolase, BNR repeat [Klebsiella sp. 1_1_55]
          Length = 388

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 156/356 (43%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  +  +FGGS EG  D+ IW      G  RW   +   ++P+    NPV
Sbjct: 33  HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P N L L +       + +  ++   S++ G +W   + L   PG       +PI 
Sbjct: 93  LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ +G     +  C +   E W  N+  S ++++ D G+SWR    + +P ESL  +  +
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMI 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L    RS      +  S+S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATTLDNGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G+SW 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    E+ L  EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380


>gi|221065827|ref|ZP_03541932.1| BNR repeat-containing glycosyl hydrolase [Comamonas testosteroni
           KF-1]
 gi|220710850|gb|EED66218.1| BNR repeat-containing glycosyl hydrolase [Comamonas testosteroni
           KF-1]
          Length = 384

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 149/362 (41%), Gaps = 67/362 (18%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ ++ +D G     +FGGS EG  D+ +++     G  +W  PI    +   
Sbjct: 22  SPCVQAHAANLMVLDDGTLGCVWFGGSMEGRSDISVFMSRLDPGAAQWSEPIQLSHDAER 81

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NPVLF  P   L L +   Q   + +  ++R  S ++G++W   E L     G    
Sbjct: 82  SEQNPVLFPAPGGELWLLHTAQQSGHQNTSVVRRRLSRDQGLSWEPTETLADAPAGTFVR 141

Query: 138 KPILLENG----LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
           +PI +       L +     +   +W      S +  + D GRSW++   I +P  SL  
Sbjct: 142 QPIHVHTDGSWLLPVFHCRAQPGQAWDGSHDDSGVLRSADQGRSWQR---IAVPG-SLGC 197

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           +       ++  L    RS      V  S S D GL+W   + T+L N NS I  ++L D
Sbjct: 198 VHMNIVQASDGGLLAFFRS-RWADHVYRSRSDDCGLSWQEPEATELPNNNSSIQALRLAD 256

Query: 249 GRLLLAYNTVS-------------------------------------RGVLKVALSKDD 271
           GRL + +N  S                                     R  + +ALS DD
Sbjct: 257 GRLAMIFNASSAADATQRRESLYDELGDSSATPAKAAAGERRAFWGAPRAPMTLALSADD 316

Query: 272 ------------GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
                       GD W  +   E     E+SYP++ Q++DG++H+ YT  R  I+HV + 
Sbjct: 317 GLSWPWQRNLEVGDGWCMSNDSEHGRNREYSYPSIRQSADGALHLAYTVFRQHIRHVRVQ 376

Query: 320 PN 321
           P+
Sbjct: 377 PD 378


>gi|365856121|ref|ZP_09396145.1| BNR/Asp-box repeat protein [Acetobacteraceae bacterium AT-5844]
 gi|363718349|gb|EHM01690.1| BNR/Asp-box repeat protein [Acetobacteraceae bacterium AT-5844]
          Length = 402

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 153/358 (42%), Gaps = 72/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G     +FGG+ EG PD+  +    + G  RW +P+   ++      NP+
Sbjct: 40  HAANLMPLPDGSLGCVWFGGTQEGVPDISAYFSRLEKGGTRWSAPVRLSDDDTRSEQNPL 99

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL---PPGILGPSKNKPI 140
           LF  PS  L L +       + +  +++  S + G +W   E L    PG     +   +
Sbjct: 100 LFPAPSGELWLIWTAQISGNQDTAIVRKRISRDNGRSWGPIETLFGPRPGGGTFMRQPVV 159

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +LENG     + +C S+  + W  +   S + ++ D GRSW +   + +P ESL  +   
Sbjct: 160 VLENGDWLLPVWICASTPGKKWVGDDDTSAVMISSDRGRSWTE---VAVP-ESLGCVHMS 215

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL    RS      V +S S D G +WS  +PT+L N NS I   +L +G L 
Sbjct: 216 IVDLRNGTLAAFYRS-RWADHVYVSRSTDNGRSWSAPQPTELPNNNSSIQVTRLANGHLA 274

Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
           + +N                                         V R  + VA+S+DDG
Sbjct: 275 MVFNESSAEDATERRLSLYDDIEDDTDDGKVVAPIDTSRRTTFWGVPRAPMTVAISEDDG 334

Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            SW     LE            E L  EFSYP+++Q  +G +HI YT  R  IK+V +
Sbjct: 335 QSWPFRRNLETGDGYCMTNNSREGLNREFSYPSIVQGPEGDLHIAYTVYRQAIKYVTV 392


>gi|444307837|ref|ZP_21143550.1| hypothetical protein G205_22151 [Arthrobacter sp. SJCon]
 gi|443479848|gb|ELT42830.1| hypothetical protein G205_22151 [Arthrobacter sp. SJCon]
          Length = 413

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 107/381 (28%), Positives = 155/381 (40%), Gaps = 91/381 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ ++ +  G     +FGG+ EG PD+ IW    + G  +W SP    ++   
Sbjct: 30  APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSSQWSSPEQLSDDSTR 89

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPP-----GIL 132
              NP+LF  P   L L Y   +   + +  ++R  S + G TW E E L P     G+ 
Sbjct: 90  SEQNPILFTAPDKSLWLLYTAQKAGNQDTAEVRRRISTDSGRTWGEVETLFPANETGGVF 149

Query: 133 GPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
              +  P++L +G L+        +  E W  NS  S + ++ D G +W ++    +P  
Sbjct: 150 --VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAVMISEDGGATWTEH---VLPGS 204

Query: 185 SLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              V   IQPV    A+ +L  L RS      +  S S D G TWS   PT+L N NS I
Sbjct: 205 LGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRSTDDGSTWSEPVPTELPNNNSSI 259

Query: 242 DGVKLKDGRLLLAYNT-------------------------------------------- 257
               L DGRL L YN                                             
Sbjct: 260 QFTALADGRLALVYNHSRAGEGTERRLSLYDEIDDDGLADEQGQVAEPDAAAFSEDDGVK 319

Query: 258 -----VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDG 300
                  R  + +A+S+D G +W         D   L  N    L  E+SYP++ Q  DG
Sbjct: 320 RAFWGTPRSPMTLAISEDSGRTWPIRRNLDVGDGYCLSNNSRDGLNREYSYPSIHQGPDG 379

Query: 301 SVHITYTYNRTQIKHVVLLPN 321
           +++I YTY R  IK V + P 
Sbjct: 380 ALNIAYTYFRQAIKFVRVDPQ 400


>gi|302540315|ref|ZP_07292657.1| BNR/Asp-box repeat protein [Streptomyces hygroscopicus ATCC 53653]
 gi|302457933|gb|EFL21026.1| BNR/Asp-box repeat protein [Streptomyces himastatinicus ATCC 53653]
          Length = 401

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 156/373 (41%), Gaps = 81/373 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +FGG+ EG PD+ +W      G   W  P+   ++P  
Sbjct: 32  APAVQNHAANLTVLPGGDLGCVWFGGTQEGVPDISVWFSRLAPGADTWSEPVRLSDDPAR 91

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGILGPSK- 136
              NP+LF  P+  L L Y   Q   + +    ++ S + G TW     L P   G    
Sbjct: 92  SEQNPLLFPTPAGELWLLYTAQQAGNQDTAEVRVRVSGDNGDTWDAPRTLFPATGGGGVF 151

Query: 137 -NKPI-LLENGLLL-----CGSSVES-W--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +PI +L++G  L     C S+  + W  +   S + V+ D GR+WR+     +P  + 
Sbjct: 152 IRQPIAVLDSGRWLLPVFHCVSTPGAKWVGDHDTSAVMVSDDEGRTWRER---PVPGSTG 208

Query: 187 SV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
            V   + P+P    + TL  L RS      V  S S DGG +WS  +PT+L N NS +  
Sbjct: 209 CVHMNVHPLP----DGTLLALFRS-RWADAVYRSHSTDGGESWSEPEPTELPNNNSSVQY 263

Query: 244 VKLKDGRLLLAYNTVS-------------------------------------------R 260
           V L DGRL L YN  S                                           R
Sbjct: 264 VPLGDGRLALVYNHSSRADATARRVSLYDEIDDDGEAGDAPAAAPSQREDAPGAFWGAPR 323

Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
             + +ALS D G +W     L+            + L  E SYP + Q  DG++HI YTY
Sbjct: 324 APMTLALSADGGRTWPVRRDLDVGDGHCLTNNSRDRLNRELSYPTIRQTPDGTLHIAYTY 383

Query: 309 NRTQIKHVVLLPN 321
           +R  IK+V + P+
Sbjct: 384 HRQAIKYVRVSPD 396


>gi|417305704|ref|ZP_12092654.1| conserved hypothetical protein, secreted, partial [Rhodopirellula
           baltica WH47]
 gi|327538010|gb|EGF24704.1| conserved hypothetical protein, secreted [Rhodopirellula baltica
           WH47]
          Length = 239

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 99/202 (49%), Gaps = 17/202 (8%)

Query: 5   CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
            G   LV+    F     P K  HASTIVE   G  + A+F G+ E  PDV I +   ++
Sbjct: 28  AGEGALVSSRQIFDLADKPTKESHASTIVETPNG-LVAAWFAGTRERDPDVGIRVSRHEN 86

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W   +     +       P WNPVLF+     L+LFYK+G   ++W G +  S + G 
Sbjct: 87  GQWTDSVEVVSGVQSSMLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWGMLTTSQDGGK 146

Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE-----SWNSWGSWMEVTVDA 169
           TWS   +L        +LGP KNKP+ L +G +LC SS E       + W    EVT D 
Sbjct: 147 TWSWPTKLGEAHTIGHLLGPVKNKPVELADGTILCPSSTEIEYADGSSHWRVHFEVTKDL 206

Query: 170 GRSWRKYGPIYIPNESLSVIQP 191
           G++W   GPI    E+   IQP
Sbjct: 207 GKTWEVIGPIQT-GETFHAIQP 227


>gi|407978690|ref|ZP_11159518.1| hypothetical protein BA1_05802 [Bacillus sp. HYC-10]
 gi|407414721|gb|EKF36351.1| hypothetical protein BA1_05802 [Bacillus sp. HYC-10]
          Length = 391

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 165/380 (43%), Gaps = 75/380 (19%)

Query: 7   IKGLVAEEFTFPANSAPF--KSC---HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQT 61
           I G++ +    P     F   +C   HA+ ++E+D G  L  +FGG+ EG PD+ I++  
Sbjct: 10  IDGILRQNKQDPVRKEAFLPTNCVQNHAANLIELDNGDVLCVWFGGTQEGIPDISIYMSR 69

Query: 62  FKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNK 117
            K G  +W   +    +      NPVLF+  S  L L Y       + +  ++   S ++
Sbjct: 70  LKQGSDQWTEAVKLSNDSTRSEQNPVLFQEKSGRLWLLYTAQLSGNQDTAIVRYRVSEDR 129

Query: 118 GVTWSEREQL--PPGILGPSKNKPILLENG-----LLLCGS--SVE-SWNSWGSWMEVTV 167
           G TW E   L   PG     +   ++L+N      +  C +   V+ + N   S ++++ 
Sbjct: 130 GETWGEIGTLFDQPGTF--IRQPIVVLDNDDWLLPVFYCKTIPGVKWTGNRDVSAVKISS 187

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W +   + +PN +  V   +    A+ TL  L RS      + MS S D G TWS
Sbjct: 188 DQGKTWEE---VIVPNSTGCVHMNIG-KCADGTLIALFRS-RFADSIYMSRSVDHGRTWS 242

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------------------------- 259
             K T+L N NS I    L++G L L YN ++                            
Sbjct: 243 EPKATELPNNNSSIQFTVLQNGHLALVYNHINADDQTERRASLYDEIEDEGDTRTAVDTE 302

Query: 260 ---------RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQAS 298
                    R  + +A+SKDDG +W     LE            + L  E+SYP++ +  
Sbjct: 303 ARPAFWGTPRAPMTLAISKDDGATWPVRRNLEIGDGYAMTNNSKDKLNREYSYPSITEGK 362

Query: 299 DGSVHITYTYNRTQIKHVVL 318
           DG +HI +T+ R  IK+V +
Sbjct: 363 DGKLHIAFTFYRQAIKYVCV 382


>gi|321261720|ref|XP_003195579.1| hypothetical protein CGB_H0050C [Cryptococcus gattii WM276]
 gi|317462053|gb|ADV23792.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 394

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 158/366 (43%), Gaps = 65/366 (17%)

Query: 10  LVAEEFTFPANS-APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
           LV  EF    +  AP   C   T + V  G     +FGGS EGA D +IW     DG+W 
Sbjct: 23  LVTREFVSKDDERAPSIHC---TNIAVAGGVRYSTWFGGSKEGATDNQIWFSKNADGKWT 79

Query: 69  SP--IIADEE-PNVPMWNPVLF---KLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
            P  I  + E  +V  WNPVLF   +     + +F+K    +  W      S + G TWS
Sbjct: 80  EPRAIAGNREGEDVVYWNPVLFIPDRSQPKTIHVFFKKHTPIPVWVTFWMDSDDGGDTWS 139

Query: 123 EREQLPPGI-----LGPSKNKPILLENGLLLCGSSVESWNSWGS------WMEVTVD--- 168
           E  +L PG       GP KN PI+L NG  L   S E  N  GS      W +V +    
Sbjct: 140 EPRELVPGPDGARGRGPQKNPPIVLSNGDWLSAGSYEVTNPPGSGAAGDSWADVALKPGP 199

Query: 169 -----AGRSWRKYGPIYIPNE---------SLSVIQPVPFHTANRTLRVLMRSFNGIGRV 214
                 G  W +   I +P++            VIQP  + + ++     M   + +G V
Sbjct: 200 GDDFKQGDKWVRSKLIELPSDRGKSDGGFPGEGVIQPGIWESKDQPGHCHMMMRSSVGCV 259

Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSK 269
             ++S D G TW  A  T L N NSG+    L+DGR++ A N  +     R  L +A+S+
Sbjct: 260 IQADSVDYGRTWGPAFRTSLPNNNSGLCITTLRDGRVVWAGNYQTQNWGPRTPLCLAISE 319

Query: 270 DDGDSWHDALTLEE-------------------NLAMEFSYPAVI---QASDGSVHITYT 307
           DDG +W    TLE+                   +   EFSYP +I      +  V +++T
Sbjct: 320 DDGKTWKLWATLEDAPPPEDFKRVIALETGIVNDGRSEFSYPCLIPTENDDEDGVWMSWT 379

Query: 308 YNRTQI 313
           + R  I
Sbjct: 380 WQRRGI 385


>gi|301060837|ref|ZP_07201651.1| BNR/Asp-box repeat protein [delta proteobacterium NaphS2]
 gi|300445086|gb|EFK09037.1| BNR/Asp-box repeat protein [delta proteobacterium NaphS2]
          Length = 398

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 152/345 (44%), Gaps = 48/345 (13%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG---R 66
            +  +F  P  +      H  +I E+  G    A++GG+ EGA DV I+  T   G    
Sbjct: 51  FIKHDFINPGTAGAM--VHVGSICEMTDGQLAAAWYGGTREGAKDVAIFFSTKPPGTNTS 108

Query: 67  WQSP-IIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRS 114
           W  P +I D              + NP+LF  P   L L Y +   V  WSG    +K S
Sbjct: 109 WSPPKVIVDRAKATRELKRYIRKVGNPILFSGPGKQLWLIY-VSIAVGGWSGSSINVKTS 167

Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENG---LLLCGS-------SVESWNSWGS--- 161
            + G  W+E ++L    L P  N   L++     L+  G+       +V  ++ +     
Sbjct: 168 RDGGERWTESQRL---TLSPFFNISELVKGRPLPLITMGARMNQDRFAVPIYHEFLGNFP 224

Query: 162 ---WMEVTVDAGRSWRKYGPIYIPNESL---SVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
              W+    D+GR       +Y  +      S IQP              RS +   RV 
Sbjct: 225 EILWISFCKDSGRV------VYEKSRMAGGKSFIQPSIAALNTHFATAFYRSVSTDKRVA 278

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGD 273
            + + D G  W+      L NP+S +D + L+   +LLA+N    SR ++++A SKD G 
Sbjct: 279 AATTEDCGQNWTVPDYLDLPNPDSAVDALPLQRNCILLAFNDSRNSREIMQLAFSKDGGK 338

Query: 274 SWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +W    TLE +   EFSYP +I+  +G +H+ YT+ R +IKH+V 
Sbjct: 339 NWTRIHTLENSPNEEFSYPYMIRGRNGLIHVVYTWKRKRIKHLVF 383


>gi|291521032|emb|CBK79325.1| Predicted neuraminidase (sialidase) [Coprococcus catus GD/7]
          Length = 386

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/334 (27%), Positives = 153/334 (45%), Gaps = 47/334 (14%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
           + + HA  ++E++ G  L A+F GS EG+ D+ I    +++G   W  P+    +     
Sbjct: 33  YSTAHAPALLELENGDMLCAWFAGSFEGSKDISIVCARWENGSDHWSEPVQVSYDSERSE 92

Query: 82  WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
            NP LFK P   +   Y        G++  +++  ++  +S++ G TW E E + P   G
Sbjct: 93  QNPSLFKGPDGAVWCMYTAQLDRMEGKDNMQFTSIIRCQKSFDGGKTWGEAEVIFPE-EG 151

Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
               +PI +L NG  +     C  SV       +   ++ D GR+W+K   + +P  + +
Sbjct: 152 SFCRQPIQVLSNGRWIFGNWICTDSVNGLEGDPTAFRISDDEGRTWKK---VDMPESNGA 208

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           V   V      + +  L   F     + +SES D G TW+   PT L N NS I  +KL 
Sbjct: 209 VHANVVELEGGKLVCFLRSRF--ADNIYISESKDWGDTWTKPVPTVLPNNNSSISALKLA 266

Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
            GR+ +AYN                 R  + VALS+D G +W         +  T +EN 
Sbjct: 267 SGRIAIAYNPTHAPHPVYGKVAWPGLRCPVAVALSEDGGKTWPMIRVMERGEGFTGDENT 326

Query: 286 A--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
               ++ YP ++Q  DG +H+ + Y NR  IK++
Sbjct: 327 TNNKQYEYPYLMQGKDGRLHLAFAYKNRIGIKYM 360


>gi|261822947|ref|YP_003261053.1| hypothetical protein Pecwa_3711 [Pectobacterium wasabiae WPP163]
 gi|261606960|gb|ACX89446.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163]
 gi|385873395|gb|AFI91915.1| BNR/Asp-box repeat protein [Pectobacterium sp. SCC3193]
          Length = 395

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W   +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSKAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
           +LENG     +  C     E W  N+  S ++++ D G++WR+Y    +PN +  V +  
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDRGKTWREYS---VPNSTGCVHMNI 209

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
            P      TL  L RS      +  S S DGG TWS    T L N NS I    L++G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLQNGHL 266

Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
            L +N +S                                          R  + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKHVV
Sbjct: 327 DGGKSWPWQRNVEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHLAFTYFRQAIKHVV 386

Query: 318 L 318
           +
Sbjct: 387 V 387


>gi|403059973|ref|YP_006648190.1| hypothetical protein PCC21_035340 [Pectobacterium carotovorum
           subsp. carotovorum PCC21]
 gi|402807299|gb|AFR04937.1| hypothetical protein PCC21_035340 [Pectobacterium carotovorum
           subsp. carotovorum PCC21]
          Length = 395

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 83/363 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W   +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSQAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
           +L NG     +  C     E W  N   S ++++ D G++W++Y    +PN +  V   I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
            P+     + TL  L RS      +  S S DGG TWS   PT+L N NS I    L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTELPNNNSSIQVTTLENG 264

Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
            L L +N +S                                          R  + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGGRSAFWGAPRAPMTLAI 324

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S+D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384

Query: 316 VVL 318
           VV+
Sbjct: 385 VVV 387


>gi|397168581|ref|ZP_10492019.1| BNR/Asp-box repeat family protein [Enterobacter radicincitans DSM
           16656]
 gi|396090116|gb|EJI87688.1| BNR/Asp-box repeat family protein [Enterobacter radicincitans DSM
           16656]
          Length = 385

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 153/363 (42%), Gaps = 72/363 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ I+ +  G  L  +FGGS EG  D+ ++      G  RW   +   ++P  
Sbjct: 26  SPCPQNHAANILPLPDGTLLCVWFGGSQEGVADISVYGSRLAPGAQRWSDAVKLSDDPTR 85

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPS 135
              NPVLF  P N L L +       + +  +  ++S + G +W E   L   PG     
Sbjct: 86  SEQNPVLFLAPDNTLWLLWTAQISGNQDTAIVRYRQSADFGASWGEIGTLLDKPGTF--- 142

Query: 136 KNKPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +PI +LENG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL
Sbjct: 143 IRQPITVLENGNWLLPVFYCRTQPGEKWVGNDDISAVKISADRGKSWRD---VDVP-QSL 198

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
             +          TL  L RS      +  S S DGG +WS  +PT L N NS I    L
Sbjct: 199 GCVHMNITALPGGTLVALFRS-RWADNIYYSHSTDGGESWSVPEPTVLPNNNSSIQVTTL 257

Query: 247 KDGRLLLAYNTVS-------------------------------------RGVLKVALSK 269
            DG L L +N +S                                     R  + VA+S 
Sbjct: 258 TDGTLALVFNHMSAAGATERRASLYDEIDDGDGRKEPTVTSGRSAFWGAPRAPMTVAISP 317

Query: 270 DDGDSWHDALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW     L            +E L  EFSYP++ Q +DG +HI YT+ R  IK+V 
Sbjct: 318 DGGKSWPWQRNLDEGDGYCMTNNSQEKLNREFSYPSIKQGADGDLHIAYTWFRQAIKYVR 377

Query: 318 LLP 320
           + P
Sbjct: 378 VSP 380


>gi|424933506|ref|ZP_18351878.1| Glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
           subsp. pneumoniae KpQ3]
 gi|407807693|gb|EKF78944.1| Glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
           subsp. pneumoniae KpQ3]
          Length = 388

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  +  +FGGS EG  D+ IW      G  RW   +   ++P+    NPV
Sbjct: 33  HAANLLPLDDGSLMCVWFGGSLEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P N L L +       + +  ++   S++ G +W   + L   PG       +PI 
Sbjct: 93  LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ +G     +  C +   E W  N+  S ++++ D G+SWR    + +P ESL  +   
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L    RS      +  S+S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G+SW 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    E+ L  EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380


>gi|288935569|ref|YP_003439628.1| glycoside hydrolase family protein [Klebsiella variicola At-22]
 gi|288890278|gb|ADC58596.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           variicola At-22]
          Length = 388

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 155/357 (43%), Gaps = 71/357 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  +  +FGGS EG  D+ IW      G  RW   +   ++P+    NPV
Sbjct: 33  HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P N L L +       + +  ++   S++ G +W   + L   PG       +PI 
Sbjct: 93  LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ +G     +  C +   E W  N+  S ++++ D G+SWR    + +P ESL  +   
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L    RS      +  S+S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATTLDNGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G+SW 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
                   D   +    E+ L  EFSYP++ Q +DG++HI YT+ R  IK+V + P 
Sbjct: 325 WLRHLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSPQ 381


>gi|227115492|ref|ZP_03829148.1| hypothetical protein PcarbP_21165 [Pectobacterium carotovorum
           subsp. brasiliensis PBR1692]
          Length = 395

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 83/363 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W   +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSQAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDFGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
           +L NG     +  C     E W  N   S ++++ D G++W++Y    +PN +  V   I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
            P+     + TL  L RS      +  S S DGG TWS   PT+L N NS I    L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVPTELPNNNSSIQVTTLENG 264

Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
            L L +N +S                                          R  + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASNAKMPEVQAGGRSAFWGAPRAPMTLAI 324

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S+D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384

Query: 316 VVL 318
           VV+
Sbjct: 385 VVV 387


>gi|330006452|ref|ZP_08305616.1| BNR/Asp-box repeat protein [Klebsiella sp. MS 92-3]
 gi|386034753|ref|YP_005954666.1| glycosyl hydrolase family protein [Klebsiella pneumoniae KCTC 2242]
 gi|424830554|ref|ZP_18255282.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
           subsp. pneumoniae Ecl8]
 gi|425091448|ref|ZP_18494533.1| hypothetical protein HMPREF1308_01708 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
 gi|328535850|gb|EGF62282.1| BNR/Asp-box repeat protein [Klebsiella sp. MS 92-3]
 gi|339761881|gb|AEJ98101.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae KCTC
           2242]
 gi|405612507|gb|EKB85258.1| hypothetical protein HMPREF1308_01708 [Klebsiella pneumoniae subsp.
           pneumoniae WGLW5]
 gi|414707981|emb|CCN29685.1| glycosyl hydrolase, BNR repeat protein [Klebsiella pneumoniae
           subsp. pneumoniae Ecl8]
          Length = 388

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  +  +FGGS EG  D+ IW      G  RW   +   ++P+    NPV
Sbjct: 33  HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLSDDPDRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P N L L +       + +  ++   S++ G +W   + L   PG       +PI 
Sbjct: 93  LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ +G     +  C +   E W  N+  S ++++ D G+SWR    + +P ESL  +   
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L    RS      +  S+S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G+SW 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    E+ L  EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380


>gi|354723546|ref|ZP_09037761.1| hypothetical protein EmorL2_11843 [Enterobacter mori LMG 25706]
          Length = 388

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 154/358 (43%), Gaps = 74/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +   ++      NPV
Sbjct: 33  HAANLLPLPDGALMCVWFGGTQEGIADISVWGSRLTQGSQQWSDAVKLSDDATRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +       + +  +  ++S + G TW E + L   PG     +   ++
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSLDLGQTWGEIDTLLDKPGTF--IRQPIVV 150

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP-- 191
           LENG     +  C +   E W  N   S ++++ D G++WR    + +P ESL  +    
Sbjct: 151 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDRGKTWRD---VVVP-ESLGCVHMNI 206

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
            P H  N TL  L RS  G   +  S+S DGG +WS  + T L N NS I    L  G L
Sbjct: 207 TPLH--NGTLVALFRSRWG-DNIYYSQSTDGGESWSVPETTALPNNNSSIQVTTLSSGEL 263

Query: 252 LLAYNTVS-------------------------------------RGVLKVALSKDDGDS 274
            L +N +S                                     R  + VA+S D G S
Sbjct: 264 ALVFNHMSAAGALERRASLYDEIDDGDDRKEPVVTRGRAAFWGAPRAPMTVAISPDGGKS 323

Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           W     L+E             L  EFSYP++ Q S+G++HI YTY R  IK+V + P
Sbjct: 324 WPWQRNLDEGDGYCMTNNSMKKLNREFSYPSIKQGSEGNIHIAYTYFRQAIKYVRVTP 381


>gi|421082558|ref|ZP_15543441.1| BNR/Asp-box repeat family protein [Pectobacterium wasabiae CFBP
           3304]
 gi|401702795|gb|EJS93035.1| BNR/Asp-box repeat family protein [Pectobacterium wasabiae CFBP
           3304]
          Length = 395

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 153/361 (42%), Gaps = 79/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W   +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSKAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P + L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDDVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQP 191
           +LENG     +  C     E W  N+  S ++++ D G++WR+Y    +PN +  V +  
Sbjct: 153 VLENGDWLLPVFYCRVQPGEKWVGNNDDSAVKISSDRGKTWREYP---VPNSTGCVHMNI 209

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
            P      TL  L RS      +  S S DGG TWS    T L N NS I    LK+G L
Sbjct: 210 TPLQDG--TLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLKNGHL 266

Query: 252 LLAYNTVS------------------------------------------RGVLKVALSK 269
            L +N +S                                          R  + +A+S+
Sbjct: 267 ALVFNHMSAADATDRRLSLYDEIEDEEDKASGAKMPEVQAGDRSAFWGAPRAPMTLAISE 326

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKHVV
Sbjct: 327 DGGKSWPWQRNVEVGDGYCMTNNSTEKLNREFSYPSVKQAPDGKLHLAFTYFRQAIKHVV 386

Query: 318 L 318
           +
Sbjct: 387 V 387


>gi|386335766|ref|YP_006031936.1| hypothetical protein RSPO_m00762 [Ralstonia solanacearum Po82]
 gi|334198216|gb|AEG71400.1| conserved hypothetical protein [Ralstonia solanacearum Po82]
          Length = 392

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 151/360 (41%), Gaps = 70/360 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L A+FGG+ EG PDV I+L     G   WQ  +   ++P   
Sbjct: 32  PCVQNHAANLHALPNGDLLCAWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P   L L Y       + +  ++R  S ++G +W   E L        +  
Sbjct: 92  EQNPVLFTTPDGALWLIYTAQLSGHQNTAIVRRRISTDRGRSWGPIETLFDQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + LC +   E W  N   S + ++ D GR+W    P  +P +S+  + 
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHTVP-DSVGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  L RS      V  S S D G  WS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRD-GRAWSAPQPTALPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTRGTAVWGAPRAPLSLAISED 325

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W     LE            +    E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385


>gi|325962498|ref|YP_004240404.1| neuraminidase (sialidase) [Arthrobacter phenanthrenivorans Sphe3]
 gi|323468585|gb|ADX72270.1| putative neuraminidase (sialidase) [Arthrobacter phenanthrenivorans
           Sphe3]
          Length = 419

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 161/400 (40%), Gaps = 92/400 (23%)

Query: 1   MKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ 60
           +  D  +K     +F +    AP    HA+ ++ +  G     +FGG+ EG PD+ IW  
Sbjct: 12  ITPDGAVKRADGADFAYLP--APTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFS 69

Query: 61  TFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYN 116
             + G  +W  P    ++      NP+LF      L L Y   +   + +  ++R  S +
Sbjct: 70  ALEPGSSQWSEPQQLSDDSTRSEQNPILFTNTDGALWLLYTAQKAGNQDTAEVRRRISTD 129

Query: 117 KGVTWSEREQLPP-----GILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWM 163
            G TW E E L P     G+    +  P++L +G L+        +  E W  NS  S +
Sbjct: 130 SGRTWGEVETLFPANETGGVF--VRQLPVVLPSGRLIVPIFRCITTPGEKWVGNSDDSAV 187

Query: 164 EVTVDAGRSWRKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
            ++ DAG +W ++    +P     V   IQPV    A+ +L  L RS      +  S S 
Sbjct: 188 MISDDAGATWTEH---VLPGSLGCVHMNIQPV----ADGSLLALFRS-RWADAIYESRST 239

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------------------- 257
           D G TWS   PT+L N NS I    L DGRL L YN                        
Sbjct: 240 DDGSTWSEPVPTELPNNNSSIQFTALADGRLALVYNHSRAQENTERRLSLYDEIDDDGLA 299

Query: 258 -------------------------VSRGVLKVALSKDDGDSW--------HDALTLEEN 284
                                      R  + +A+S+D G +W         D   L  N
Sbjct: 300 EEQGQLAEPDAAAVADDGSRKAFWGTPRSPMTLAISEDSGRTWPIRRNLDVGDGYCLSNN 359

Query: 285 ----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
               L  E+SYP++ Q  DGS++I YTY R  IK V + P
Sbjct: 360 SRDGLNREYSYPSIHQGPDGSLNIAYTYFRQAIKFVRVDP 399


>gi|168700772|ref|ZP_02733049.1| arylsulfatase A [Gemmata obscuriglobus UQM 2246]
          Length = 861

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 148/362 (40%), Gaps = 65/362 (17%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQT 61
           K  G  G  A E  FP +    +  HA  IVE   G  LV+++ GS E  A DV ++   
Sbjct: 485 KKEGKDGSFAAELIFPLHK---QHNHAPGIVECPNGDLLVSWYRGSGERSADDVAVYGAR 541

Query: 62  FKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLF---------------YKIGQEV 104
            K G   W    +  + P  P  N  ++      L LF               Y++  + 
Sbjct: 542 KKAGADTWSDAFLMADTPGFPDCNTTMWVDKDGKLWLFWPLILANSWESCLTSYRVSSDY 601

Query: 105 Q-------KWSGC--MKRSYNKGVTWSEREQLPPGILGPSKN------------------ 137
           Q       +W G   +K    +GV   E E     +   SK                   
Sbjct: 602 QTGGAPKWEWQGTIPLKPKDFEGVMLREFEAWKKQVAAASKMPFEPDAAPLKKKVGDKLL 661

Query: 138 ---------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
                    KP++L  G +L     +++++    M ++ D G++W    P+         
Sbjct: 662 SRLGWQPRCKPVVLGTGRVLLPLYSDTYDA--GLMALSDDGGKTWTASQPL----AGFGS 715

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           IQP      + TL   MR      +V   ES D G +W     T L+NP SG+D V+L  
Sbjct: 716 IQPAVLERKDGTLVAYMRENGVFKKVRACESKDRGESWGTVYSTDLINPGSGLDAVRLAS 775

Query: 249 GRLLLAYNTVSRG--VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
           G  L+ YN   +G   L V+LS D+G +W     LE++   ++ YPAVIQ+ DG+VH  Y
Sbjct: 776 GNWLIVYNDTVKGRSRLAVSLSDDEGKTWKWTRHLEKHETGQYHYPAVIQSKDGAVHAVY 835

Query: 307 TY 308
           +Y
Sbjct: 836 SY 837


>gi|429859179|gb|ELA33968.1| glycosyl hydrolase [Colletotrichum gloeosporioides Nara gc5]
          Length = 379

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 151/356 (42%), Gaps = 70/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HAS ++++     L  +FGGS EG PD+ IWL   + G   W SP     + N    NPV
Sbjct: 19  HASNLLQLPDKTLLCTWFGGSQEGLPDISIWLSRQEPGSDTWSSPQKISSDTNRSCQNPV 78

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
           LF+ P  G++  +   Q+     GC    + S + GVTWSE  +   G+ G    +P ++
Sbjct: 79  LFRAPKTGVIWLFHTSQDAGNQDGCYIMSRTSSDNGVTWSEARKPLKGVTGAFVRQPLVV 138

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV-IQPV 192
           L +G     +  C S+  + W  N+  S +  + D G +W +     +P+   +V + P+
Sbjct: 139 LADGTWILPVFHCRSTPGQRWIGNNDISAVFYSRDDGETWTES---VVPDSVGAVHMNPI 195

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                        RS      V  S S DG L W+  KPT L NPNSGI   +L  G L 
Sbjct: 196 IPSKEGSDYVAFYRS-RWADNVYRSTSSDG-LNWTAPKPTSLPNPNSGICAARLPSGNLA 253

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           + +N  +                                      R  L V +S D+G +
Sbjct: 254 IVFNRSAAEVGMAKREGLYDDITPEDDKRPNQAAVNGKTAIWGTPRKTLTVGISSDEGLT 313

Query: 275 WHDALTLE-----------ENLAMEFSYPAV-IQASDGSVHITYTYNRTQIKHVVL 318
           W + +  E           E    E SYP++ + A +  VH+ +T++R  IK+V +
Sbjct: 314 WKERVLEEGDGFCMTNNSIEKTNRELSYPSIFVDAEESVVHVAFTFHRQHIKYVRI 369


>gi|378978714|ref|YP_005226855.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae HS11286]
 gi|419974388|ref|ZP_14489807.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|419979991|ref|ZP_14495279.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|419985417|ref|ZP_14500558.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|419990995|ref|ZP_14505963.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|419997124|ref|ZP_14511922.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|420003333|ref|ZP_14517979.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|420008981|ref|ZP_14523467.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|420015314|ref|ZP_14529615.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|420020612|ref|ZP_14534798.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|420025987|ref|ZP_14539992.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|420032064|ref|ZP_14545881.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
 gi|420037597|ref|ZP_14551250.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|420043577|ref|ZP_14557064.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|420049293|ref|ZP_14562602.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|420054797|ref|ZP_14567968.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|420061145|ref|ZP_14574137.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|420066729|ref|ZP_14579527.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|420071224|ref|ZP_14583871.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH21]
 gi|420077268|ref|ZP_14589734.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH22]
 gi|420085227|ref|ZP_14597461.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH23]
 gi|421910221|ref|ZP_16340011.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K26BO]
 gi|421915507|ref|ZP_16345110.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K28BO]
 gi|428149143|ref|ZP_18996973.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST512-K30BO]
 gi|428942990|ref|ZP_19015938.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae VA360]
 gi|364518125|gb|AEW61253.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae HS11286]
 gi|397345815|gb|EJJ38935.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|397347501|gb|EJJ40608.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|397351811|gb|EJJ44893.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|397363393|gb|EJJ56033.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|397364918|gb|EJJ57545.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|397369701|gb|EJJ62300.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|397376556|gb|EJJ68809.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|397382436|gb|EJJ74597.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|397387607|gb|EJJ79622.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|397396048|gb|EJJ87743.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|397398387|gb|EJJ90050.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
 gi|397405162|gb|EJJ96633.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|397413685|gb|EJK04897.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|397413873|gb|EJK05079.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|397422392|gb|EJK13361.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|397429209|gb|EJK19928.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|397431639|gb|EJK22311.1| glycoside hydrolase family protein [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|397440548|gb|EJK30950.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH21]
 gi|397446150|gb|EJK36373.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH22]
 gi|397449437|gb|EJK39573.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae subsp. pneumoniae KPNIH23]
 gi|410115885|emb|CCM82636.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K26BO]
 gi|410122197|emb|CCM87735.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K28BO]
 gi|426297581|gb|EKV60063.1| putative glycosyl hydrolase, BNR repeat protein [Klebsiella
           pneumoniae VA360]
 gi|427540908|emb|CCM93111.1| expressed protein [Klebsiella pneumoniae subsp. pneumoniae
           ST512-K30BO]
          Length = 388

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 155/356 (43%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +D G  +  +FGGS EG  D+ IW      G  RW   +   ++P+    NPV
Sbjct: 33  HAANLLPLDDGSLMCVWFGGSQEGKADISIWGSRLAPGSDRWSEAVKLCDDPDRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P N L L +       + +  ++   S++ G +W   + L   PG       +PI 
Sbjct: 93  LFQAPDNVLWLLWTAQFAGNQDTAIVRYRLSHDGGRSWGAIDTLLDQPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ +G     +  C +   E W  N+  S ++++ D G+SWR    + +P ESL  +   
Sbjct: 150 VMSDGNWLLPVFYCRTEPGEKWVGNNDVSAVKISSDCGKSWRD---VAVP-ESLGCVHMS 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L    RS      +  S+S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLPDGRLAAFFRS-RWADHIWFSQSSDQGESWSAPVPTTLPNNNSSIQATPLDNGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G+SW 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIADDDGRREPEATGKSAFWGAPRAPMTVAISADGGESWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    E+ L  EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WLRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGADGNLHIAYTWYRQAIKYVRVSP 380


>gi|300697882|ref|YP_003748543.1| Putative Glycosyl hydrolase, BNR repeat [Ralstonia solanacearum
           CFBP2957]
 gi|299074606|emb|CBJ54159.1| Putative Glycosyl hydrolase, BNR repeat [Ralstonia solanacearum
           CFBP2957]
          Length = 392

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 151/360 (41%), Gaps = 70/360 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L  +FGG+ EG PDV I+L     G   WQ  +   ++P   
Sbjct: 32  PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P   L L Y       + +  ++R  S + G +W   E L        +  
Sbjct: 92  EQNPVLFTTPDGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFDQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + LC +   E W  N   S + ++ D GR+W ++    +P +S+  + 
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISKDQGRTWTQHA---VP-DSVGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  L RS     RV  S S DG   WS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLALYRS-RWADRVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTRGTAFWGAPRAPLSLAISED 325

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W     LE            +    E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385


>gi|451820791|ref|YP_007456992.1| putative neuraminidase [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
 gi|451786770|gb|AGF57738.1| putative neuraminidase [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
          Length = 398

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 151/366 (41%), Gaps = 82/366 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L A+F G+ EG  D+ I++     G   W  P+   E+P     NP+
Sbjct: 31  HASNVMPLANGDLLCAWFSGTQEGIADISIFMSRLNKGETVWTEPVKLSEDPTKSEQNPI 90

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P+  L L Y   +   + +  ++   S + G TW   E L   PG     +   ++
Sbjct: 91  LFPAPNGDLWLIYTAQKSGNQDTAIVRYRVSSDNGYTWGPIETLFDKPGTF--VRQPIVV 148

Query: 142 LENG-----LLLCGS-SVESWNSWG--SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+N      +  C +   + WN     S ++++ D G +W +Y    +PN +  V   V 
Sbjct: 149 LDNSEWLIPIFYCYTPQGQKWNGDYDVSAVKISADNGNTWTEYE---VPNSTGCVHMNVE 205

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + TL  L RS      + MS S D G TW+   PT+L N NS I    L +G L L
Sbjct: 206 -KLHDGTLLALYRS-RWADNIYMSRSTDNGRTWTEPVPTELPNNNSSIQFTVLNNGHLAL 263

Query: 254 AYNTVS-------------------------------------------------RGVLK 264
            +N ++                                                 R  L 
Sbjct: 264 IFNDINAEKATERRASLYDEIEDDEEASEAVDNTENSFVPEDNGTERKTAFWGTPRAPLA 323

Query: 265 VALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
           +A+S+D G +W     +E            E +  EFSYP+V Q SDG ++IT+TY R  
Sbjct: 324 IAISEDGGKTWPYKRNIEVGDGYCMSNNSKEKVNREFSYPSVKQTSDGKINITFTYFRQY 383

Query: 313 IKHVVL 318
           IKHV +
Sbjct: 384 IKHVCV 389


>gi|157693727|ref|YP_001488189.1| hypothetical protein BPUM_2975 [Bacillus pumilus SAFR-032]
 gi|157682485|gb|ABV63629.1| hypothetical protein BPUM_2975 [Bacillus pumilus SAFR-032]
          Length = 391

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/354 (27%), Positives = 156/354 (44%), Gaps = 70/354 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ IVE+D G  L  +FGG+ EG PD+ I++   K G  +W   +    +P     NPV
Sbjct: 36  HAANIVEMDNGDVLCVWFGGTQEGIPDISIYMSRLKRGSNKWTKAVKLSSDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF+     L L Y       + +  ++   S ++G TW E + L   PG     +   ++
Sbjct: 96  LFQEKDGRLWLLYTAQLSGNQDTAIVRYRLSEDRGETWGEIDTLFDQPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGS--SVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+N      +  C +   V+ + N   S ++++ D G++W +   + +PN +  V   + 
Sbjct: 154 LDNQDWLLPVFYCKTIPGVKWTGNRDVSAVKISGDQGKTWEE---VIVPNSTGYVHMNIG 210

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
              A+ +L  L RS      + +S S D G TWS  K T+L N NS I    L +G L L
Sbjct: 211 -KCADGSLLALFRS-RFADSIYISRSVDHGRTWSDPKATELPNNNSSIQFTVLNNGHLAL 268

Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
            YN ++                                     R  + +A+S D+G++W 
Sbjct: 269 VYNHINADEQTERRASLYDEIEDEGDTRTAVDTEARPAFWGTPRAPMTLAVSVDNGETWP 328

Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
               LE            + L  E+SYP++ +  DG +HI +TY R  IK+V +
Sbjct: 329 IRRNLEVGDGYAMTNNSKDKLNREYSYPSITEGKDGKLHIAFTYYRQAIKYVCV 382


>gi|357056123|ref|ZP_09117178.1| hypothetical protein HMPREF9467_04150 [Clostridium clostridioforme
           2_1_49FAA]
 gi|355381371|gb|EHG28497.1| hypothetical protein HMPREF9467_04150 [Clostridium clostridioforme
           2_1_49FAA]
          Length = 386

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 159/338 (47%), Gaps = 55/338 (16%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
           +K+ H   ++E++ G  L  +F GS EG  DV +     K G  RW+ P++   +P+   
Sbjct: 33  YKTAHGPGLLELENGDLLCVWFAGSFEGNADVNVICARLKKGSSRWEEPVLVSHDPDRSE 92

Query: 82  WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
            NP LF  P   +   Y        G++  +++  ++  +S + G TWS+ E + P   G
Sbjct: 93  QNPSLFAGPDGKVWAMYTSQLSRVEGKDNMQFTSVIRCQKSADGGETWSDYETVFPE-EG 151

Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNES-- 185
               +PI +LE+G  + G+ + + +  G     +   ++ D G++W+    + +P  +  
Sbjct: 152 SFCRQPIQVLESGRWIFGNWICTDSELGLTGDPTAFRISDDQGKTWKT---VDMPESNGA 208

Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +V++  P H     L   MRS      +  SES D G TW+   PT L N NS I  
Sbjct: 209 VHANVVELEPGH-----LAAFMRS-RAADHIYRSESMDNGDTWTAPVPTVLPNNNSSISA 262

Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------- 282
           VKLK GR+ +AYN                 R  + VALS+D G ++     +E       
Sbjct: 263 VKLKSGRIAVAYNPTCTPCPTPGVASWPGLRCPVAVALSEDGGLTFPMIRYMERGEGYMG 322

Query: 283 -ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
            EN     ++ YP ++Q +DG +H+ + Y +R  +K++
Sbjct: 323 AENKTNNRQYEYPYIMQGADGRLHLAFAYKDRLSVKYM 360


>gi|407712890|ref|YP_006833455.1| BNR repeat-containing glucosyll hydrolase [Burkholderia
           phenoliruptrix BR3459a]
 gi|407235074|gb|AFT85273.1| BNR repeat-containing glucosyll hydrolase [Burkholderia
           phenoliruptrix BR3459a]
          Length = 395

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 151/357 (42%), Gaps = 78/357 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L A+FGG+ EG PD+ I+L     G   W  P+   ++P     NPV
Sbjct: 34  HAANLLALANGDLLCAWFGGTQEGVPDISIYLSRLAQGSSTWSEPVRLSDDPTRSEQNPV 93

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL- 142
           LF  P   L L Y       + +  ++R  S + G TW   + L   + G    +PI++ 
Sbjct: 94  LFAAPDGKLWLIYTAQLSGHQNTSIVRRRVSGDNGHTWGPIDTL-FDLPGTFVRQPIVVA 152

Query: 143 ENGLLLCG------SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQP 191
            +G  LC       +  E W  N   S +  + D G +W  +    +P     V   IQP
Sbjct: 153 RDGAWLCPVFLCRVAPGERWSGNDDVSAVMRSTDNGATWTAHD---VPGSVGCVHMNIQP 209

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
           +P    + +L  L RS      + MS S D G+ WS  +P  L N NS I  V L +G L
Sbjct: 210 LP----DGSLVALFRS-RWADHIHMSRSAD-GIAWSEPEPLALPNNNSSIQFVTLANGHL 263

Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
            L +N  S                                        R  + +A+S D 
Sbjct: 264 ALVFNNSSAAQSTGRRASLYDDIEDAEDSGTLATQAASARGTAFWGAPRAPMTLAVSMDG 323

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           G +W     LE            + L  EFSYP+++Q++DG +HI YTY R +IK+V
Sbjct: 324 GRTWPVMRNLETGDGYCMTNNSTDKLNREFSYPSIVQSADGRLHIAYTYFRQRIKYV 380


>gi|220911882|ref|YP_002487191.1| hypothetical protein Achl_1110 [Arthrobacter chlorophenolicus A6]
 gi|219858760|gb|ACL39102.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6]
          Length = 413

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 109/392 (27%), Positives = 158/392 (40%), Gaps = 91/392 (23%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQ 68
            A+   F    AP    HA+ ++ +  G     +FGG+ EG PD+ IW    + G  +W 
Sbjct: 19  TADGADFAYLPAPTVQSHAANLLTLPDGRLGCVWFGGTQEGVPDISIWFSALEPGSKQWS 78

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQ 126
           +P    ++      NP+LF      L L Y   +   + +  ++R  S + G TW + E 
Sbjct: 79  APEQLSDDSTRSEQNPILFTNTDGALWLLYTAQKAGNQDTAEVRRRISLDSGRTWGDVET 138

Query: 127 L-----PPGILGPSKNKPILLENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSW 173
           L       G+    +  P++L +G L+        +  E W  NS  S + ++ DAG +W
Sbjct: 139 LFAANETGGVF--VRQLPVVLPSGRLIIPIFRCITTPGEKWVGNSDDSAVMISDDAGATW 196

Query: 174 RKYGPIYIPNESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
            +     +P     V   IQPV    A+ +L  L RS      +  S S D G TWS   
Sbjct: 197 TE---TVLPGSLGCVHMNIQPV----ADGSLLALFRS-RWADSIYESRSTDDGSTWSEPV 248

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNT--------------------------------- 257
           PT+L N NS I  V LKDGRL L YN                                  
Sbjct: 249 PTELPNNNSSIQFVALKDGRLALVYNHSRAGEGTERRLSLYDEIDDDGLADEQGQVAEPD 308

Query: 258 ----------------VSRGVLKVALSKDDGDSW--------HDALTLEEN----LAMEF 289
                             R  + +A+S+D G SW         D   L  N    L  E+
Sbjct: 309 ASAFSEDDGVKRAFWGTPRSPMTLAISEDSGRSWPIRRNLDVGDGYCLSNNSRDGLNREY 368

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
           SYP++ Q  DG+++I YTY R  IK V + P 
Sbjct: 369 SYPSIHQGPDGALNIAYTYFRQAIKFVRVDPQ 400


>gi|374991679|ref|YP_004967174.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
           bingchenggensis BCW-1]
 gi|297162331|gb|ADI12043.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
           bingchenggensis BCW-1]
          Length = 409

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 156/379 (41%), Gaps = 91/379 (24%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +FGG+ EG  D+ +W      G   W  P+   ++   
Sbjct: 38  APAVQNHAANLAVLPGGDLGCVWFGGTQEGVADICVWFSRLAPGADTWTDPVRLSDDLTR 97

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQL-----PPGIL 132
              NPVLF  P+  L L +   Q   + +    ++ S + G TW     L       G+ 
Sbjct: 98  SEQNPVLFSAPTGELWLLHTAQQAGNQDTAEVRLRTSADGGATWGPTRTLFAATATGGVF 157

Query: 133 GPSKNKPILLENG--LLLCGSSVES----W--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
              +  P++L++G  LL     V +    W  +   S + ++ D GR+WR+     +P  
Sbjct: 158 --VRQPPVVLDDGRWLLPVFHCVATPGIPWVGDHDTSAVMISDDQGRTWREQ---PVPGS 212

Query: 185 SLSV---IQPVPFHTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
           +  V   I P+P    + TL  L RS   + I R   S S DGG TWS    T+L N NS
Sbjct: 213 TGCVHMNIHPLP----DGTLLALFRSRWADAIHR---SHSRDGGETWSEPVATELPNNNS 265

Query: 240 GIDGVKLKDGRLLLAYNTVS---------------------------------------- 259
            I  V L DGRL L YN  S                                        
Sbjct: 266 SIQYVPLADGRLALVYNHSSAADATARRVSLYDEIDDDGQAGESPAATAAPEVADGGKTA 325

Query: 260 -----RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSV 302
                R  + +ALS D GD+W     L+            + L  E SYP+V Q+ DG++
Sbjct: 326 FWGAPRAPMTLALSSDGGDTWPHRRDLDTGDGHCLTNNSRDRLNRELSYPSVTQSPDGAL 385

Query: 303 HITYTYNRTQIKHVVLLPN 321
           HI YTY+R  IK+V + P+
Sbjct: 386 HIAYTYHRRAIKYVRVAPD 404


>gi|375263597|ref|YP_005025827.1| hypothetical protein VEJY3_22196 [Vibrio sp. EJY3]
 gi|369844024|gb|AEX24852.1| hypothetical protein VEJY3_22196 [Vibrio sp. EJY3]
          Length = 385

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 150/356 (42%), Gaps = 70/356 (19%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNVPM 81
           +   HAS ++ +  G  L  +F G+ EG  D+ I     +     W+  +    +P    
Sbjct: 32  YPQNHASNLLVLPNGDLLCTWFAGTQEGVSDISILCSRLRKDSNEWEPAVKLSNDPERSE 91

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKN 137
            NPVLF  P N L L +       + +  +  ++S++ G TW + + L   PG     + 
Sbjct: 92  QNPVLFLDPDNTLWLLWTAQLAGNQDTAIVRYRKSFDLGETWQDIDVLIDKPGTF--IRQ 149

Query: 138 KPILLENG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
             ++L NG   L +   +++    W      S ++++ D G++WR    + +P ESL  +
Sbjct: 150 PIVVLPNGNWLLPIFNCTIKPGEKWVGNYDTSSVKISEDKGQTWRD---VDVP-ESLGCV 205

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
                   N +L  + RS      +  S S D G TWS   PT+L N NS I    L +G
Sbjct: 206 HMSVALLKNGSLIAVYRS-RWADSIYQSYSHDNGETWSAPVPTELPNNNSSIQLTALNNG 264

Query: 250 RLLLAYNTVS-------------------------------------RGVLKVALSKDDG 272
            L + +N +S                                     R  + +A+SKDDG
Sbjct: 265 DLAMVFNNMSAEGATDRRSSLYDEIEDDSDAKEPEIIDGKSAFWGAPRAPMSLAISKDDG 324

Query: 273 DSWHDALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            SW     L            +E L  EFSYP + Q+SDG++HI YTY R  IK+V
Sbjct: 325 ASWQIVANLDEGDGFCMTNNSQEKLNREFSYPTITQSSDGAIHIAYTYFRQAIKYV 380


>gi|288961203|ref|YP_003451542.1| glycosyl hydrolase [Azospirillum sp. B510]
 gi|288913511|dbj|BAI74998.1| glycosyl hydrolase [Azospirillum sp. B510]
          Length = 408

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 151/376 (40%), Gaps = 83/376 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ I  +  G     +FGG+ EG PD+ I+      G  RW  P+   ++P  
Sbjct: 35  SPCVQNHAANITVLGNGDLGCVWFGGTQEGIPDISIYFSRLAQGSDRWTDPVKLSDDPTR 94

Query: 80  PMWNPVLFKLPSNGLLLFYKI---GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS- 135
              NPVLF  P   L LFY     G +   +  C + S + G +W     L  G  G   
Sbjct: 95  SEQNPVLFPAPDGTLWLFYTAQVSGNQDTAFVRC-RSSADHGHSWGPIRTLFEGTNGDGI 153

Query: 136 --KNKPILLENG-----LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
             +   ++L NG       LC G+  E W  ++  S + ++ D G SW ++    +P+  
Sbjct: 154 FIRQPVVVLPNGDWLMPTFLCHGTPGEKWVGDNDTSAVRISSDQGESWTEHA---VPDSV 210

Query: 186 LSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             V +  VP    + TL    RS      V  S S D G +WS    T+L N NS I   
Sbjct: 211 GCVHMSVVPLR--DSTLAAFYRS-RWADHVYRSLSTDNGRSWSAPVATELPNNNSSIQVT 267

Query: 245 KLKDGRLLLAYNTVS--------------------------------------------- 259
            L DGRL + +N  S                                             
Sbjct: 268 ALADGRLAIVFNQSSAENATGRRASLYDEIEDDVPAGGAAVAVAPAPVKAEALARKAFWG 327

Query: 260 --RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHIT 305
             R  L +ALS+D G +W     LE            ++L  EFSYP+V Q +DG +H+ 
Sbjct: 328 APRAPLTLALSEDGGLTWPLRRNLEVGDGYCMTNNSKDSLNREFSYPSVTQTADGDLHVA 387

Query: 306 YTYNRTQIKHVVLLPN 321
           +TY R  IK+  + P+
Sbjct: 388 FTYFRQAIKYSRVPPD 403


>gi|408675436|ref|YP_006875184.1| neuraminidase (sialidase)-like protein [Emticicia oligotrophica DSM
           17448]
 gi|387857060|gb|AFK05157.1| neuraminidase (sialidase)-like protein [Emticicia oligotrophica DSM
           17448]
          Length = 395

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 156/360 (43%), Gaps = 72/360 (20%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
           K  HAS++V +  G +L  +F GS E  + DVKI     + G+  W  P +  + PN+P 
Sbjct: 37  KHTHASSLVNLPNGDYLCVWFYGSGERTSDDVKIMGARLEKGKKTWSEPFLMADTPNIPD 96

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKG--VTWSERE--------- 125
            NPVLF      L L + I  +  KW   + R      Y K     WS ++         
Sbjct: 97  CNPVLFLNHEKKLFLVW-IAVQANKWEQSILRFRTSTDYQKSGPPVWSWQDNILLKPDDE 155

Query: 126 ----------QLPPGILG------------------PSKN--------KPILLENGLLLC 149
                     +LPP   G                  P+K         KP+LL N  ++ 
Sbjct: 156 FAKEVNKRLKELPPPTAGWAGYAPKYDESINIASNDPTKRSIGWMTRIKPLLLPNNRIVL 215

Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR-SF 208
               + +N   S M ++ D G SW    P+         IQP      N  L  LMR S 
Sbjct: 216 PLYSDGFNM--SLMAISDDDGTSWHPSLPLV----GRGPIQPAVIQKKNGNLYALMRDSG 269

Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVAL- 267
           +   RV  SES D G TW  +  T++ N  + ++ + LKDGR     N +  G  +++L 
Sbjct: 270 DDPSRVHESESTDLGETWKASSKTKIPN-TASVELLVLKDGRWAFLGNDIDDGRYRLSLF 328

Query: 268 -SKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTY----NRTQIKHVVLLP 320
            S D+G +W     LE  E     FSYP++IQ  DG +H+TY+Y    ++  IK+V++ P
Sbjct: 329 LSDDEGKTWKWKTKLENVEPNKGSFSYPSLIQTEDGLIHLTYSYHLEKDKKSIKYVIVDP 388


>gi|389574379|ref|ZP_10164443.1| hypothetical protein BAME_30120 [Bacillus sp. M 2-6]
 gi|388425987|gb|EIL83808.1| hypothetical protein BAME_30120 [Bacillus sp. M 2-6]
          Length = 391

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 155/354 (43%), Gaps = 70/354 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++E+D G  L  +FGG+ EG PD+ I++   K G  +W   +   ++      NPV
Sbjct: 36  HAANLIELDNGDLLCVWFGGTQEGIPDISIYMSRLKQGSNQWTKAVKLSDDSTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF+     L L Y       + +  ++   S ++G TW E   L   PG     +   ++
Sbjct: 96  LFQEKDGRLWLLYTAQLSGNQDTALVRYRLSEDRGETWGEIGTLFDQPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGS--SVE-SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+N      +  C +   V+ + N   S ++++ D G++W +   + +PN +  V   + 
Sbjct: 154 LDNDDWLLPVFYCKTIPGVKWTGNRDVSAVKISSDQGKTWEE---VTVPNSTGCVHMNIG 210

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
              A+ +L  L RS      + MS S D G TWS  + T+L N NS I    L +G L L
Sbjct: 211 -KCADGSLLALFRS-RFADSIYMSRSVDHGRTWSEPRATELPNNNSSIQFTVLHNGHLAL 268

Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
            YN ++                                     R  + +A+S DDG +W 
Sbjct: 269 VYNHINADDQTERRASLYDEIEDEGDTRTAVDTEARPAFWGTPRAPMTLAISTDDGATWP 328

Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
               LE            + L  E+SYP++ +  DGS+HI +T+ R  IK+V +
Sbjct: 329 VRRNLEIGDGYAMTNNSKDKLNREYSYPSITEGKDGSLHIAFTFYRQAIKYVCV 382


>gi|227327959|ref|ZP_03831983.1| hypothetical protein PcarcW_11745 [Pectobacterium carotovorum
           subsp. carotovorum WPP14]
          Length = 395

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 154/363 (42%), Gaps = 83/363 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G   W + +   E+      NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGIADISIYMSRLVKGSDSWSNAVKLSEDATRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L Y   +   + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDNVLWLLYTAQKSGNQDTAIVRYRQSTDLGATWGEIGTLLDQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
           +L NG     +  C     E W  N   S ++++ D G++W++Y    +PN +  V   I
Sbjct: 153 VLANGDWLLPVFYCRVQPGEKWVGNDDDSAVKISSDQGKTWQEYP---VPNSTGCVHMNI 209

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
            P+     + TL  L RS      +  S S DGG TWS    T L N NS I    L++G
Sbjct: 210 TPL----KDGTLLALYRS-RWADFIYQSRSTDGGKTWSDPVATDLPNNNSSIQVTTLENG 264

Query: 250 RLLLAYNTVS------------------------------------------RGVLKVAL 267
            L L +N +S                                          R  + +A+
Sbjct: 265 HLALVFNHMSAADATERRLSLYDEIEDEEDKASDAKMPEVQAGGRSAFWGAPRAPMTLAI 324

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S+D G SW     +E            E L  EFSYP+V QA DG +H+ +TY R  IKH
Sbjct: 325 SEDGGKSWPWQRNIEVGDGYCMTNNSTEKLNREFSYPSVKQAQDGKLHVAFTYFRQAIKH 384

Query: 316 VVL 318
           VV+
Sbjct: 385 VVV 387


>gi|83749872|ref|ZP_00946839.1| Hypothetical protein RRSL_00197 [Ralstonia solanacearum UW551]
 gi|207739581|ref|YP_002257974.1| hypothetical protein RSIPO_04281 [Ralstonia solanacearum IPO1609]
 gi|83723462|gb|EAP70673.1| Hypothetical protein RRSL_00197 [Ralstonia solanacearum UW551]
 gi|206592961|emb|CAQ59867.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609]
          Length = 392

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 150/360 (41%), Gaps = 70/360 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L  +FGG+ EG PDV I+L     G   WQ  +   ++P   
Sbjct: 32  PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P+  L L Y       + +  ++R  S + G +W   E L        +  
Sbjct: 92  EQNPVLFTTPAGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFEQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + LC +   E W  N   S + ++ D GR+W    P  +P +S+  + 
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHAVP-DSVGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  L RS      V  S S DG   WS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTHGTAFWGAPRAPLSLAISED 325

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W     LE            +    E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385


>gi|421898935|ref|ZP_16329301.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
 gi|206590141|emb|CAQ37102.1| conserved hypothetical protein [Ralstonia solanacearum MolK2]
          Length = 392

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 150/360 (41%), Gaps = 70/360 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L  +FGG+ EG PDV I+L     G   WQ  +   ++P   
Sbjct: 32  PCVQNHAANLHALPNGDLLCTWFGGTQEGIPDVSIYLSRLPAGGAEWQPAVKLSDDPTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P+  L L Y       + +  ++R  S + G +W   E L        +  
Sbjct: 92  EQNPVLFTTPAGALWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIETLFEQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + +C +   E W  N   S + ++ D GR+W    P  +P +S+  + 
Sbjct: 152 IVVLSDGAWACPVFMCRTQPGERWVGNDDVSAVMISEDQGRTWT---PHAVP-DSVGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  L RS      V  S S DG   WS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHVYASRSRDG-RAWSAPQPTALPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSSGRRTSLYDDIEDAEDHGALRDQSASTHGTAFWGAPRAPLSLAISED 325

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W     LE            +    E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFTYFRQRIKYVTV 385


>gi|392979080|ref|YP_006477668.1| hypothetical protein A3UG_11160 [Enterobacter cloacae subsp.
           dissolvens SDM]
 gi|392325013|gb|AFM59966.1| hypothetical protein A3UG_11160 [Enterobacter cloacae subsp.
           dissolvens SDM]
          Length = 387

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 150/356 (42%), Gaps = 70/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W       ++      NPV
Sbjct: 32  HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLSPGSQQWSEAEKLSDDATRSEQNPV 91

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +       + +  +  ++S ++G TW +   L   PG     +   ++
Sbjct: 92  LFLAPDNVLWLLWTAQMSGNQDTAIVRYRQSRDRGKTWGDIATLLDKPGTF--IRQPIVV 149

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           LENG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +    
Sbjct: 150 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDGGKTWRD---VEVP-DSLGCVHMNI 205

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               N TL  L RS      +  S+S D G +WS  +PT L N NS I    L  G L L
Sbjct: 206 TPLCNGTLVALFRS-RWADNIYYSQSTDNGESWSVPEPTVLPNNNSSIQVTTLASGELAL 264

Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
            +N +S                                     R  + VA+S D G SW 
Sbjct: 265 VFNFMSAAGAQERRTSLYDEIDDGDGRKEPVVTHGRAAFWGAPRAPMTVAISPDGGKSWP 324

Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
               L+            E L  EFSYP++ Q+ DGS+HI YTY R  IK+V + P
Sbjct: 325 WQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTYFRQAIKYVRVTP 380


>gi|343084746|ref|YP_004774041.1| neuraminidase (sialidase)-like protein [Cyclobacterium marinum DSM
           745]
 gi|342353280|gb|AEL25810.1| neuraminidase (sialidase)-like protein [Cyclobacterium marinum DSM
           745]
          Length = 393

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 163/377 (43%), Gaps = 75/377 (19%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTF--KD 64
           K LV+E   FP         H S+IV +  G  L A+F GS E  A DV+I       KD
Sbjct: 25  KVLVSESLIFPLQQ---NHVHGSSIVSLPNGDLLTAWFEGSGERKADDVRIMGARLRKKD 81

Query: 65  GRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SY-NKG 118
            +W  P +  + PN+P  NPVLF L   G L    I  +  +W   + R     SY N G
Sbjct: 82  RQWGEPFLMADTPNIPDCNPVLF-LNQEGKLFLVWIAVQANRWEQSILRFRTSTSYENAG 140

Query: 119 V---TWSEREQLPPG-------------------------------ILGPSKN------- 137
               +W +   L PG                               I+  SK+       
Sbjct: 141 APIWSWQDNILLKPGEEFHQEVAKRFEQLPENNSGWSEYAPQYDQMIIEASKDSKKSSIG 200

Query: 138 -----KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
                KP++L +G +L     + +N   S M ++ D G +W    P+         IQP 
Sbjct: 201 WMTRIKPVVLTSGRILLPLYSDGFNF--SMMAISDDDGTTWFPSKPLV----GRGPIQPA 254

Query: 193 PFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
                N  +  +MR S +   R+  S S D G+TWS ++ T + N  S ++ + L +G  
Sbjct: 255 LGQRENGEIMAIMRDSGDAPTRIHKSYSQDEGMTWSASEKTSIPNTAS-VELLVLDNGTW 313

Query: 252 LLAYNTVSRGVLKVAL--SKDDGDSWHDALTLEENL--AMEFSYPAVIQASDGSVHITYT 307
           L   N +  G  +++L  S D+G +W   + LE+       FSYP++IQ  DG VHITY+
Sbjct: 314 LFLGNDIDDGRYRISLYISDDEGQTWRGKIYLEDEKKDVGGFSYPSLIQDIDGMVHITYS 373

Query: 308 YNRTQ----IKHVVLLP 320
           Y+  +    IK++ + P
Sbjct: 374 YHLNEKGKSIKYLKIDP 390


>gi|418402771|ref|ZP_12976276.1| hypothetical protein SM0020_21702 [Sinorhizobium meliloti
           CCNWSX0020]
 gi|359503265|gb|EHK75822.1| hypothetical protein SM0020_21702 [Sinorhizobium meliloti
           CCNWSX0020]
          Length = 401

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+      G  +W  P     +   
Sbjct: 34  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 93

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 94  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 147

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + V++D GRSW+    
Sbjct: 148 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 203

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 204 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 262

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+DGRL +  N V+                                     R 
Sbjct: 263 SSITVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 322

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 323 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 382

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 383 RRAIKYVRLPP 393


>gi|407724134|ref|YP_006843795.1| hypothetical protein BN406_06513 [Sinorhizobium meliloti Rm41]
 gi|407324194|emb|CCM72795.1| hypothetical protein BN406_06513 [Sinorhizobium meliloti Rm41]
          Length = 395

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+      G  +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+DGRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCAPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|378582298|ref|ZP_09830937.1| hypothetical protein CKS_3144 [Pantoea stewartii subsp. stewartii
           DC283]
 gi|377815035|gb|EHT98151.1| hypothetical protein CKS_3144 [Pantoea stewartii subsp. stewartii
           DC283]
          Length = 385

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 153/357 (42%), Gaps = 71/357 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  L  +FGG+ EG  D+ +W      G   W   +   ++P     NPV
Sbjct: 33  HAANILPLHDGSLLCVWFGGTQEGIADISVWCSRLPAGSHIWSEAVKLSDDPTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW   + L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQLSGNQDTAIVRYRQSGDLGKTWGPIDTLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +LENG     +  C +   E W  N   S ++++ D G+SWR    + +P ESL  +   
Sbjct: 150 VLENGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGQSWRD---VVVP-ESLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      + +S S DGG +WS  + T L N NS I    L+ G L 
Sbjct: 206 ITALQDGSLVALYRS-RWADNIYLSRSTDGGESWSAPEATVLPNNNSSIQVTTLQSGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N++S                                    R  + VA+S D G +W 
Sbjct: 265 LVFNSMSAAGATERRASLYDEIDDGDGRKEPEATGRSAFWGAPRAPMTVAISADGGKTWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
                   D   +    +E L  EFSYP++ Q + G++HI YTY R  IK+  + P+
Sbjct: 325 WQKNLDEGDGYCMTNNSQEKLNREFSYPSIKQDAQGTLHIAYTYFRQAIKYARVTPD 381


>gi|400289912|ref|ZP_10791939.1| hypothetical protein SRA_03251 [Streptococcus ratti FA-1 = DSM
           20564]
 gi|399920703|gb|EJN93520.1| hypothetical protein SRA_03251 [Streptococcus ratti FA-1 = DSM
           20564]
          Length = 390

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 150/335 (44%), Gaps = 45/335 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWN 83
           + HA  ++E   G  L A+F GS EG+ D+ I +      +  W  P I  +  +    N
Sbjct: 37  TAHAPALIETQDGGLLCAWFAGSFEGSGDISIAVSKLDPEKQVWSIPQIVSKGKDRSEQN 96

Query: 84  PVLFKLPSNGLLLFY------KIGQEVQKWSGCM--KRSYNKGVTWSEREQLPPGILGPS 135
           P  F+ P   + L Y      + G++  +++  +  ++S ++G TW E + L P     S
Sbjct: 97  PAFFRAPDGQIWLIYTSQLSRQEGKDNMQFTSVIMVQKSSDEGQTWGEPQILFPEEGTFS 156

Query: 136 KNKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
           +    +L NG       LC  S E   +  +   V+ DAG++W++   + +P+ +  V  
Sbjct: 157 RQAIQVLSNGRWIFATWLCEDSAEGLTNDPTEFRVSDDAGQTWKR---VRMPDSNGRVHA 213

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
            V    A   +  +   F     V +SES D G +W+  + T L N N+ I  +KL+ G 
Sbjct: 214 NVIEVEAGHLVAFMRSRF--ADNVYISESYDFGDSWTIPQKTVLPNNNASISAIKLQSGD 271

Query: 251 LLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE--------ENLA-- 286
           + LAYN  +              R  + V++S+D G +W     LE        EN    
Sbjct: 272 IALAYNVNAARHPEFGKVAWPGLRNPVAVSISEDFGKTWPIGRILEPAEGFIGTENKTNN 331

Query: 287 MEFSYPAVIQASDGSVHITYTY-NRTQIKHVVLLP 320
            ++ YP + Q  DG +H+ Y Y NR  +K+V   P
Sbjct: 332 AQYEYPTLYQTKDGLLHLVYAYRNRLCVKYVRFSP 366


>gi|309780704|ref|ZP_07675445.1| BNR/Asp-box repeat protein [Ralstonia sp. 5_7_47FAA]
 gi|404394394|ref|ZP_10986198.1| hypothetical protein HMPREF0989_03334 [Ralstonia sp. 5_2_56FAA]
 gi|308920386|gb|EFP66042.1| BNR/Asp-box repeat protein [Ralstonia sp. 5_7_47FAA]
 gi|348613456|gb|EGY63041.1| hypothetical protein HMPREF0989_03334 [Ralstonia sp. 5_2_56FAA]
          Length = 391

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 69/359 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L  +FGG+ EG PDV I+L     G   WQ      E+P   
Sbjct: 32  PCVQNHAANLHALPNGDLLCVWFGGTQEGIPDVSIYLSRLPAGTSEWQPATKLSEDPTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P+  L L Y       + +  ++R  S + G +W   + L        +  
Sbjct: 92  EQNPVLFSTPTGELWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIDTLFDQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + LC +   E W  N   S + ++ D GR+W ++    +P +S+  + 
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDISAVMISEDQGRTWSRHD---VP-DSIGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  + RS      +  S S DG  TWS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLAVYRS-RWADHIYASRSQDG-RTWSAPQPTTLPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSTGRRPSLYDDIEDAEDHGELRDQQASTRGTAFWGAPRAPLSLAISED 325

Query: 271 DGDSWH-------DALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W        D   +  N A     E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGHTWRRRNIEIGDGYCMTNNSADQRNREYSYPSIVQGNDGALHIAFTYFRQRIKYVTV 384


>gi|402082238|gb|EJT77383.1| glycosyl hydrolase [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 393

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 145/363 (39%), Gaps = 85/363 (23%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++ +D G  L A+FGGS EG  D+ IWL     G   W  P     +P     NPV
Sbjct: 32  HASNLLVLDSGEVLCAFFGGSMEGTSDIYIWLARLAPGADEWSEPARVSGDPGRSEQNPV 91

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSE-REQLP------------- 128
           LF+ P    L      QE    + C+ R   S ++G TWSE RE                
Sbjct: 92  LFEDPRTKNLWLLYTAQEHGAQNKCLVRRRVSTDQGRTWSEAREAFSDRGVLVRQPVTVL 151

Query: 129 PGILGPSKNKPILLENGLLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
           P   GP ++  +L    +  C  +  E W  N   S +  + D G +W++     +P+  
Sbjct: 152 PDKEGPGRDVWVLP---VFHCRAAEGEEWVGNDDVSAVRYSADGGETWKES---VVPDSR 205

Query: 186 LSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
            +V   I+P P            R  + I  +  S+ C   +TW   +PT+L NPNSGI 
Sbjct: 206 GAVHMNIRPDPGPEGGWVAVFRSRWADNI-YMSRSQDC---ITWFKPQPTELPNPNSGIG 261

Query: 243 GVKLKDGRLLLAYNT--------------------------------------VSRGVLK 264
               K GRL L YN                                         R  L 
Sbjct: 262 MAVTKGGRLALVYNDSRATPDMARREGLYDDLETGKPKLSKQAVIDGKPAIWGTPRNRLT 321

Query: 265 VALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQ 312
           +A S D G +W     L++ L              E SYP++  A+DG++HI YT+ R +
Sbjct: 322 LATSDDGGRTWKTRGLLDDGLGFCLTNNSKGQRNRELSYPSIAVAADGTIHIAYTFWRMR 381

Query: 313 IKH 315
           IK+
Sbjct: 382 IKY 384


>gi|365133916|ref|ZP_09343066.1| hypothetical protein HMPREF1032_00862 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363614831|gb|EHL66310.1| hypothetical protein HMPREF1032_00862 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 385

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 100/337 (29%), Positives = 147/337 (43%), Gaps = 55/337 (16%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P+ + HA  ++E   G  L A+F GS EG+ DV I       G  RW++P+    +    
Sbjct: 32  PWATAHAPALLETPDGSLLCAWFAGSFEGSGDVSIVCARLPKGAARWEAPVQVSHDEKRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMKR--SYNKGVTWSEREQLPPGIL 132
             NP LF  P   +   Y        G++  +++  ++R  S + G TW E     P +L
Sbjct: 92  EQNPSLFLAPDGAVWAVYTAQLDRVPGKDNMQFTSVIRRQKSTDGGRTWGE-----PDVL 146

Query: 133 ----GPSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
               G    +PI +L +G       LC  S        +  +++ D G +WR    + +P
Sbjct: 147 FAREGSFCRQPIQVLASGRWIFANWLCSDSASGLAGDPTAFQISDDQGCTWRT---VEMP 203

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
             +  V   V    A   L   MRS      +  SES D G TWS   PT L N NS I 
Sbjct: 204 GSNGRVHANV-VELAPGRLAAFMRS-RAADFIYRSESLDDGNTWSEPVPTVLPNNNSSIS 261

Query: 243 GVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------ 282
            VKL+ GR+ +AYN                 R  + VALS+D G +W     +E      
Sbjct: 262 AVKLQSGRIAVAYNPTHTPAPQPGMAAWPGLRCPVAVALSEDGGLTWPMIRHMELGEGFV 321

Query: 283 --ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIK 314
             EN     ++ YP ++QASDG +H+ + Y NR  +K
Sbjct: 322 GPENSTNNRQYEYPYLMQASDGRLHLAFAYQNRRGVK 358


>gi|296534630|ref|ZP_06897036.1| BNR/Asp-box repeat protein [Roseomonas cervicalis ATCC 49957]
 gi|296265040|gb|EFH11259.1| BNR/Asp-box repeat protein [Roseomonas cervicalis ATCC 49957]
          Length = 402

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/387 (26%), Positives = 159/387 (41%), Gaps = 77/387 (19%)

Query: 2   KKDCGIKGLVAEEFTFPANSAPF-----KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK 56
           + D G  GLV      PA    +        HA+ +  +  G     +FGG+ EG PD+ 
Sbjct: 9   RLDLGEAGLVRPAPEDPARLEAYIPPATVQSHAANLTPLPDGSLGCVWFGGTQEGVPDIC 68

Query: 57  IWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR- 113
            W      G  RW +P+   E+      NP+LF  P   L L +       + +  +++ 
Sbjct: 69  AWFSRLDPGAERWSAPVRLSEDATRSEQNPLLFPAPDGTLWLIWTAQISGNQDTAIVRKR 128

Query: 114 -SYNKGVTWSEREQL--PPGILGPSKNKPIL-LENG-----LLLCGSSV-ESW--NSWGS 161
            S++ G +W   E L  P    G    +P++ L+NG     + +C S+  + W  +   S
Sbjct: 129 ISHDMGRSWGPVETLFGPRPEGGTFMRQPVVVLDNGDWLLPIWVCKSTPGKKWVGDEDVS 188

Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
            ++++ D G+SW +   I +P +SL  +        + TL    RS      V +S S D
Sbjct: 189 AVKLSTDQGKSWTE---IEVP-DSLGCVHMSIVDLRDGTLAAFYRS-RWADNVYVSRSTD 243

Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT------------------------ 257
           GG +WS  +PT+L N NS I   +L +G L + +N                         
Sbjct: 244 GGRSWSAPEPTELPNNNSSIQVARLSNGHLAMVFNHSSAADATERRLSLYDDIEDDSDDG 303

Query: 258 ----------------VSRGVLKVALSKDDGDSWHDALTLE------------ENLAMEF 289
                           V R  + +A+S+D G SW     LE            E L  EF
Sbjct: 304 KVVAAIDTTRRSTFWGVPRAPMTLAISEDGGRSWPWRRNLETGDGYCMTNNSREGLNREF 363

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHV 316
           SYP V +  DG +HI YT  R  IK+V
Sbjct: 364 SYPTVTEGQDGRLHIAYTVYRQAIKYV 390


>gi|332667926|ref|YP_004450714.1| neuraminidase (sialidase)-like protein [Haliscomenobacter hydrossis
           DSM 1100]
 gi|332336740|gb|AEE53841.1| neuraminidase (sialidase)-like protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 390

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 161/361 (44%), Gaps = 75/361 (20%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
           K  HAS++V +  G FL  +F GS E  A DVK+     + G+  W  P +  + PN+P 
Sbjct: 37  KHVHASSVVYLPNGDFLAVWFYGSGERTADDVKLMGSRLEKGKSTWSEPFLMADTPNIPD 96

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKGV----TWSEREQLPPG-- 130
            NPVLF    N L L + I  +  KW   + R      Y K       W +   L PG  
Sbjct: 97  CNPVLFLNQENKLFLVW-IAVQANKWEQSILRLRTSTDYLKSGPPLWNWQDNILLKPGDT 155

Query: 131 -----------------------------ILGPSKN------------KPILLENGLLLC 149
                                        I+  S++            KP++L N ++L 
Sbjct: 156 FTQEVARKMKELPENTAGWAEYAPAYDKMIIVASQDVTKRSFGWMTRIKPLMLGNRIVLP 215

Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR-SF 208
             S + +N   S   ++ D G +WR   P+         IQP      N  L   MR S 
Sbjct: 216 LYS-DGFNF--SLCAISEDRGNTWRPGLPLV----GRGPIQPAIVQKMNGNLMAYMRDSG 268

Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVAL- 267
           +  GRV  SES D G +W+ A  + + N  + ++ + L+DG+     N +  G  +++L 
Sbjct: 269 DAPGRVQTSESSDQGESWTAAVKSDIPN-TASVELLVLQDGKWAFVGNDLEDGRYRLSLY 327

Query: 268 -SKDDGDSWHDALTLEENLAM---EFSYPAVIQASDGSVHITYTYNRTQ----IKHVVLL 319
            S D+G SW    TLE N+A     FSYP++IQA DG +H+TY+Y+  Q    IK+VV+ 
Sbjct: 328 LSDDEGKSWKWKTTLE-NVAPGQGSFSYPSLIQAPDGLLHLTYSYHLAQDKKSIKYVVVN 386

Query: 320 P 320
           P
Sbjct: 387 P 387


>gi|261406860|ref|YP_003243101.1| hypothetical protein GYMC10_3028 [Paenibacillus sp. Y412MC10]
 gi|261283323|gb|ACX65294.1| conserved hypothetical protein [Paenibacillus sp. Y412MC10]
          Length = 362

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 140/321 (43%), Gaps = 37/321 (11%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
            H++ ++ +D G  L  +F GS EG PD  + L     G  RW+ PI    +P     NP
Sbjct: 33  SHSANLLALDNGDLLCVWFSGSGEGNPDTNVLLSRLPAGGERWEEPIEVAADPERSEQNP 92

Query: 85  VLFKLPSNGLLLFYKIGQ-EVQKWSGCMKR-SYNKGVTWSEREQLPPGILGPSKNKPIL- 141
           ++F+ P N + L +   +   Q+ S  + R S ++G TW E   L  G  G    +PIL 
Sbjct: 93  LVFQAPDNKVWLLHTSNEPHNQQTSKIVGRISDDRGYTWGEPFVLHEGP-GMFLRQPILA 151

Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL 201
           + NG  L            S + ++ D G +W +Y        SL  +Q         TL
Sbjct: 152 MSNGEWLLPCYYCKPGGHYSVVLISADQGETWSEYEV----QGSLHRVQMSVVELDGGTL 207

Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-- 259
             + RS +   R+  S S D G TWS    T L N NS I   KL +G L + YN  +  
Sbjct: 208 FAVFRSRHA-DRIYGSVSKDFGKTWSEPAKTSLPNNNSSIQLAKLANGHLAIIYNDSTME 266

Query: 260 ------------------RGVLKVALSKDDGDSWHDALTLE------ENLAMEFSYPAVI 295
                             R  L VA+S+D G +W     ++      +     +SYP+++
Sbjct: 267 RDQYRWIERNGEWLKKTLRTPLTVAISEDGGKTWPHVKNVQMSDLEHKEKQTGYSYPSIM 326

Query: 296 QASDGSVHITYTYNRTQIKHV 316
              DGS+H  Y+Y R  IK+V
Sbjct: 327 GTKDGSIHAAYSYLRKAIKYV 347


>gi|350271843|ref|YP_004883151.1| hypothetical protein OBV_34470 [Oscillibacter valericigenes
           Sjm18-20]
 gi|348596685|dbj|BAL00646.1| hypothetical protein OBV_34470 [Oscillibacter valericigenes
           Sjm18-20]
          Length = 386

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 92/325 (28%), Positives = 145/325 (44%), Gaps = 46/325 (14%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
           + + HA  ++E++ G  L A+F GS EG+ DV I       G  +W  P+    +P    
Sbjct: 33  YATAHAPALLELENGDLLCAWFAGSYEGSADVSIVCARLPKGAEKWSKPVRVSTDPQRSE 92

Query: 82  WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILG 133
            NP LF  P   +   Y        G++  +++  ++R  S + G TWS  + L P   G
Sbjct: 93  QNPSLFLAPDEEIWAVYTAQLDRAEGKDNMQYTAVVRRQRSNDGGCTWSNYDVLFPE-EG 151

Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNESLS 187
               +PI +L NG  + G+ + + ++ G     S   ++ D G +WR+   + IP +S  
Sbjct: 152 TFCRQPIQVLSNGRWIFGNWICTDSASGLAGDPSAFRISDDGGNTWRR---VDIP-QSHG 207

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P      +  L   MRS      +  SES D G +WS   PT L N NS I  VKL 
Sbjct: 208 RVHPNVVELEDGHLAAFMRS-RAADFIYRSESFDNGDSWSVPVPTLLPNNNSSISAVKLS 266

Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
             R+ +AYN                 R  + VALS+D G ++         +    EEN 
Sbjct: 267 SSRIAIAYNPTCAPNPRADEAAWPGLRCPVAVALSEDGGATFPLIRYMELGEGYVGEENS 326

Query: 286 A--MEFSYPAVIQASDGSVHITYTY 308
               ++ YP ++Q  D S+H+ + Y
Sbjct: 327 TNNRQYEYPYLMQGRDDSLHLAFAY 351


>gi|295090883|emb|CBK76990.1| Predicted neuraminidase (sialidase) [Clostridium cf.
           saccharolyticum K10]
          Length = 386

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/338 (28%), Positives = 151/338 (44%), Gaps = 55/338 (16%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGR-WQSPIIADEEPNVPM 81
           + + HA  +VE+  G  L  +F G+ EG+ DV I      KDG+ W  P+    +P    
Sbjct: 33  YPTAHAPAMVELPNGDLLCCWFAGTYEGSADVHIVCSRLPKDGQAWLPPVDISSDPTRSE 92

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
            NP LF  P + +   Y    + Q+    M+        +S + G+TW   E + P   G
Sbjct: 93  QNPSLFYGPDSAVWAMYTAQLDRQEGKDNMQFTSVVRCQKSTDGGLTWGPYETIFPE-EG 151

Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
               +PI +L NG  +     C  S +  +   +   ++ D G++W+K   + +P  +  
Sbjct: 152 TFCRQPIQILSNGRWIFSNWICTDSADGLSGDPTAFRISDDQGKTWKK---VMMPESNGH 208

Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +V++  P H     L   MR+     R+  SES D G TWS  +PT L N NS I  
Sbjct: 209 VHANVVELEPGH-----LAAFMRNREAY-RIHRSESFDWGETWSKPEPTPLPNNNSSISA 262

Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
           VKL+ GR+ +AYN                 R  + VALS+D G ++         +    
Sbjct: 263 VKLQSGRIAIAYNPTCTPDPQPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGFIG 322

Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
           +EN     ++ YP ++Q  DG +H+ Y    R  +K+V
Sbjct: 323 DENKTNNKQYEYPYIMQGKDGMIHLAYAARTRQGVKYV 360


>gi|291087517|ref|ZP_06346666.2| BNR/Asp-box repeat protein [Clostridium sp. M62/1]
 gi|291074881|gb|EFE12245.1| BNR/Asp-box repeat protein [Clostridium sp. M62/1]
          Length = 391

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/338 (28%), Positives = 151/338 (44%), Gaps = 55/338 (16%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGR-WQSPIIADEEPNVPM 81
           + + HA  +VE+  G  L  +F G+ EG+ DV I      KDG+ W  P+    +P    
Sbjct: 38  YPTAHAPAMVELPNGDLLCCWFAGTYEGSADVHIVCSRLPKDGQAWLPPVDISSDPTRSE 97

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
            NP LF  P + +   Y    + Q+    M+        +S + G+TW   E + P   G
Sbjct: 98  QNPSLFYGPDSAVWAMYTAQLDRQEGKDNMQFTSVVRCQKSTDGGLTWGPYETIFPE-EG 156

Query: 134 PSKNKPI-LLENGLLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
               +PI +L NG  +     C  S +  +   +   ++ D G++W+K   + +P  +  
Sbjct: 157 TFCRQPIQILSNGRWIFSNWICTDSADGLSGDPTAFRISDDQGKTWKK---VMMPESNGH 213

Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +V++  P H     L   MR+     R+  SES D G TWS  +PT L N NS I  
Sbjct: 214 VHANVVELEPGH-----LAAFMRNREAY-RIHRSESFDWGETWSKPEPTPLPNNNSSISA 267

Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
           VKL+ GR+ +AYN                 R  + VALS+D G ++         +    
Sbjct: 268 VKLQSGRIAIAYNPTCTPDPQPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGFIG 327

Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
           +EN     ++ YP ++Q  DG +H+ Y    R  +K+V
Sbjct: 328 DENKTNNKQYEYPYIMQGKDGMIHLAYAARTRQGVKYV 365


>gi|323525540|ref|YP_004227693.1| hypothetical protein BC1001_1190 [Burkholderia sp. CCGE1001]
 gi|323382542|gb|ADX54633.1| hypothetical protein BC1001_1190 [Burkholderia sp. CCGE1001]
          Length = 401

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 150/357 (42%), Gaps = 78/357 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L A+FGG+ EG PD+ I+L     G   W  P+   ++P     NPV
Sbjct: 40  HAANLLGLANGDLLCAWFGGTQEGVPDISIYLSRLAQGSSTWSEPLRLSDDPTRSEQNPV 99

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL- 142
           LF  P   L L Y       + +  ++R  S + G TW   + L   + G    +PI++ 
Sbjct: 100 LFAAPDGKLWLIYTAQLSGHQNTSIVRRRVSGDNGHTWGPIDTL-FDLPGTFVRQPIVVA 158

Query: 143 ENGLLLCG------SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQP 191
            +G  LC       +  E W  N   S +  + D G +W  +    +P     V   IQP
Sbjct: 159 RDGAWLCPVFLCRVAPGERWSGNDDVSAVMRSTDNGATWTAHD---VPGSVGCVHMNIQP 215

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
           +     + +L  L RS      + MS S D G+ WS  +P  L N NS I  V L +G L
Sbjct: 216 L----RDGSLVALFRS-RWADHIHMSRSAD-GIAWSEPEPLALPNNNSSIQFVTLANGHL 269

Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
            L +N  S                                        R  + +A+S D 
Sbjct: 270 ALVFNNSSAAQSTGRRASLYDDIEDAEDSGTLATQAASARGTAFWGAPRAPMTLAVSMDG 329

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           G +W     LE            + L  EFSYP+++Q++DG +HI YTY R +IK+V
Sbjct: 330 GRTWPVMRNLETGDGYCMTNNSTDKLNREFSYPSIVQSADGRLHIAYTYFRQRIKYV 386


>gi|225389863|ref|ZP_03759587.1| hypothetical protein CLOSTASPAR_03612 [Clostridium asparagiforme
           DSM 15981]
 gi|225044056|gb|EEG54302.1| hypothetical protein CLOSTASPAR_03612 [Clostridium asparagiforme
           DSM 15981]
          Length = 386

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 156/338 (46%), Gaps = 55/338 (16%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
           +K+ H   ++E++ G  L  +F GS EG  DV +     K G  RW+ P++   +P    
Sbjct: 33  YKTAHGPGLLELENGDMLCVWFAGSFEGNADVNVICARLKKGADRWEEPVLVSHDPERSE 92

Query: 82  WNPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILG 133
            NP LF  P   +   Y        G++  +++  ++  +S + G TWS+ E + P   G
Sbjct: 93  QNPSLFSGPDGMVWAMYTSQLSRVEGKDNMQFTSVIRCQKSADGGETWSDYETVFPE-EG 151

Query: 134 PSKNKPI-LLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGPIYIPNES-- 185
               +PI +L NG  + G+ + + +  G     +   ++ D G++W+    + +P  +  
Sbjct: 152 SFCRQPIQVLSNGRWIFGNWICTDSELGLTGDPTAFRISDDQGKTWKM---VDMPESNGA 208

Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +V++  P H     L   MRS      +  SES D G TW+   PT L N NS I  
Sbjct: 209 VHANVVELEPGH-----LAAFMRS-RAADNIYRSESLDYGDTWTKPVPTVLPNNNSSISA 262

Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE------- 282
           VKL+ GR+ +AYN                 R  + VALS+D G ++     +E       
Sbjct: 263 VKLQSGRIAVAYNPTCTPCPTPGIASWPGLRCPVAVALSEDGGLTFPMIRYMERGEGYMG 322

Query: 283 -ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
            EN     ++ YP ++Q  DG +H+ + Y +R  +K++
Sbjct: 323 DENKTNNRQYEYPYIMQGRDGRLHLAFAYKDRLSVKYM 360


>gi|312142943|ref|YP_003994389.1| BNR repeat-containing glycosyl hydrolase [Halanaerobium
           hydrogeniformans]
 gi|311903594|gb|ADQ14035.1| BNR repeat-containing glycosyl hydrolase [Halanaerobium
           hydrogeniformans]
          Length = 357

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 90/327 (27%), Positives = 144/327 (44%), Gaps = 56/327 (17%)

Query: 43  AYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY-- 98
            +FGGSCEG  D+ I     K G   W   ++   + N    NP+L+++    L L Y  
Sbjct: 29  TWFGGSCEGKADISIHYSRLKKGEVSWSKAVVLSGDENRSEQNPILYEVKPGHLWLLYTA 88

Query: 99  KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNS 158
           +IG   +     ++RS + G  WS+ E L        +N PI LEN  +L  +     + 
Sbjct: 89  QIGVHQETAVVRIRRSDDYGHNWSKAEDLFEDEGLFVRNPPIKLENDDILLPAYYCQKSE 148

Query: 159 WG------SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIG 212
            G      S ++++ D G +W++   + IP ES  ++        N+ +    R+     
Sbjct: 149 TGFLGDDYSVVKLSSDGGSTWKE---VSIP-ESKGLVHMSAVELDNKDIVGFFRNRRA-D 203

Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV---------- 262
            +  + S D G+TWS  +P +L N NS I  +KLK G+L L Y+ V++ +          
Sbjct: 204 YIYRTFSKDQGITWSVPEPLELPNNNSSIQCLKLKSGKLALVYDDVNKHISPPTVDMPPW 263

Query: 263 ---------------------------LKVALSKDDGDSW--HDALTLEENL--AMEFSY 291
                                      L ++LS D G +W     L  +E L    EFSY
Sbjct: 264 FDKKDMENVGVKEVEKPSAVWGVKRNPLVISLSDDGGRTWPQKKELMTDEGLEGEPEFSY 323

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
           P+++Q   G +HI YTY R  I++V +
Sbjct: 324 PSLVQDDTGLIHIAYTYLREYIRYVTI 350


>gi|187927659|ref|YP_001898146.1| glycosyl hydrolase family protein [Ralstonia pickettii 12J]
 gi|187724549|gb|ACD25714.1| glycosyl hydrolase BNR repeat-containing protein [Ralstonia
           pickettii 12J]
          Length = 391

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 151/359 (42%), Gaps = 69/359 (19%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           P    HA+ +  +  G  L  +FGG+ EG PDV I+L     G   WQ      E+    
Sbjct: 32  PCVQNHAANLHALPNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTSEWQPATKLSEDTTRS 91

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNK 138
             NPVLF  P+  L L Y       + +  ++R  S + G +W   + L        +  
Sbjct: 92  EQNPVLFTTPTGELWLIYTAQLSGHQNTAIVRRRISTDGGRSWGPIDTLFDQAGTFVRQP 151

Query: 139 PILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
            ++L +G     + LC +   E W  N   S + ++ D GR+W ++    +P +S+  + 
Sbjct: 152 IVVLPDGAWACPVFLCRTQPGERWVGNDDISAVMISEDQGRTWSRHD---VP-DSIGCVH 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
                 A+ +L  L RS      +  S S DG  TWS  +PT L N NS I  V L++G 
Sbjct: 208 MNVQTLADGSLLALYRS-RWADHIYASRSQDG-RTWSAPQPTTLPNNNSSIQFVALRNGH 265

Query: 251 LLLAYNT----------------------------------------VSRGVLKVALSKD 270
           L L YN                                           R  L +A+S+D
Sbjct: 266 LALVYNESDASHSTGRRTSLYDDIEDAEDHGELRDQQASTRGTAFWGAPRAPLSLAISED 325

Query: 271 DGDSWH-------DALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W        D   +  N A     E+SYP+++Q +DG++HI +TY R +IK+V +
Sbjct: 326 GGQTWRRRNIEIGDGYCMTNNSADQRNREYSYPSIVQGNDGALHIAFTYFRQRIKYVTV 384


>gi|238024875|ref|YP_002909107.1| BNR repeat-containing glycosyl hydrolase [Burkholderia glumae BGR1]
 gi|237879540|gb|ACR31872.1| BNR repeat-containing glycosyl hydrolase [Burkholderia glumae BGR1]
          Length = 407

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 76/357 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNP 84
           CHA+ ++ +  G  L A+FGG+ EG PD+ I+    + G   W  P+   ++      NP
Sbjct: 39  CHAANLLALANGDLLCAWFGGTQEGVPDISIYCSRLEKGGASWSEPVRLSDDTTRSEQNP 98

Query: 85  VLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPI 140
           VLF  P   L L Y       + +  ++R  S ++G TW   + L   PG       +PI
Sbjct: 99  VLFAAPDGDLWLLYTAQLSGHQNTSIVRRRISKDQGRTWGAIDTLFDKPGTF---VRQPI 155

Query: 141 LL-ENGLLLCG---SSVESWNSWGSWMEVTV-----DAGRSWRKYGPIYIPNESLSVIQP 191
           ++  +G  LC      V+    W    +V+V     D G +W ++    +P +S+  +  
Sbjct: 156 VVARDGAWLCPVFLCRVQPGERWSGNDDVSVVMRSTDGGATWSEHA---VP-DSVGCVHM 211

Query: 192 VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
                A+ TL  L RS      +  S S D GL WS  +   L N NS I  V L +G L
Sbjct: 212 NIQMLADGTLLALYRS-RWADHIYASRSRD-GLAWSAPEALDLPNNNSSIQFVTLANGHL 269

Query: 252 LLAYNTVS----------------------------------------RGVLKVALSKDD 271
            L +N  S                                        R  + +A+S D 
Sbjct: 270 GLVFNASSAAQSTARRASLYDDIEDSEDSGELVAQAASARGTAFWGAPRAPMTLAISLDG 329

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           G +W     LE            + L  EFSYP++ Q+ DG +HI YTY R +IK+V
Sbjct: 330 GRTWPVRRNLETGDGYCMTNNSVDKLNREFSYPSIAQSPDGRLHIAYTYFRQRIKYV 386


>gi|433611690|ref|YP_007195151.1| putative neuraminidase (sialidase) [Sinorhizobium meliloti GR4]
 gi|429556632|gb|AGA11552.1| putative neuraminidase (sialidase) [Sinorhizobium meliloti GR4]
          Length = 395

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+         +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141

Query: 140 ILLENGLLLCGSSV----ESW------------NSWGSWME-----VTVDAGRSWRKYGP 178
           + L  G  +    V    E+W              W    +     V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDEAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+DGRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|384533958|ref|YP_005716622.1| hypothetical protein [Sinorhizobium meliloti BL225C]
 gi|333816134|gb|AEG08801.1| hypothetical protein SinmeB_4519 [Sinorhizobium meliloti BL225C]
          Length = 395

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+         +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+DGRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|296102356|ref|YP_003612502.1| hypothetical protein ECL_02001 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
 gi|295056815|gb|ADF61553.1| hypothetical protein ECL_02001 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
          Length = 387

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 149/356 (41%), Gaps = 70/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W       ++      NPV
Sbjct: 32  HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLSPGSQQWSEAEKLSDDATRSEQNPV 91

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +       + +  +  ++S + G TW +   L   PG     +   ++
Sbjct: 92  LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSRDLGKTWGDIATLLDKPGTF--IRQPIVV 149

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           LENG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +    
Sbjct: 150 LENGNWLLPVFYCRTRPGEKWVGNDDISAVKISEDGGKTWRD---VEVP-DSLGCVHMNI 205

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               N TL  L RS      +  S+S D G +WS  +PT L N NS I    L  G L L
Sbjct: 206 TPLRNGTLVALFRS-RWADNIYYSQSTDNGESWSVPEPTVLPNNNSSIQVTTLASGELAL 264

Query: 254 AYNTVS-------------------------------------RGVLKVALSKDDGDSWH 276
            +N +S                                     R  + VA+S D G SW 
Sbjct: 265 VFNLMSAAGAQERRTSLYDEIDDGDGRKEPVVTHGRAAFWGAPRAPMTVAISPDGGKSWP 324

Query: 277 DALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
               L+            E L  EFSYP++ Q+ DGS+HI YTY R  IK+V + P
Sbjct: 325 WQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTYFRQAIKYVRVTP 380


>gi|145589428|ref|YP_001156025.1| BNR/Asp-box repeat-containing protein [Polynucleobacter necessarius
           subsp. asymbioticus QLW-P1DMWA-1]
 gi|145047834|gb|ABP34461.1| BNR/Asp-box repeat protein [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
          Length = 417

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 143/321 (44%), Gaps = 35/321 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDGRWQSPIIADEEPNV----- 79
           S HA++++ +  G     +F GS EGAPDV I      K G+W +P +  +         
Sbjct: 75  SVHAASLIALKDGGIRAFWFAGSREGAPDVVINTSALDKSGKWSAPAVVMDRVTAEKGLG 134

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGVTWSEREQLPPGIL 132
                + NPV  +L    L LF+ +   +  W+G    S    ++G++W + ++L   I 
Sbjct: 135 RYIAKLGNPVPSRLADGRLQLFF-VTVSLGGWAGSSISSVISEDEGLSWGKPQRL---IS 190

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS----V 188
            P  N   L+++  L          ++  W+      G   R  G   I    +S     
Sbjct: 191 SPLLNLSTLVKSPALQFTDGRLGLPAYHEWIG---RFGEFLRIEGAQVIDKRRMSSGRSA 247

Query: 189 IQPVPFHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           IQPV F  + +      R     G    + +S++ + G +W  A   ++ NPNS + G+ 
Sbjct: 248 IQPVVFVNSAQDASAYFRQTRSAGLAKHIPVSQTQNAGQSWQAAGDLEIANPNSAVAGLT 307

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEENLAM------EFSYPAVIQA 297
           L +G  LLA N +  G  ++ L   D  S  WH    LE + A+      E+SYP ++ A
Sbjct: 308 LSNGARLLALNNIETGRHRLVLMMSDPKSGQWHVVDVLENDEALPDDQRKEYSYPYLLSA 367

Query: 298 SDGSVHITYTYNRTQIKHVVL 318
           +    H+ YT++R +I+HV L
Sbjct: 368 NGNDAHLVYTWDRKKIRHVYL 388


>gi|220914973|ref|YP_002490281.1| BNR repeat-containing glycosyl hydrolase [Methylobacterium nodulans
           ORS 2060]
 gi|219952724|gb|ACL63114.1| BNR repeat-containing glycosyl hydrolase [Methylobacterium nodulans
           ORS 2060]
          Length = 398

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 148/358 (41%), Gaps = 72/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G     +FGG+ EG  D+ ++      G   W       ++P     NP+
Sbjct: 37  HAANLMPLANGDLACVWFGGTQEGMADISVYFSRLAKGSDAWSPAEKLSDDPARSEQNPI 96

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPS---KNKPI 140
           LF  P+  L L +       + +  ++R  S + G +W   E L P   G     +  P+
Sbjct: 97  LFPAPTGDLWLIWTAQVSGNQDTAFVRRRLSRDHGRSWGPIETLFPRREGCGTFVRQPPV 156

Query: 141 LLENG-----LLLCGSSVES-W--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C S   + W  +   S ++++ DAGR+W +   + +P+ +  V   +
Sbjct: 157 VLANGDWLLPIFHCPSVPGAKWVGDDDTSAVKISSDAGRTWHE---VAVPDSTGCVHMSI 213

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      +  S S DGG TWS   PT L N NS I   +L +G L 
Sbjct: 214 -MPLDDGTLLALFRS-RWADAIYESRSTDGGATWSAPVPTALPNNNSSIQATRLANGHLA 271

Query: 253 LAYNTVS----------------------------------------RGVLKVALSKDDG 272
           L YN V+                                        R  L +A+S+D G
Sbjct: 272 LVYNAVNAEAATERRASLYDEIEDEAGESAAPLPAAPGKRTAFWGTPRAPLTLAISEDGG 331

Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            +W     +E            +    E SYP+V Q  DG++H+ +TY+R  IK+V L
Sbjct: 332 RTWPHRRDVETGDGYCMTNNSRDRTNRELSYPSVAQTPDGALHVAFTYHRQAIKYVRL 389


>gi|16263893|ref|NP_436685.1| hypothetical protein SM_b20145 [Sinorhizobium meliloti 1021]
 gi|15140017|emb|CAC48545.1| HYPOTHETICAL PROTEIN SM_b20145 [Sinorhizobium meliloti 1021]
          Length = 395

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+         +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPDAAQWGEPQRLSFDAEH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAVSEGRY 141

Query: 140 ILLENGLLLCGSSV----ESW------------NSWGSWME-----VTVDAGRSWRKYGP 178
           + L  G  +    V    E+W              W    +     V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDEAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+DGRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLRDGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLEGHDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|373106197|ref|ZP_09520500.1| hypothetical protein HMPREF9623_00164 [Stomatobaculum longum]
 gi|371652572|gb|EHO17980.1| hypothetical protein HMPREF9623_00164 [Stomatobaculum longum]
          Length = 387

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 148/338 (43%), Gaps = 55/338 (16%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPM 81
           F + HA  +VE+  G  L  +F G+ EG+ D+ I     ++G  +W +      +P    
Sbjct: 34  FPTAHAPAMVELPNGDLLACWFAGTYEGSADIHIVCSALRNGETKWSAVTDLSRDPKKSE 93

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
            NP LF  P   +   Y    + +     M+        +S++ G +W E E + P   G
Sbjct: 94  QNPSLFLGPDGAVWAMYTAQDDRESGKDNMQFTAEVRCQKSFDGGRSWGEYETVFPEP-G 152

Query: 134 PSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-- 185
               +PI +L NG       LC  S +      +   ++ D G+SW+    + +P  +  
Sbjct: 153 TFCRQPIQILSNGRWIFSNWLCTDSPDGLAGDPTGFRISDDQGKSWKL---VMMPKSNGH 209

Query: 186 --LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +VI+  P H     L   MR      R+  SES D G +WS   PT L N NSGI  
Sbjct: 210 VHANVIELSPGH-----LVAFMRHREA-KRIHRSESFDWGESWSEPVPTPLPNNNSGISA 263

Query: 244 VKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTL 281
           VK+K GR+ +AYN  S              R  + VALS+D G ++         +    
Sbjct: 264 VKMKSGRIAIAYNPTSAPESQSGKAAWPGLRCPVAVALSEDGGKTFPLIRWMERGEGFIG 323

Query: 282 EENLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
           +EN     ++ YP ++Q+ DG +H+ Y    R  IK++
Sbjct: 324 DENRTNNRQYEYPYLMQSKDGMLHLAYAARTREGIKYL 361


>gi|110640892|ref|YP_668620.1| hypothetical protein ECP_0693 [Escherichia coli 536]
 gi|191173952|ref|ZP_03035470.1| hypthetical protein [Escherichia coli F11]
 gi|300996592|ref|ZP_07181486.1| BNR/Asp-box repeat protein [Escherichia coli MS 200-1]
 gi|422378283|ref|ZP_16458506.1| BNR/Asp-box repeat protein [Escherichia coli MS 60-1]
 gi|432470061|ref|ZP_19712114.1| hypothetical protein A15M_00940 [Escherichia coli KTE206]
 gi|432712351|ref|ZP_19947401.1| hypothetical protein WCI_00715 [Escherichia coli KTE8]
 gi|433076851|ref|ZP_20263415.1| hypothetical protein WIU_00724 [Escherichia coli KTE131]
 gi|110342484|gb|ABG68721.1| hypothetical membrane protein [Escherichia coli 536]
 gi|190905728|gb|EDV65349.1| hypthetical protein [Escherichia coli F11]
 gi|300304489|gb|EFJ59009.1| BNR/Asp-box repeat protein [Escherichia coli MS 200-1]
 gi|324010400|gb|EGB79619.1| BNR/Asp-box repeat protein [Escherichia coli MS 60-1]
 gi|431000128|gb|ELD16202.1| hypothetical protein A15M_00940 [Escherichia coli KTE206]
 gi|431259302|gb|ELF51676.1| hypothetical protein WCI_00715 [Escherichia coli KTE8]
 gi|431600571|gb|ELI70239.1| hypothetical protein WIU_00724 [Escherichia coli KTE131]
          Length = 389

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTLEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|432552639|ref|ZP_19789370.1| hypothetical protein A1S3_01027 [Escherichia coli KTE47]
 gi|431086924|gb|ELD92940.1| hypothetical protein A1S3_01027 [Escherichia coli KTE47]
          Length = 389

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ IW      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISIWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|345011191|ref|YP_004813545.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
           violaceusniger Tu 4113]
 gi|344037540|gb|AEM83265.1| BNR repeat-containing glycosyl hydrolase [Streptomyces
           violaceusniger Tu 4113]
          Length = 401

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 149/370 (40%), Gaps = 75/370 (20%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +FGG+ EG PD+ +W        G W  P+   ++   
Sbjct: 32  APAVQNHAANLTVLPGGDLGCVWFGGTQEGVPDISVWFSRLAPGSGTWTEPVRLSDDDTR 91

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGIL--GPS 135
              NP+LF  P+  L L Y   +   + +    ++ S ++G TW     L P     G  
Sbjct: 92  SEQNPLLFPTPAGELWLLYTAQRAGDQDTAEVRLRVSADEGATWDAPRTLFPATEAGGVF 151

Query: 136 KNKPI-LLENGLLL------CGSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +P+ +L++G  L        +    W  +   S + ++ D GR+WR+     +P  + 
Sbjct: 152 IRQPVAVLDSGRWLLPVFHCVATPGVKWVGDHDTSAVMISDDQGRTWRERP---VPGSTG 208

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
            V   V     + T+  L RS      V  S S DGG TWS    T+L N NS +  V L
Sbjct: 209 CVHMNV-HQLPDTTVLALFRS-RWADAVYRSHSTDGGETWSEPVRTELPNNNSSVQYVPL 266

Query: 247 KDGRLLLAYNTVS-------------------------------------------RGVL 263
            DGRL L YN  S                                           R  +
Sbjct: 267 ADGRLALVYNHSSRADATARRVSLYDEIDDEGRTGEPPAPAATRDDAAPGAFWGAPRAPM 326

Query: 264 KVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRT 311
            +ALS D+G +W     L+            + L  E SYP + Q  DG++HI YTY+R 
Sbjct: 327 TLALSSDNGLTWPVRRDLDTGDGHCLTNNSRDRLNRELSYPTIRQGFDGTLHIAYTYHRR 386

Query: 312 QIKHVVLLPN 321
            IK+V + P+
Sbjct: 387 AIKYVRVTPD 396


>gi|334319973|ref|YP_004556602.1| hypothetical protein [Sinorhizobium meliloti AK83]
 gi|334097712|gb|AEG55722.1| hypothetical protein Sinme_4022 [Sinorhizobium meliloti AK83]
          Length = 395

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 143/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+      G  +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAER 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   N S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLNGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L+ GRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLRGGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E+                E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEDGPGTCLSNNSTDGHNKEMSYPWLLERDDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|307132546|ref|YP_003884562.1| hypothetical protein Dda3937_02632 [Dickeya dadantii 3937]
 gi|306530075|gb|ADN00006.1| expressed protein [Dickeya dadantii 3937]
          Length = 391

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 152/360 (42%), Gaps = 80/360 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I+L       G+W   +   ++P     NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVKGSGQWTPAVKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  ++  +S ++G TW +   L   PG       +PI 
Sbjct: 96  LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSLDQGYTWGDIGTLLEQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---I 189
           +L NG     +  C +   E W  N   S ++++ D G++WR+     +PN +  V   I
Sbjct: 153 VLPNGDWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
            P+     + +L  L RS      +  S S D G TWS   PT L N NS I    L +G
Sbjct: 210 TPL----KDGSLLALFRS-RWADFIYRSHSTDSGATWSEPVPTVLPNNNSSIQVTTLDNG 264

Query: 250 RLLLAYNTVS---------------------------------------RGVLKVALSKD 270
            L L +N +S                                       R  + +A+S+D
Sbjct: 265 HLALVFNAMSAEGATERRLSLYDEIEDEEETDAKMPEVSSGRSAFWGAPRAPMTLAISED 324

Query: 271 DGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G +W     LE            E L  EFSYP++ Q  DG +HI +TY R  IK+V +
Sbjct: 325 GGKTWPWQRNLEVGDGYCMTNNSTEKLNREFSYPSIKQGPDGKLHIAFTYFRQAIKYVCV 384


>gi|306812905|ref|ZP_07447098.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli NC101]
 gi|432380316|ref|ZP_19623273.1| hypothetical protein WCU_00459 [Escherichia coli KTE15]
 gi|432386086|ref|ZP_19628984.1| hypothetical protein WCY_01334 [Escherichia coli KTE16]
 gi|432512887|ref|ZP_19750123.1| hypothetical protein A17M_00740 [Escherichia coli KTE224]
 gi|432610363|ref|ZP_19846535.1| hypothetical protein A1UG_00716 [Escherichia coli KTE72]
 gi|432645121|ref|ZP_19880921.1| hypothetical protein A1W5_00867 [Escherichia coli KTE86]
 gi|432654919|ref|ZP_19890632.1| hypothetical protein A1WE_01030 [Escherichia coli KTE93]
 gi|432697999|ref|ZP_19933166.1| hypothetical protein A31M_00742 [Escherichia coli KTE169]
 gi|432744619|ref|ZP_19979319.1| hypothetical protein WGG_00735 [Escherichia coli KTE43]
 gi|432903024|ref|ZP_20112641.1| hypothetical protein A13Y_00999 [Escherichia coli KTE194]
 gi|432942639|ref|ZP_20139898.1| hypothetical protein A13C_04397 [Escherichia coli KTE183]
 gi|432970805|ref|ZP_20159683.1| hypothetical protein A15O_01367 [Escherichia coli KTE207]
 gi|432984321|ref|ZP_20173059.1| hypothetical protein A175_00773 [Escherichia coli KTE215]
 gi|433037625|ref|ZP_20225241.1| hypothetical protein WIE_00969 [Escherichia coli KTE113]
 gi|433081575|ref|ZP_20268049.1| hypothetical protein WIW_00713 [Escherichia coli KTE133]
 gi|433100201|ref|ZP_20286311.1| hypothetical protein WK5_00754 [Escherichia coli KTE145]
 gi|433143269|ref|ZP_20328436.1| hypothetical protein WKO_00802 [Escherichia coli KTE168]
 gi|433187477|ref|ZP_20371596.1| hypothetical protein WGS_00551 [Escherichia coli KTE88]
 gi|305853668|gb|EFM54107.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli NC101]
 gi|430909592|gb|ELC30958.1| hypothetical protein WCY_01334 [Escherichia coli KTE16]
 gi|430911425|gb|ELC32711.1| hypothetical protein WCU_00459 [Escherichia coli KTE15]
 gi|431044445|gb|ELD54718.1| hypothetical protein A17M_00740 [Escherichia coli KTE224]
 gi|431151181|gb|ELE52216.1| hypothetical protein A1UG_00716 [Escherichia coli KTE72]
 gi|431183899|gb|ELE83672.1| hypothetical protein A1W5_00867 [Escherichia coli KTE86]
 gi|431194313|gb|ELE93578.1| hypothetical protein A1WE_01030 [Escherichia coli KTE93]
 gi|431246487|gb|ELF40751.1| hypothetical protein A31M_00742 [Escherichia coli KTE169]
 gi|431294901|gb|ELF85078.1| hypothetical protein WGG_00735 [Escherichia coli KTE43]
 gi|431436554|gb|ELH18069.1| hypothetical protein A13Y_00999 [Escherichia coli KTE194]
 gi|431453691|gb|ELH34075.1| hypothetical protein A13C_04397 [Escherichia coli KTE183]
 gi|431485942|gb|ELH65599.1| hypothetical protein A15O_01367 [Escherichia coli KTE207]
 gi|431506064|gb|ELH84668.1| hypothetical protein A175_00773 [Escherichia coli KTE215]
 gi|431554941|gb|ELI28805.1| hypothetical protein WIE_00969 [Escherichia coli KTE113]
 gi|431605410|gb|ELI74799.1| hypothetical protein WIW_00713 [Escherichia coli KTE133]
 gi|431622324|gb|ELI91105.1| hypothetical protein WK5_00754 [Escherichia coli KTE145]
 gi|431665665|gb|ELJ32379.1| hypothetical protein WKO_00802 [Escherichia coli KTE168]
 gi|431708911|gb|ELJ73413.1| hypothetical protein WGS_00551 [Escherichia coli KTE88]
          Length = 389

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|432582750|ref|ZP_19819160.1| hypothetical protein A1SM_01972 [Escherichia coli KTE57]
 gi|433119241|ref|ZP_20304949.1| hypothetical protein WKC_00679 [Escherichia coli KTE157]
 gi|431119766|gb|ELE22765.1| hypothetical protein A1SM_01972 [Escherichia coli KTE57]
 gi|431648466|gb|ELJ15862.1| hypothetical protein WKC_00679 [Escherichia coli KTE157]
          Length = 389

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|312965114|ref|ZP_07779351.1| BNR/Asp-box repeat family protein [Escherichia coli 2362-75]
 gi|418995571|ref|ZP_13543185.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1A]
 gi|312290205|gb|EFR18088.1| BNR/Asp-box repeat family protein [Escherichia coli 2362-75]
 gi|377848961|gb|EHU13937.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1A]
          Length = 370

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      GR  W   +    +      NPV
Sbjct: 14  HAANILPMPDGALMCVWFAGTQEGIADISVWGSRLPAGRTQWSDAVKLSHDDTRSEQNPV 73

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 74  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 130

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 131 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 186

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 187 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 245

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 246 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 305

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 306 PWRRNLDEGDGNCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 362


>gi|215485704|ref|YP_002328135.1| hypothetical protein E2348C_0564 [Escherichia coli O127:H6 str.
           E2348/69]
 gi|417754437|ref|ZP_12402532.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2B]
 gi|419000826|ref|ZP_13548385.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1B]
 gi|419006338|ref|ZP_13553794.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1C]
 gi|419012203|ref|ZP_13559568.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1D]
 gi|419017107|ref|ZP_13564433.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1E]
 gi|419022799|ref|ZP_13570041.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2A]
 gi|419027608|ref|ZP_13574807.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2C]
 gi|419033150|ref|ZP_13580248.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2D]
 gi|419038389|ref|ZP_13585448.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2E]
 gi|215263776|emb|CAS08112.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
 gi|377851139|gb|EHU16094.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1C]
 gi|377853473|gb|EHU18372.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1B]
 gi|377862968|gb|EHU27775.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1D]
 gi|377867081|gb|EHU31845.1| BNR/Asp-box repeat family protein [Escherichia coli DEC1E]
 gi|377868437|gb|EHU33181.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2A]
 gi|377879383|gb|EHU43956.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2B]
 gi|377883569|gb|EHU48087.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2D]
 gi|377885491|gb|EHU49986.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2C]
 gi|377897990|gb|EHU62353.1| BNR/Asp-box repeat family protein [Escherichia coli DEC2E]
          Length = 389

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      GR  W   +    +      NPV
Sbjct: 33  HAANILPMPDGALMCVWFAGTQEGIADISVWGSRLPAGRTQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGNCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|26246649|ref|NP_752689.1| hypthetical protein [Escherichia coli CFT073]
 gi|222155418|ref|YP_002555557.1| hypothetical protein LF82_075 [Escherichia coli LF82]
 gi|227884352|ref|ZP_04002157.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
           [Escherichia coli 83972]
 gi|300989966|ref|ZP_07179042.1| BNR/Asp-box repeat protein [Escherichia coli MS 45-1]
 gi|301046002|ref|ZP_07193183.1| BNR/Asp-box repeat protein [Escherichia coli MS 185-1]
 gi|386628221|ref|YP_006147941.1| hypothetical protein i02_0728 [Escherichia coli str. 'clone D i2']
 gi|386633141|ref|YP_006152860.1| hypothetical protein i14_0728 [Escherichia coli str. 'clone D i14']
 gi|386638042|ref|YP_006104840.1| hypothetical protein ECABU_c07260 [Escherichia coli ABU 83972]
 gi|387615950|ref|YP_006118972.1| hypothetical protein NRG857_03035 [Escherichia coli O83:H1 str. NRG
           857C]
 gi|417288700|ref|ZP_12075985.1| BNR/Asp-box repeat protein [Escherichia coli TW07793]
 gi|419915280|ref|ZP_14433646.1| hypothetical protein ECKD1_18892 [Escherichia coli KD1]
 gi|422365231|ref|ZP_16445733.1| BNR/Asp-box repeat protein [Escherichia coli MS 153-1]
 gi|422382509|ref|ZP_16462669.1| BNR/Asp-box repeat protein [Escherichia coli MS 57-2]
 gi|425298838|ref|ZP_18688888.1| sialidase [Escherichia coli 07798]
 gi|432410693|ref|ZP_19653375.1| hypothetical protein WG9_01172 [Escherichia coli KTE39]
 gi|432430740|ref|ZP_19673185.1| hypothetical protein A13K_01028 [Escherichia coli KTE187]
 gi|432435268|ref|ZP_19677669.1| hypothetical protein A13M_00974 [Escherichia coli KTE188]
 gi|432455554|ref|ZP_19697755.1| hypothetical protein A15C_01344 [Escherichia coli KTE201]
 gi|432503332|ref|ZP_19745069.1| hypothetical protein A17E_00383 [Escherichia coli KTE220]
 gi|432522777|ref|ZP_19759916.1| hypothetical protein A17Y_00889 [Escherichia coli KTE230]
 gi|432567463|ref|ZP_19803990.1| hypothetical protein A1SE_01039 [Escherichia coli KTE53]
 gi|432591742|ref|ZP_19828071.1| hypothetical protein A1SS_01163 [Escherichia coli KTE60]
 gi|432606509|ref|ZP_19842704.1| hypothetical protein A1U7_01506 [Escherichia coli KTE67]
 gi|432650152|ref|ZP_19885914.1| hypothetical protein A1W7_01151 [Escherichia coli KTE87]
 gi|432731355|ref|ZP_19966193.1| hypothetical protein WGK_01192 [Escherichia coli KTE45]
 gi|432758434|ref|ZP_19992937.1| hypothetical protein A1S1_00552 [Escherichia coli KTE46]
 gi|432782531|ref|ZP_20016717.1| hypothetical protein A1SY_01362 [Escherichia coli KTE63]
 gi|432842911|ref|ZP_20076332.1| hypothetical protein A1YS_01061 [Escherichia coli KTE141]
 gi|432977349|ref|ZP_20166174.1| hypothetical protein A15S_03260 [Escherichia coli KTE209]
 gi|432994420|ref|ZP_20183036.1| hypothetical protein A17A_01501 [Escherichia coli KTE218]
 gi|432998838|ref|ZP_20187378.1| hypothetical protein A17K_01174 [Escherichia coli KTE223]
 gi|433056984|ref|ZP_20244068.1| hypothetical protein WIM_00763 [Escherichia coli KTE124]
 gi|433086298|ref|ZP_20272695.1| hypothetical protein WIY_00744 [Escherichia coli KTE137]
 gi|433114575|ref|ZP_20300391.1| hypothetical protein WKA_00763 [Escherichia coli KTE153]
 gi|433124234|ref|ZP_20309823.1| hypothetical protein WKE_00731 [Escherichia coli KTE160]
 gi|433138253|ref|ZP_20323540.1| hypothetical protein WKM_00536 [Escherichia coli KTE167]
 gi|433148081|ref|ZP_20333147.1| hypothetical protein WKQ_00746 [Escherichia coli KTE174]
 gi|433206808|ref|ZP_20390506.1| hypothetical protein WI1_00578 [Escherichia coli KTE97]
 gi|433211553|ref|ZP_20395168.1| hypothetical protein WI3_00729 [Escherichia coli KTE99]
 gi|442606900|ref|ZP_21021695.1| expressed protein [Escherichia coli Nissle 1917]
 gi|26107048|gb|AAN79232.1|AE016757_136 Hypthetical protein [Escherichia coli CFT073]
 gi|222032423|emb|CAP75162.1| hypthetical protein [Escherichia coli LF82]
 gi|227838438|gb|EEJ48904.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
           [Escherichia coli 83972]
 gi|300301975|gb|EFJ58360.1| BNR/Asp-box repeat protein [Escherichia coli MS 185-1]
 gi|300407241|gb|EFJ90779.1| BNR/Asp-box repeat protein [Escherichia coli MS 45-1]
 gi|307552534|gb|ADN45309.1| conserved hypothetical protein [Escherichia coli ABU 83972]
 gi|312945211|gb|ADR26038.1| hypothetical protein NRG857_03035 [Escherichia coli O83:H1 str. NRG
           857C]
 gi|315292047|gb|EFU51399.1| BNR/Asp-box repeat protein [Escherichia coli MS 153-1]
 gi|324006293|gb|EGB75512.1| BNR/Asp-box repeat protein [Escherichia coli MS 57-2]
 gi|355419120|gb|AER83317.1| hypothetical protein i02_0728 [Escherichia coli str. 'clone D i2']
 gi|355424040|gb|AER88236.1| hypothetical protein i14_0728 [Escherichia coli str. 'clone D i14']
 gi|386247492|gb|EII93665.1| BNR/Asp-box repeat protein [Escherichia coli TW07793]
 gi|388384072|gb|EIL45814.1| hypothetical protein ECKD1_18892 [Escherichia coli KD1]
 gi|408221841|gb|EKI45774.1| sialidase [Escherichia coli 07798]
 gi|430937935|gb|ELC58187.1| hypothetical protein WG9_01172 [Escherichia coli KTE39]
 gi|430956362|gb|ELC75038.1| hypothetical protein A13K_01028 [Escherichia coli KTE187]
 gi|430966847|gb|ELC84210.1| hypothetical protein A13M_00974 [Escherichia coli KTE188]
 gi|430984877|gb|ELD01494.1| hypothetical protein A15C_01344 [Escherichia coli KTE201]
 gi|431042334|gb|ELD52826.1| hypothetical protein A17E_00383 [Escherichia coli KTE220]
 gi|431054897|gb|ELD64466.1| hypothetical protein A17Y_00889 [Escherichia coli KTE230]
 gi|431103296|gb|ELE07966.1| hypothetical protein A1SE_01039 [Escherichia coli KTE53]
 gi|431133186|gb|ELE35184.1| hypothetical protein A1SS_01163 [Escherichia coli KTE60]
 gi|431140408|gb|ELE42179.1| hypothetical protein A1U7_01506 [Escherichia coli KTE67]
 gi|431193523|gb|ELE92859.1| hypothetical protein A1W7_01151 [Escherichia coli KTE87]
 gi|431278263|gb|ELF69257.1| hypothetical protein WGK_01192 [Escherichia coli KTE45]
 gi|431311398|gb|ELF99561.1| hypothetical protein A1S1_00552 [Escherichia coli KTE46]
 gi|431332243|gb|ELG19486.1| hypothetical protein A1SY_01362 [Escherichia coli KTE63]
 gi|431397439|gb|ELG80895.1| hypothetical protein A1YS_01061 [Escherichia coli KTE141]
 gi|431482453|gb|ELH62156.1| hypothetical protein A15S_03260 [Escherichia coli KTE209]
 gi|431509421|gb|ELH87672.1| hypothetical protein A17A_01501 [Escherichia coli KTE218]
 gi|431514407|gb|ELH92249.1| hypothetical protein A17K_01174 [Escherichia coli KTE223]
 gi|431573971|gb|ELI46760.1| hypothetical protein WIM_00763 [Escherichia coli KTE124]
 gi|431609334|gb|ELI78659.1| hypothetical protein WIY_00744 [Escherichia coli KTE137]
 gi|431637075|gb|ELJ05194.1| hypothetical protein WKA_00763 [Escherichia coli KTE153]
 gi|431649547|gb|ELJ16894.1| hypothetical protein WKE_00731 [Escherichia coli KTE160]
 gi|431665222|gb|ELJ31949.1| hypothetical protein WKM_00536 [Escherichia coli KTE167]
 gi|431676974|gb|ELJ43057.1| hypothetical protein WKQ_00746 [Escherichia coli KTE174]
 gi|431732618|gb|ELJ96069.1| hypothetical protein WI1_00578 [Escherichia coli KTE97]
 gi|431736208|gb|ELJ99550.1| hypothetical protein WI3_00729 [Escherichia coli KTE99]
 gi|441712207|emb|CCQ07672.1| expressed protein [Escherichia coli Nissle 1917]
          Length = 389

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|345299303|ref|YP_004828661.1| hypothetical protein Entas_2142 [Enterobacter asburiae LF7a]
 gi|345093240|gb|AEN64876.1| hypothetical protein Entas_2142 [Enterobacter asburiae LF7a]
          Length = 388

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 150/357 (42%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +   E+      NPV
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGIADISVWGSRLARGSQQWSEAVKLSEDATRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S++ G +W +   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQVSGNQDTAIVRYRQSHDLGNSWGKIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C +   E W  N   S ++++ D G++WR    + +P ESL  +   
Sbjct: 150 VLNNGNWLLPVFYCRTQPGEKWVGNDDISAVKISEDQGKTWRD---VEVP-ESLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N  L  L RS      +  S+S D G +WS  +PT L N NS +    L+ G L 
Sbjct: 206 ITPLKNGNLVALFRS-RWADNIYYSQSVDHGESWSVPEPTVLPNNNSSVQVTTLESGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L +N +S                                     R  + VA+S D G SW
Sbjct: 265 LVFNHMSAAGALERRASLYDEIDDGDGRKEPVVTSGRAAFWGAPRAPMTVAISPDGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q  DG++HI YTY R  IK+V + P
Sbjct: 325 PWQRHLDEGDGYCMTNNSLEKLNREFSYPSIKQGQDGALHIAYTYFRQAIKYVRVTP 381


>gi|422369047|ref|ZP_16449450.1| BNR/Asp-box repeat protein [Escherichia coli MS 16-3]
 gi|432897502|ref|ZP_20108411.1| hypothetical protein A13U_01159 [Escherichia coli KTE192]
 gi|433027672|ref|ZP_20215547.1| hypothetical protein WIA_00762 [Escherichia coli KTE109]
 gi|315299228|gb|EFU58482.1| BNR/Asp-box repeat protein [Escherichia coli MS 16-3]
 gi|431429548|gb|ELH11476.1| hypothetical protein A13U_01159 [Escherichia coli KTE192]
 gi|431545742|gb|ELI20389.1| hypothetical protein WIA_00762 [Escherichia coli KTE109]
          Length = 389

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVEHRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|384539696|ref|YP_005723780.1| hypothetical protein SM11_pD1447 [Sinorhizobium meliloti SM11]
 gi|336038349|gb|AEH84279.1| hypothetical protein SM11_pD1447 [Sinorhizobium meliloti SM11]
          Length = 395

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 94/371 (25%), Positives = 144/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+      G  +W  P     +   
Sbjct: 28  SPMVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAAQWGEPQRLSFDAEH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  S+ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAAIERDAADPTRLAASEGRY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + V++D GRSW+    
Sbjct: 142 LDLPRGCFIRAPLVIRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGVSIDGGRSWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   + S+  +   P    +       R      +V   ES DGG TWS   PT + N N
Sbjct: 198 LADLDGSIGCVHMSPVRVGDTGYAAFFRRRQA-DQVYRVESVDGGRTWSLPAPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  V+L++GRL +  N V+                                     R 
Sbjct: 257 SSIAVVRLREGRLAMICNPVNAALSADRRASLYDELGDDDGRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
            + V LS DDG ++ D + +E            +    E SYP +++  DGS+H+ YTY+
Sbjct: 317 PVSVCLSADDGRTFPDRIAIEGGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHVAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLPP 387


>gi|432800913|ref|ZP_20034900.1| hypothetical protein A1W3_01167 [Escherichia coli KTE84]
 gi|431350871|gb|ELG37674.1| hypothetical protein A1W3_01167 [Escherichia coli KTE84]
          Length = 389

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADIPVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|283785231|ref|YP_003365096.1| hypothetical protein ROD_15131 [Citrobacter rodentium ICC168]
 gi|282948685|emb|CBG88277.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
          Length = 390

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 105/379 (27%), Positives = 152/379 (40%), Gaps = 76/379 (20%)

Query: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG-- 65
           KG + E    P+        HA+ ++ +  G  +  +FGG+ EG  D+ IW      G  
Sbjct: 16  KGALVETAMLPS---ACPQNHAANLLPLPDGSLMCVWFGGTQEGVADISIWGSRLSPGSH 72

Query: 66  RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWSGCMKRSYNKGVTWSE 123
           +W   +    +P     NPVLF  P N L L +  +I           +RS + G +WS+
Sbjct: 73  QWSEAVRLSHDPTRSEQNPVLFLAPDNVLWLLWTAQIAGNQDTAIVRYRRSTDLGYSWSD 132

Query: 124 REQL--PPGILGPSKNKPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRS 172
              L   PG       +PI +L NG     +  C +   E W  NS  S ++++ D G+S
Sbjct: 133 TAVLLDTPGTF---IRQPITVLANGNWLLPVFYCRTKPGEKWVGNSDISAVKISSDGGQS 189

Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
           WR      I  +SL  +        +  L  L RS      +  S+S D G +W+   PT
Sbjct: 190 WRD----AIVPQSLGCVHMNITTLPDGRLAALFRS-RWADSIYYSQSDDNGESWTAPTPT 244

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVS--------------------------------- 259
            L N NS I    L  G L L YN +S                                 
Sbjct: 245 ALPNNNSSIQVTTLSSGELALVYNHMSAEGAQERRASLYDEIDGGDSSRTEPEATNGKSA 304

Query: 260 -----RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSV 302
                R  + VA+S D G SW     L+E                EFSYP++ Q  DG++
Sbjct: 305 FWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYCMTNNSQDKRNREFSYPSINQTPDGTL 364

Query: 303 HITYTYNRTQIKHVVLLPN 321
           HI YT+ R  IK+V + P+
Sbjct: 365 HIAYTWFRQAIKYVRVSPD 383


>gi|386018390|ref|YP_005940995.1| hypothetical protein PAJ_p0030 [Pantoea ananatis AJ13355]
 gi|386076504|ref|YP_005990587.1| BNR repeat-containing glycosyl hydrolase [Pantoea ananatis PA13]
 gi|327396476|dbj|BAK13897.1| hypothetical protein PAJ_p0030 [Pantoea ananatis AJ13355]
 gi|354685372|gb|AER34739.1| BNR repeat-containing glycosyl hydrolase [Pantoea ananatis PA13]
          Length = 385

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  L  +FGG+ EG  D+ +W      G   W   +   ++P     NPV
Sbjct: 33  HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P+N L L +       + +  +  ++S + G TW   + L   PG       +PI 
Sbjct: 93  LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      + +S S DGG TWS  + T L N NS I    L  G L 
Sbjct: 206 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLHSGELA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G +W 
Sbjct: 265 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 324

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    +E    EFSYP++ Q + G++HI YTY R  IK+  + P
Sbjct: 325 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 380


>gi|432464652|ref|ZP_19706760.1| hypothetical protein A15K_00595 [Escherichia coli KTE205]
 gi|433071739|ref|ZP_20258435.1| hypothetical protein WIS_00713 [Escherichia coli KTE129]
 gi|433182227|ref|ZP_20366524.1| hypothetical protein WGO_00682 [Escherichia coli KTE85]
 gi|430997403|gb|ELD13670.1| hypothetical protein A15K_00595 [Escherichia coli KTE205]
 gi|431592830|gb|ELI63399.1| hypothetical protein WIS_00713 [Escherichia coli KTE129]
 gi|431711315|gb|ELJ75668.1| hypothetical protein WGO_00682 [Escherichia coli KTE85]
          Length = 389

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPVNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|172065238|ref|YP_001815950.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
           [Burkholderia ambifaria MC40-6]
 gi|171997480|gb|ACB68397.1| glycosyl hydrolase BNR repeat-containing protein [Burkholderia
           ambifaria MC40-6]
          Length = 395

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 100/355 (28%), Positives = 147/355 (41%), Gaps = 74/355 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ +  +  G  L  +FGG+ EG PDV I+L     G   W SP    ++P     NPV
Sbjct: 39  HAANLHCLANGDLLCVWFGGTQEGIPDVSIYLSRLAAGTNAWSSPQKLSDDPTRSEQNPV 98

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L Y       + +  ++R  S + GVTW   E L   PG     +   I 
Sbjct: 99  LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGVTWGPVETLFDRPGTF--VRQPIIA 156

Query: 142 LENGLLLC------GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+NG  LC       +  E W  N   S + V+ D G SW  +    +P +S+  +    
Sbjct: 157 LDNGTWLCPVFLCRTAPGERWVGNDDVSAVMVSDDQGASWSLH---EVP-DSVGCVHMNV 212

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L  L RS      V  S S DG  TWS  +PT + N NS I    L++G L L
Sbjct: 213 HVLHDGSLLALYRS-RWADFVYASRSTDG-RTWSAPEPTDVPNNNSSIQFTVLRNGHLAL 270

Query: 254 AYNT----------------------------------------VSRGVLKVALSKDDGD 273
            +N                                           R  + +A+S+D G 
Sbjct: 271 VFNESDASHANARRASLYDDIEDSEDSGALRDQKASARGTAFWGAPRAPMSLAISEDGGR 330

Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           +W     LE            +    E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 331 TWRRRRNLEIGDGYCMTNNSADRRNREYSYPSIVQSQDGALHIAFTWFRQKIKYV 385


>gi|412985180|emb|CCO20205.1| putative alpha-rhamnosidase [Bathycoccus prasinos]
          Length = 586

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 142/330 (43%), Gaps = 75/330 (22%)

Query: 66  RWQSPIIADE--EPNVPMWNPVLFKLP-SNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
           +W +P +A +    N   W P LF +P ++ +LLFYK G   + W G +KRS + G TW 
Sbjct: 254 QWHAPKMAVKGTSSNDKCWRPQLFVVPQTHEVLLFYKRGASRETWEGFLKRSLDGGKTWL 313

Query: 123 EREQLPPGILGPSKNKPIL-LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR-KYGPIY 180
           E E+LPPGILGPSKNK +   E+  +L  S   + +S  +W+E     GR W  K GPI 
Sbjct: 314 EEEKLPPGILGPSKNKVLRNSEDESILSASERWTRSSACAWIERAEREGRDWTVKSGPIR 373

Query: 181 IPNESLSV---------------------IQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
           +  + L                       I+P  F+  + TL VL R      R   S S
Sbjct: 374 MEKKGLQERYGGGRMGDKAAEEEARDYGGIEPAIFYKDDGTLVVLARPAKTTERFPKSLS 433

Query: 220 C---DGGLTWSYAK--PTQLL-----NPNS-------------------GIDGVKLKDGR 250
                GG +   ++  P  L       PN+                   G D  KL DGR
Sbjct: 434 GVVHAGGRSKQSSRYFPKMLKAELKDEPNAQWSAFKEIADLPLPTVFGRGFDVEKLTDGR 493

Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY------------------- 291
           ++L  N   +  + V +S+DDG+++ +A  +E+    + S                    
Sbjct: 494 VVLVMNDPGKKGVSVKISRDDGETFREAFVIEDGDDEDESSGGSSGSSSSSGSSSSYTRD 553

Query: 292 PAVIQASDGSVHITYTYNR-TQIKHVVLLP 320
           P +  A+D  +H+ Y  N   +IK  VL P
Sbjct: 554 PVIHVANDAMIHVVYVANNGKKIKRAVLDP 583


>gi|378769380|ref|YP_005197854.1| BNR/Asp-box repeat family protein [Pantoea ananatis LMG 5342]
 gi|365188868|emb|CCF11817.1| BNR/Asp-box repeat family protein [Pantoea ananatis LMG 5342]
          Length = 393

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  L  +FGG+ EG  D+ +W      G   W   +   ++P     NPV
Sbjct: 41  HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 100

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P+N L L +       + +  +  ++S + G TW   + L   PG       +PI 
Sbjct: 101 LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 157

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL  +   
Sbjct: 158 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 213

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      + +S S DGG TWS  + T L N NS I    L  G L 
Sbjct: 214 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLHSGELA 272

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G +W 
Sbjct: 273 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 332

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    +E    EFSYP++ Q + G++HI YTY R  IK+  + P
Sbjct: 333 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 388


>gi|440287602|ref|YP_007340367.1| putative neuraminidase (sialidase) [Enterobacteriaceae bacterium
           strain FGI 57]
 gi|440047124|gb|AGB78182.1| putative neuraminidase (sialidase) [Enterobacteriaceae bacterium
           strain FGI 57]
          Length = 389

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 100/357 (28%), Positives = 147/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGTQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  +RS + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRRSTDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  +  +L N NS I    L  G L 
Sbjct: 206 ITMLRDSTLVALFRS-RWADSIYISHSADHGESWSVPQAIELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DGS+HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGSLHIAYTWWRQAIKYVRISP 381


>gi|301025188|ref|ZP_07188761.1| BNR/Asp-box repeat protein [Escherichia coli MS 69-1]
 gi|419917195|ref|ZP_14435468.1| Putative Glycosyl hydrolase [Escherichia coli KD2]
 gi|300396188|gb|EFJ79726.1| BNR/Asp-box repeat protein [Escherichia coli MS 69-1]
 gi|388394582|gb|EIL55845.1| Putative Glycosyl hydrolase [Escherichia coli KD2]
          Length = 389

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VVVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDSTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|399021574|ref|ZP_10723670.1| putative neuraminidase (sialidase) [Herbaspirillum sp. CF444]
 gi|398091127|gb|EJL81577.1| putative neuraminidase (sialidase) [Herbaspirillum sp. CF444]
          Length = 405

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 96/361 (26%), Positives = 148/361 (40%), Gaps = 82/361 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ +  +  G  L A+FGG+ EG PD+ I+L   + G   W +P+   ++      NPV
Sbjct: 44  HAANLASLSNGDLLCAWFGGTQEGIPDISIYLSRLRVGSDTWSAPVRLSDDATRSEQNPV 103

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF+ P   L L Y       + +  ++R  S ++G TW   E L   PG       +PI+
Sbjct: 104 LFETPEGALWLIYTAQLSGHQNTSIVRRRISQDQGQTWGPVETLFDKPGTF---VRQPIV 160

Query: 142 LE-NGLLLC---GSSVESWNSWGSWMEVTV-----DAGRSWRKY---GPIYIPNESLSVI 189
           +  +G  LC      V +   W    +++V     D G SW  +   G +   + ++  +
Sbjct: 161 VTGDGAWLCPVFMCRVAAGERWSGNDDISVVMRSTDRGSSWSAHEVPGSVGCVHMNIQAL 220

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           +   F    R+     R  + I     S   D G+TWS   P  L N NS I  + L +G
Sbjct: 221 KSGGFLALYRS-----RWADHI----YSSRSDDGVTWSEPHPLALPNNNSSIQFLALANG 271

Query: 250 RLLLAYNTVS----------------------------------------RGVLKVALSK 269
            L L +N  S                                        R  + +A+S 
Sbjct: 272 HLGLVFNNSSAAQSTGRRASLYDDIEDSEDSGELASQSASHRGTAFWGAPRAPMTLAISC 331

Query: 270 DDGDSW--------HDALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G +W         D   +  N A     EFSYP+++Q+ DG +HI YTY R  IK+V 
Sbjct: 332 DGGHTWPAMRNLEVGDGYCMTNNSADKRNREFSYPSIVQSDDGKLHIAYTYFRQNIKYVC 391

Query: 318 L 318
           +
Sbjct: 392 V 392


>gi|432726746|ref|ZP_19961628.1| hypothetical protein WE3_01183 [Escherichia coli KTE18]
 gi|432740432|ref|ZP_19975154.1| hypothetical protein WEE_01107 [Escherichia coli KTE23]
 gi|432989745|ref|ZP_20178412.1| hypothetical protein A179_01513 [Escherichia coli KTE217]
 gi|433109967|ref|ZP_20295842.1| hypothetical protein WK9_00828 [Escherichia coli KTE150]
 gi|431275455|gb|ELF66484.1| hypothetical protein WE3_01183 [Escherichia coli KTE18]
 gi|431286561|gb|ELF77387.1| hypothetical protein WEE_01107 [Escherichia coli KTE23]
 gi|431497982|gb|ELH77198.1| hypothetical protein A179_01513 [Escherichia coli KTE217]
 gi|431630902|gb|ELI99226.1| hypothetical protein WK9_00828 [Escherichia coli KTE150]
          Length = 389

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRSSLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|432396574|ref|ZP_19639360.1| hypothetical protein WEI_01488 [Escherichia coli KTE25]
 gi|432722204|ref|ZP_19957128.1| hypothetical protein WE1_01227 [Escherichia coli KTE17]
 gi|430917546|gb|ELC38590.1| hypothetical protein WEI_01488 [Escherichia coli KTE25]
 gi|431268003|gb|ELF59517.1| hypothetical protein WE1_01227 [Escherichia coli KTE17]
          Length = 389

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|291619532|ref|YP_003522274.1| hypothetical Protein PANA_3979 [Pantoea ananatis LMG 20103]
 gi|291154562|gb|ADD79146.1| Hypothetical Protein PANA_3979 [Pantoea ananatis LMG 20103]
          Length = 393

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 71/356 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  L  +FGG+ EG  D+ +W      G   W   +   ++P     NPV
Sbjct: 41  HAANILPLQDGSLLCVWFGGTQEGIADISVWCSRLPAGADTWSEAVKLSDDPTRSEQNPV 100

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P+N L L +       + +  +  ++S + G TW   + L   PG       +PI 
Sbjct: 101 LFLAPNNVLWLLWTAQISGNQDTAIVRYRQSSDLGQTWGPIDTLLDKPGTF---IRQPIT 157

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL  +   
Sbjct: 158 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDQGKSWRD---VVVP-DSLGCVHMN 213

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      + +S S DGG TWS  + T L N NS I    L  G L 
Sbjct: 214 ITALQDGSLVALYRS-RWADHIYLSRSTDGGETWSSPEATVLPNNNSSIQVTTLLSGELA 272

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSW- 275
           L +N +S                                    R  + VA+S D G +W 
Sbjct: 273 LVFNNMSAAGATERRASLYDEIDDGDGRKEPETTGRTAFWGAPRAPMTVAISADGGKTWP 332

Query: 276 -------HDALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                   D   +    +E    EFSYP++ Q + G++HI YTY R  IK+  + P
Sbjct: 333 WQKNLDEGDGYCMTNNSQEKRNREFSYPSIKQDAGGTLHIAYTYFRQAIKYARVTP 388


>gi|116621776|ref|YP_823932.1| neuraminidase (sialidase)-like protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224938|gb|ABJ83647.1| neuraminidase (sialidase)-like protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 447

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 157/353 (44%), Gaps = 74/353 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCE-GAPDVKI----WLQTFKDGRWQSPIIADEEPNVPMW 82
           H+S++VE+  G  LV +F GS E  A DV I    W  T K  +W +P    + P  P  
Sbjct: 94  HSSSVVELPNGDLLVCWFHGSGELTADDVLIQAARWNHTTK--QWSAPFTLADTPGFPET 151

Query: 83  NPVLFKLPSNGLLLF----------------YKIGQEVQKWSGCMKRSYNKGVTW----- 121
           NPVLF L S+  L F                Y+I  + Q+ SG  K  +   +       
Sbjct: 152 NPVLF-LDSHQRLFFAWPLIVAHKWETALMKYRISTDYQQPSGPPKWEFQDNIVLIPKDI 210

Query: 122 ----SEREQLPPGILGP-----------------------SKNKPILLENGLLLCGSSVE 154
                E         GP                       ++  P+ L +G +L     +
Sbjct: 211 AALTKEFAGAEAAGTGPRADRAKKQIEHAEDEYFSRMGWFTRTHPLELPSGRILMPMYSD 270

Query: 155 SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--G 212
            + S+G  M ++ D G +W    PI         IQP      + +L   +R  NG    
Sbjct: 271 GY-SYGI-MAISDDGGYTWTASQPIV----GEGCIQPSVVRKKDGSLVAYLRD-NGPPPK 323

Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRG--VLKVALSKD 270
           R  +S S D G+TW+ A+ T++ NP + ++ ++L++G  ++ YN + +G   L  A+S D
Sbjct: 324 RAHISYSKDDGVTWTRARDTEIPNPGTSLENIRLRNGNWIMVYNDLEQGRYSLVAAMSDD 383

Query: 271 DGDSWHDALTLEENLA----MEFSYPAVIQASDGSVHITYTYNRTQ---IKHV 316
           +G +W     L+ + +     ++ YP+VIQA DG++H+TY+Y   +   IKHV
Sbjct: 384 EGATWKWKRHLDGSPSGARTSQYHYPSVIQARDGAIHVTYSYFTPEGKAIKHV 436


>gi|406830180|ref|ZP_11089774.1| hypothetical protein SpalD1_01032 [Schlesneria paludicola DSM
           18645]
          Length = 344

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 139/313 (44%), Gaps = 27/313 (8%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           H ++I E+  G   +AY+GG  E   D  ++    + G+  W  P    + P+    N V
Sbjct: 31  HPASIGELANGDLYIAYYGGEGEYKGDTAVFGSRREKGKTSWSLPTTIADTPDRADGNGV 90

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKPILL 142
           +++ P     LFY + +  + W+  +   K S++ G TW++ E L        +++PI L
Sbjct: 91  IWQEPGGATWLFY-VCRYGKTWADSVIKYKYSHDNGHTWTDSEMLTFERGMMVRSQPIQL 149

Query: 143 ENGLLLC------GSSVESWNSWGSWMEVTV-DAGRSWRKYGPIYIPNESLSVIQPVPFH 195
            +G  L       G+  ++  +  S +        + W     ++     +  IQP    
Sbjct: 150 FDGDFLVPIYQEKGTDAKAIGAQSSSLFARYHKKTKEWSFTNKVH---SRIGNIQPSVVQ 206

Query: 196 TANRTLRVLMRSFNGI------GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
                L    R   GI      G +  +ES DGG TW+  + T   NPNS +D +KLK+G
Sbjct: 207 LDQNKLIAFCRR-GGIYGPLLDGFIVRTESRDGGKTWALGEDTSFPNPNSAVDLIKLKNG 265

Query: 250 RLLLAYNT---VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
            L++ YN      R  L + +S D G +W  A  +      E  YP +IQ +DG ++  Y
Sbjct: 266 HLVMIYNNSNQAKRNPLTMRVSTDQGQTWPAARDIVNTPDDEQGYPFIIQTADGRINGVY 325

Query: 307 T-YNRTQIKHVVL 318
           T   RT + H VL
Sbjct: 326 TSQKRTVVNHFVL 338


>gi|417627604|ref|ZP_12277851.1| BNR/Asp-box repeat family protein [Escherichia coli STEC_MHI813]
 gi|345377908|gb|EGX09839.1| BNR/Asp-box repeat family protein [Escherichia coli STEC_MHI813]
          Length = 389

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGTQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLAALFRS-RWADNIYISHSADHGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|312132035|ref|YP_003999375.1| neuraminidase (sialidase)-like protein [Leadbetterella byssophila
           DSM 17132]
 gi|311908581|gb|ADQ19022.1| neuraminidase (sialidase)-like protein [Leadbetterella byssophila
           DSM 17132]
          Length = 371

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 149/349 (42%), Gaps = 63/349 (18%)

Query: 13  EEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGRWQSPI 71
           E   FP      +  HAS+IVE+  G  LVA+F GS E  A DVKI         W +P 
Sbjct: 19  EALIFPLQG---QHVHASSIVELPNGDLLVAWFQGSGERTADDVKIMGARRTGKSWSAPF 75

Query: 72  IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM-----KRSYNKGVTWS---- 122
           +  + P +P  NPVLF      L L + I  +  +W   +      + +     W     
Sbjct: 76  VLADTPFLPDCNPVLFLNHKKQLHLVW-IAVQANRWEQSILKILKSKDHEPNWVWQDNIL 134

Query: 123 ---------EREQ----LPP------------------GILGPSKN--------KPILLE 143
                    E EQ    LPP                      P K         KP+LL 
Sbjct: 135 LKPAEEFAVEVEQKLKSLPPLNHGWAEYAPKYDDLIITAAQDPVKRSTGWMTRIKPLLLP 194

Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
           +  +L     + +N   S M ++ D G +W    P+         IQP      + ++  
Sbjct: 195 DQKILLPLYSDGFNF--SMMALSEDDGETWLPSLPLV----GRGPIQPALVLKKDGSIAA 248

Query: 204 LMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SR 260
            +R S +   +V  SES DGG TWS ++  ++    S ++ + L DGR ++  N    SR
Sbjct: 249 FLRDSGDSPNQVHYSESKDGGYTWSASRKIKIPTAAS-VEALHLPDGRWIMVLNDQENSR 307

Query: 261 GVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
             L + +S+D+G +W     LEE     FSYP++I +SDG +H+TY+Y+
Sbjct: 308 HRLSLYVSQDEGINWERKKILEETARGSFSYPSLIPSSDGHLHLTYSYH 356


>gi|117622881|ref|YP_851794.1| hypothetical protein APECO1_1391 [Escherichia coli APEC O1]
 gi|218557602|ref|YP_002390515.1| Glycosyl hydrolase, BNR repeat [Escherichia coli S88]
 gi|237707359|ref|ZP_04537840.1| hypothetical protein ESAG_00540 [Escherichia sp. 3_2_53FAA]
 gi|386598409|ref|YP_006099915.1| hypothetical protein ECOK1_0683 [Escherichia coli IHE3034]
 gi|417083139|ref|ZP_11951275.1| hypothetical protein i01_00877 [Escherichia coli cloneA_i1]
 gi|419945350|ref|ZP_14461796.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli HM605]
 gi|422359085|ref|ZP_16439734.1| BNR/Asp-box repeat protein [Escherichia coli MS 110-3]
 gi|422748303|ref|ZP_16802216.1| hypothetical protein ERKG_00531 [Escherichia coli H252]
 gi|422753293|ref|ZP_16807120.1| hypothetical protein ERLG_00415 [Escherichia coli H263]
 gi|422839155|ref|ZP_16887127.1| hypothetical protein ESPG_01813 [Escherichia coli H397]
 gi|432356976|ref|ZP_19600223.1| hypothetical protein WCC_00931 [Escherichia coli KTE4]
 gi|432361448|ref|ZP_19604633.1| hypothetical protein WCE_00469 [Escherichia coli KTE5]
 gi|432440010|ref|ZP_19682365.1| hypothetical protein A13O_00836 [Escherichia coli KTE189]
 gi|432445123|ref|ZP_19687431.1| hypothetical protein A13S_01157 [Escherichia coli KTE191]
 gi|432586930|ref|ZP_19823301.1| hypothetical protein A1SO_01285 [Escherichia coli KTE58]
 gi|432596572|ref|ZP_19832855.1| hypothetical protein A1SW_01279 [Escherichia coli KTE62]
 gi|432753429|ref|ZP_19987996.1| hypothetical protein WEA_00408 [Escherichia coli KTE22]
 gi|432777568|ref|ZP_20011819.1| hypothetical protein A1SQ_01231 [Escherichia coli KTE59]
 gi|432786357|ref|ZP_20020523.1| hypothetical protein A1U3_00490 [Escherichia coli KTE65]
 gi|432819949|ref|ZP_20053663.1| hypothetical protein A1Y5_01555 [Escherichia coli KTE118]
 gi|432826163|ref|ZP_20059819.1| hypothetical protein A1YA_02873 [Escherichia coli KTE123]
 gi|433004157|ref|ZP_20192595.1| hypothetical protein A17S_01712 [Escherichia coli KTE227]
 gi|433011366|ref|ZP_20199771.1| hypothetical protein A17W_04129 [Escherichia coli KTE229]
 gi|433012863|ref|ZP_20201241.1| hypothetical protein WI5_00690 [Escherichia coli KTE104]
 gi|433022488|ref|ZP_20210503.1| hypothetical protein WI9_00655 [Escherichia coli KTE106]
 gi|433152781|ref|ZP_20337748.1| hypothetical protein WKS_00709 [Escherichia coli KTE176]
 gi|433162482|ref|ZP_20347241.1| hypothetical protein WKW_00685 [Escherichia coli KTE179]
 gi|433167491|ref|ZP_20352159.1| hypothetical protein WKY_00750 [Escherichia coli KTE180]
 gi|433323367|ref|ZP_20400716.1| hypothetical protein B185_007808 [Escherichia coli J96]
 gi|115512005|gb|ABJ00080.1| conserved hypothetical protein [Escherichia coli APEC O1]
 gi|218364371|emb|CAR02048.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli S88]
 gi|226898569|gb|EEH84828.1| hypothetical protein ESAG_00540 [Escherichia sp. 3_2_53FAA]
 gi|294489432|gb|ADE88188.1| conserved hypothetical protein [Escherichia coli IHE3034]
 gi|315287119|gb|EFU46533.1| BNR/Asp-box repeat protein [Escherichia coli MS 110-3]
 gi|323952758|gb|EGB48626.1| hypothetical protein ERKG_00531 [Escherichia coli H252]
 gi|323958425|gb|EGB54131.1| hypothetical protein ERLG_00415 [Escherichia coli H263]
 gi|355353007|gb|EHG02180.1| hypothetical protein i01_00877 [Escherichia coli cloneA_i1]
 gi|371611176|gb|EHN99702.1| hypothetical protein ESPG_01813 [Escherichia coli H397]
 gi|388415905|gb|EIL75813.1| Putative Glycosyl hydrolase, BNR repeat [Escherichia coli HM605]
 gi|430879786|gb|ELC03117.1| hypothetical protein WCC_00931 [Escherichia coli KTE4]
 gi|430889966|gb|ELC12613.1| hypothetical protein WCE_00469 [Escherichia coli KTE5]
 gi|430969242|gb|ELC86392.1| hypothetical protein A13O_00836 [Escherichia coli KTE189]
 gi|430975845|gb|ELC92729.1| hypothetical protein A13S_01157 [Escherichia coli KTE191]
 gi|431123402|gb|ELE26140.1| hypothetical protein A1SO_01285 [Escherichia coli KTE58]
 gi|431133475|gb|ELE35465.1| hypothetical protein A1SW_01279 [Escherichia coli KTE62]
 gi|431305039|gb|ELF93555.1| hypothetical protein WEA_00408 [Escherichia coli KTE22]
 gi|431330168|gb|ELG17450.1| hypothetical protein A1SQ_01231 [Escherichia coli KTE59]
 gi|431341486|gb|ELG28493.1| hypothetical protein A1U3_00490 [Escherichia coli KTE65]
 gi|431370951|gb|ELG56744.1| hypothetical protein A1Y5_01555 [Escherichia coli KTE118]
 gi|431374359|gb|ELG59951.1| hypothetical protein A1YA_02873 [Escherichia coli KTE123]
 gi|431517478|gb|ELH95000.1| hypothetical protein A17S_01712 [Escherichia coli KTE227]
 gi|431519578|gb|ELH97030.1| hypothetical protein A17W_04129 [Escherichia coli KTE229]
 gi|431535134|gb|ELI11517.1| hypothetical protein WI5_00690 [Escherichia coli KTE104]
 gi|431540007|gb|ELI15639.1| hypothetical protein WI9_00655 [Escherichia coli KTE106]
 gi|431678162|gb|ELJ44170.1| hypothetical protein WKS_00709 [Escherichia coli KTE176]
 gi|431691152|gb|ELJ56612.1| hypothetical protein WKW_00685 [Escherichia coli KTE179]
 gi|431693491|gb|ELJ58904.1| hypothetical protein WKY_00750 [Escherichia coli KTE180]
 gi|432348070|gb|ELL42522.1| hypothetical protein B185_007808 [Escherichia coli J96]
          Length = 389

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|432405505|ref|ZP_19648227.1| hypothetical protein WEO_00688 [Escherichia coli KTE28]
 gi|430932422|gb|ELC52845.1| hypothetical protein WEO_00688 [Escherichia coli KTE28]
          Length = 389

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|342883941|gb|EGU84341.1| hypothetical protein FOXB_05140 [Fusarium oxysporum Fo5176]
          Length = 380

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 157/372 (42%), Gaps = 77/372 (20%)

Query: 16  TFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIA 73
           T P  SA  +S HAS ++++     L  +FGGS EG PD+ IWL   + G   W +P   
Sbjct: 7   TIPVPSATVQS-HASNLLQLPDKTVLCTWFGGSQEGLPDISIWLSRLEPGSSSWTTPQKI 65

Query: 74  DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPG 130
             + N    NPVLF+ P +G +      Q+     G   +KR S ++G TWSE  +L P 
Sbjct: 66  SFDENRSCQNPVLFRAPHSGEIWLLHTSQDAGNQDGAYILKRTSSDQGHTWSEASRLLPD 125

Query: 131 ILGPSKNKPILLENG-----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPI 179
           + G    +PI++ NG     L +     E  + W    ++     + D G +W+ K  P 
Sbjct: 126 VTGIFIRQPIVI-NGDGTWILPVFYCRTEPGHRWIGSDDISGVLYSDDNGATWKEKQAPD 184

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            + +  +++I PV   + + +     RS           + + G+ W   K T L NPNS
Sbjct: 185 SVGSVHMNIIPPV---SGSSSWVAFYRSR--WADNVYRSTSNNGIDWETPKATTLPNPNS 239

Query: 240 GIDGVKLKDGRLLLAYNTVS--------------------------------------RG 261
           GI   +L  GR+ + +N  +                                      R 
Sbjct: 240 GICAARLSSGRIAIVFNRSNASADTLKRQGLYDDITPEDDKRPNQVTKTGKDAIWGTPRK 299

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGS---VHITY 306
            L + +S+D+G +W + + LE+                E SYP++    DG     H+ Y
Sbjct: 300 TLTLGVSEDEGLTWRERV-LEDGDGFCGTNSSSGQENRELSYPSIYIEKDGDRDVAHVAY 358

Query: 307 TYNRTQIKHVVL 318
           T++R  IK+V +
Sbjct: 359 TWHRQHIKYVRI 370


>gi|271502008|ref|YP_003335034.1| hypothetical protein Dd586_3498 [Dickeya dadantii Ech586]
 gi|270345563|gb|ACZ78328.1| conserved hypothetical protein [Dickeya dadantii Ech586]
          Length = 394

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 150/357 (42%), Gaps = 74/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L  +FGG+ EG  D+ I+L    +  G+W   +   ++P     NPV
Sbjct: 39  HASNLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVNGSGQWTPAVKLSDDPTRSEQNPV 98

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  +  ++S ++G TW E   L   PG       +PI 
Sbjct: 99  LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSQDQGYTWGEIGVLLEQPGTF---IRQPIT 155

Query: 141 LLENG-----LLLCG-SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C     E W  N   S ++++ D G++WR+     +PN +  V   +
Sbjct: 156 VLPNGDWLLPVFYCRIQPGEKWVGNDDISAVKISSDRGQTWRES---VVPNSTGCVHMNI 212

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      +  S S DGG TWS  + T L N NS I    L +G L 
Sbjct: 213 TL-LKDGTLLALFRS-RWADFIYRSHSTDGGETWSEPEATDLPNNNSSIQVTTLDNGHLA 270

Query: 253 LAYNTVS---------------------------------------RGVLKVALSKDDGD 273
           L +N ++                                       R  + +A+S+D G 
Sbjct: 271 LVFNAMNADGATERRLSLYDEIEDEEESDAKMPEIATGRSAFWGAPRAPMTLAISEDGGK 330

Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +W     LE            +    EFSYP++ Q  DG +HI +TY R  IK+V +
Sbjct: 331 TWPWQRNLEVGDGYCMTNNSTDKRNREFSYPSIKQGPDGKLHIAFTYFRQAIKYVCV 387


>gi|158423600|ref|YP_001524892.1| glycosyl hydrolase [Azorhizobium caulinodans ORS 571]
 gi|158330489|dbj|BAF87974.1| putative glycosyl hydrolase [Azorhizobium caulinodans ORS 571]
          Length = 403

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 147/371 (39%), Gaps = 79/371 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ I  +  G     +FGG+ EG PD+ ++      G  RW  P    ++   
Sbjct: 36  SPCVQNHAANIAVLGNGDLATVWFGGTQEGIPDISVYFSRLAKGSDRWTVPQKLSDDGTR 95

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL---PPGILGP 134
              NP+LF  PS  L L +       + +  +++  S + GVTW   E L   P G    
Sbjct: 96  SEQNPILFPTPSGDLWLIWTAQISGNQDTSMVRKRVSRDHGVTWGPIETLFEAPEGFGLF 155

Query: 135 SKNKPILLENG-----LLLCGSSVESWNSWG----SWMEVTVDAGRSWRKYGPIYIPNES 185
            +  P++L++G     +  C  SV      G    S + V+ D GR+W ++    +P+ S
Sbjct: 156 VRQPPVVLDDGAWVIPVFHC-VSVPGAKWVGDRDISAVRVSADEGRTWTEH---VVPH-S 210

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           L  +      T    L +    +     + +S S DG + WS  + T L N NS I    
Sbjct: 211 LGCVHMNVLKTKEGLLALYRSRWADF--IYLSRSADGRI-WSAPEATVLPNNNSSIQATV 267

Query: 246 LKDGRLLLAYNTVS-------------------------------------------RGV 262
           L DGR+ L +N  S                                           R  
Sbjct: 268 LADGRIALVFNNSSNKDATDRRTGLYDDIEDDSAPVASASPAVSPEPTGRTAFWGAPRAP 327

Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
           L +ALS D G SW     LE            + L  EFSYP++ Q  DG + I +TY R
Sbjct: 328 LTLALSADGGRSWPVVRNLEVGDGYCMTNNSKDGLNREFSYPSITQTPDGMLQIAFTYYR 387

Query: 311 TQIKHVVLLPN 321
             IKHV + P+
Sbjct: 388 RAIKHVRVAPD 398


>gi|449138248|ref|ZP_21773539.1| putative secreted protein [Rhodopirellula europaea 6C]
 gi|448883184|gb|EMB13726.1| putative secreted protein [Rhodopirellula europaea 6C]
          Length = 188

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 84/160 (52%), Gaps = 11/160 (6%)

Query: 5   CGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD 64
            G   LV+ +  F   + P K  HASTIVE   G  + A+F G+ E  PDV I +   ++
Sbjct: 28  AGDGALVSSQQIFDLANKPTKESHASTIVETPTG-LVAAWFAGTRERDPDVGIRVSRNEN 86

Query: 65  GRWQSPI-----IADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           G+W  P+     +       P WNPVLF+     L+LFYK+G   ++W G +  S + G 
Sbjct: 87  GQWTEPVEVVNGVQSSTLRYPTWNPVLFQPTEGPLMLFYKVGPNPREWWGMLTTSEDGGK 146

Query: 120 TWSEREQLPPG-----ILGPSKNKPILLENGLLLCGSSVE 154
           TW+   +L        +LGP KNKPI L +G +LC SS E
Sbjct: 147 TWTWPTKLGEAHTIGHLLGPVKNKPIQLADGTILCPSSTE 186


>gi|91209716|ref|YP_539702.1| hypothetical protein UTI89_C0678 [Escherichia coli UTI89]
 gi|386605414|ref|YP_006111714.1| Putative Glycosyl hydrolase, BNR repeat protein [Escherichia coli
           UM146]
 gi|432572625|ref|ZP_19809116.1| hypothetical protein A1SI_01312 [Escherichia coli KTE55]
 gi|91071290|gb|ABE06171.1| hypothetical protein UTI89_C0678 [Escherichia coli UTI89]
 gi|307627898|gb|ADN72202.1| Putative Glycosyl hydrolase, BNR repeat protein [Escherichia coli
           UM146]
 gi|431110963|gb|ELE14880.1| hypothetical protein A1SI_01312 [Escherichia coli KTE55]
          Length = 389

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 148/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIVDGDDSRKEPTAVGRSAFWGAPRAPVTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 PWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|317139154|ref|XP_001817301.2| glycosyl hydrolase [Aspergillus oryzae RIB40]
          Length = 387

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/353 (28%), Positives = 149/353 (42%), Gaps = 69/353 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L A+FGGS EG PD+ I+L   + G   W   I    +      NPV
Sbjct: 35  HASNLLLLPNGDVLCAWFGGSMEGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPV 94

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLE 143
           LF+ P+  L L Y       + S  +KR  +K  G+TW + E L P      +   I+L+
Sbjct: 95  LFRTPTGDLWLLYTSQHAGNQDSAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLD 154

Query: 144 NG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
           +G   + +    VE    W      S + V+ D G++W +     IP ES   +      
Sbjct: 155 DGAWVIPVFKCRVEPGERWLGNNDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQR 210

Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
             + +   L RS      + ++ S D GL+WS  +PT L NPN+GI    L  GR+++ Y
Sbjct: 211 LKDCSYLGLFRS-RWADHIYLATSPD-GLSWSPPQPTILPNPNAGICFDVLPSGRVVVVY 268

Query: 256 NTVS---------------------------------------RGVLKVALSKDDGDSWH 276
           N  S                                       R  L VA S D G +W 
Sbjct: 269 NHSSKLDATGRRQGLYDDIADGVDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWE 328

Query: 277 DAL-----------TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
             +             E+ L  E SYP+++   +G +HI YT+ R +IK+V L
Sbjct: 329 RRVLEDGDGYCMTNNSEKKLNRELSYPSMV-LDEGKIHIAYTFWRQRIKYVQL 380


>gi|445228817|ref|ZP_21404854.1| hypothetical protein SEE10_016225, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. SE10]
 gi|444865989|gb|ELX90745.1| hypothetical protein SEE10_016225, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. SE10]
          Length = 189

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/167 (34%), Positives = 83/167 (49%), Gaps = 4/167 (2%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHW 170


>gi|440229205|ref|YP_007342998.1| putative neuraminidase (sialidase) [Serratia marcescens FGI94]
 gi|440050910|gb|AGB80813.1| putative neuraminidase (sialidase) [Serratia marcescens FGI94]
          Length = 398

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/359 (26%), Positives = 150/359 (41%), Gaps = 75/359 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +F G+ EG  D+ I+      G  +W   +   ++ +    NPV
Sbjct: 36  HAANLLPLPNGDLLCVWFAGTQEGIADISIYSSRLPQGQQQWTPAVKLSDDESRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +   +   + +  +  ++S + G +W E   L   PG     +   ++
Sbjct: 96  LFLDPDNVLWLLWTAQKSGNQDTAFVRYRQSRDLGASWGEIATLLEQPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+NG     +  C ++  E W  N   S ++++ D GRSW     + +P+ +  V   + 
Sbjct: 154 LDNGDWLLPVFYCRTAPGEKWVGNHDDSAVKISSDRGRSWHD---VAVPDSTGCVHMNIT 210

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L  L RS      +  S S DGG +WS  +PT L N NS I    L++G L L
Sbjct: 211 L-LQDGSLLALYRS-RWADFIYQSRSVDGGRSWSAPQPTALPNNNSSIQLTTLRNGDLAL 268

Query: 254 AYNTVS------------------------------------------RGVLKVALSKDD 271
            +N +S                                          R  + +A+S+D 
Sbjct: 269 VFNAMSASGALERRTSLYDEIEDEDDDGGAEAAIPAPGDGRSAFWGAPRAPMTLAISRDG 328

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           G SW     L+            E L  EFSYP + Q  DG +HI YTY R  IK+V +
Sbjct: 329 GRSWPYQRNLDEGDGYCMTNNSREKLNREFSYPTIKQGDDGRLHIAYTYFRQAIKYVCV 387


>gi|438121586|ref|ZP_20872173.1| hypothetical protein SEEP9120_01128, partial [Salmonella enterica
           subsp. enterica serovar Pullorum str. ATCC 9120]
 gi|434943228|gb|ELL49380.1| hypothetical protein SEEP9120_01128, partial [Salmonella enterica
           subsp. enterica serovar Pullorum str. ATCC 9120]
          Length = 174

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/167 (34%), Positives = 83/167 (49%), Gaps = 4/167 (2%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS 69
           LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W +
Sbjct: 5   LVNRQVILPESGTKSFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWTT 64

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP 129
           P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L  
Sbjct: 65  PQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLVN 123

Query: 130 GIL---GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
             +   GP KNK +L  NG  +   S+ES   W ++++ + D G+ W
Sbjct: 124 DDILPRGPVKNKLLLASNGAWIAPGSIESPERWRAFVDRSSDEGKHW 170


>gi|397658130|ref|YP_006498832.1| hypothetical protein A225_3121 [Klebsiella oxytoca E718]
 gi|394346470|gb|AFN32591.1| hypothetical protein A225_3121 [Klebsiella oxytoca E718]
          Length = 390

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +   ++ +    NPV
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSEAVKLSDDASRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW     L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQVSGNQDTAIVRYRQSTDLGQTWGAIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +   
Sbjct: 150 VLNNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
               +N TL  L RS      +  S+S D G +WS  +PT+L N NS I    L  G L 
Sbjct: 206 ITALSNGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLTSGELA 264

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N +S                                      R  + VA+S D G +
Sbjct: 265 LVFNHMSAAGALERRASLYDEIDDGDNSRKEPEVISGRSAFWGAPRAPMTVAISADGGKT 324

Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           W     L+E                EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382


>gi|331645830|ref|ZP_08346933.1| hypothetical protein ECIG_00213 [Escherichia coli M605]
 gi|386618128|ref|YP_006137708.1| hypothetical protein ECNA114_0612 [Escherichia coli NA114]
 gi|387828664|ref|YP_003348601.1| hypothetical protein ECSF_0611 [Escherichia coli SE15]
 gi|417661213|ref|ZP_12310794.1| expressed protein [Escherichia coli AA86]
 gi|432420758|ref|ZP_19663314.1| hypothetical protein A137_01174 [Escherichia coli KTE178]
 gi|432498894|ref|ZP_19740671.1| hypothetical protein A177_00990 [Escherichia coli KTE216]
 gi|432557664|ref|ZP_19794354.1| hypothetical protein A1S7_01313 [Escherichia coli KTE49]
 gi|432693434|ref|ZP_19928646.1| hypothetical protein A31I_00900 [Escherichia coli KTE162]
 gi|432709480|ref|ZP_19944547.1| hypothetical protein WCG_02799 [Escherichia coli KTE6]
 gi|432917826|ref|ZP_20122309.1| hypothetical protein A133_01213 [Escherichia coli KTE173]
 gi|432925132|ref|ZP_20127223.1| hypothetical protein A135_01261 [Escherichia coli KTE175]
 gi|432980152|ref|ZP_20168931.1| hypothetical protein A15W_01271 [Escherichia coli KTE211]
 gi|433095576|ref|ZP_20281788.1| hypothetical protein WK3_00778 [Escherichia coli KTE139]
 gi|433104785|ref|ZP_20290805.1| hypothetical protein WK7_00661 [Escherichia coli KTE148]
 gi|281177821|dbj|BAI54151.1| conserved hypothetical protein [Escherichia coli SE15]
 gi|330910431|gb|EGH38941.1| expressed protein [Escherichia coli AA86]
 gi|331044582|gb|EGI16709.1| hypothetical protein ECIG_00213 [Escherichia coli M605]
 gi|333968629|gb|AEG35434.1| Hypothetical protein ECNA114_0612 [Escherichia coli NA114]
 gi|430947230|gb|ELC66937.1| hypothetical protein A137_01174 [Escherichia coli KTE178]
 gi|431031795|gb|ELD44528.1| hypothetical protein A177_00990 [Escherichia coli KTE216]
 gi|431094099|gb|ELD99749.1| hypothetical protein A1S7_01313 [Escherichia coli KTE49]
 gi|431236671|gb|ELF31876.1| hypothetical protein A31I_00900 [Escherichia coli KTE162]
 gi|431251649|gb|ELF45656.1| hypothetical protein WCG_02799 [Escherichia coli KTE6]
 gi|431446700|gb|ELH27444.1| hypothetical protein A133_01213 [Escherichia coli KTE173]
 gi|431448604|gb|ELH29319.1| hypothetical protein A135_01261 [Escherichia coli KTE175]
 gi|431494147|gb|ELH73737.1| hypothetical protein A15W_01271 [Escherichia coli KTE211]
 gi|431619090|gb|ELI88015.1| hypothetical protein WK3_00778 [Escherichia coli KTE139]
 gi|431634084|gb|ELJ02340.1| hypothetical protein WK7_00661 [Escherichia coli KTE148]
          Length = 389

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 147/357 (41%), Gaps = 72/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W        +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAAKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHDGTLVALFRS-RWADNIYISHSVDNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS-------------------------------------RGVLKVALSKDDGDSW 275
           L YN +S                                     R  + VA+S D G SW
Sbjct: 265 LVYNAMSAAGAVERRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSW 324

Query: 276 HDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+            E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 325 SWRRNLDEGDGYCMTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 381


>gi|423114519|ref|ZP_17102210.1| hypothetical protein HMPREF9689_02267 [Klebsiella oxytoca 10-5245]
 gi|376384368|gb|EHS97091.1| hypothetical protein HMPREF9689_02267 [Klebsiella oxytoca 10-5245]
          Length = 390

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +   ++ +    NPV
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSDAVKLSDDASRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                  TL  L RS      +  S+S D G +WS  +PT+L N NS I    L  G L 
Sbjct: 206 ITALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N +S                                      R  + VA+S D G +
Sbjct: 265 LVFNHMSAAGARERRASLYDEIDDGDNSRKEPEVTSGHSAFWGAPRAPMTVAISADGGKT 324

Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           W     L+E                EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382


>gi|423108495|ref|ZP_17096190.1| hypothetical protein HMPREF9687_01741 [Klebsiella oxytoca 10-5243]
 gi|376384900|gb|EHS97622.1| hypothetical protein HMPREF9687_01741 [Klebsiella oxytoca 10-5243]
          Length = 390

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 151/358 (42%), Gaps = 73/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +   ++ +    NPV
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSDAVKLSDDASRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  +  ++S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISSDCGKTWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                  TL  L RS      +  S+S D G +WS  +PT+L N NS I    L  G L 
Sbjct: 206 ITALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N +S                                      R  + VA+S D G +
Sbjct: 265 LVFNHMSAAGARERRASLYDEIDDGDNSRKEPEVTSGRSAFWGAPRAPMTVAISADGGKT 324

Query: 275 WHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           W     L+E                EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 325 WPWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHVSP 382


>gi|160943611|ref|ZP_02090843.1| hypothetical protein FAEPRAM212_01103 [Faecalibacterium prausnitzii
           M21/2]
 gi|158445066|gb|EDP22069.1| BNR/Asp-box repeat protein [Faecalibacterium prausnitzii M21/2]
 gi|295104439|emb|CBL01983.1| Predicted neuraminidase (sialidase) [Faecalibacterium prausnitzii
           SL3/3]
          Length = 386

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 149/336 (44%), Gaps = 55/336 (16%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-KDG-RWQSPIIADEEPNVPMWN 83
           + HA  +VE+  G  L  +F G+ EG+ DV I       DG +W  P+    +P     N
Sbjct: 35  TAHAPAMVELPNGDLLCCWFAGTYEGSADVHIICSVLPHDGTKWLEPVNISGDPTRSEQN 94

Query: 84  PVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGPS 135
           P LF  P N +   Y        G++  +++  ++  +S + G TW +   + P   G  
Sbjct: 95  PSLFYGPDNAVWAMYTAQLDRVEGKDNMQFTAVVRCQKSTDGGKTWGDYTTVFPE-EGTF 153

Query: 136 KNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES---- 185
             +PI +L NG       LC  S E  +   +   ++ D G++WR    + +P  +    
Sbjct: 154 CRQPIQVLSNGRWIFANWLCTDSAEGLSGDPTAFRISDDEGKTWRM---VMMPGSNGHVH 210

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +VI+  P H     L   MR+     R+  SES D G TW+   PT L N NS I  VK
Sbjct: 211 ANVIELEPGH-----LVAFMRNREAY-RIHRSESFDWGETWTVPAPTPLPNNNSSISAVK 264

Query: 246 LKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEE 283
           L+ GR+ +AYN                 R  + VALS+D G ++         +    +E
Sbjct: 265 LQSGRIAIAYNPTCTPNPVPGKAAWPGLRCPVAVALSEDGGLTFPIIRWMERGEGYMGDE 324

Query: 284 NLA--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
           N     ++ YP ++Q  DG +H+ Y    R  IK+V
Sbjct: 325 NKTNNKQYEYPYLMQGRDGMLHLAYAARTRQGIKYV 360


>gi|159186481|ref|NP_396007.2| hypothetical protein Atu5072 [Agrobacterium fabrum str. C58]
 gi|159141541|gb|AAK90448.2| conserved hypothetical protein [Agrobacterium fabrum str. C58]
          Length = 396

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 140/367 (38%), Gaps = 73/367 (19%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPN 78
           ++P    HA  +   D G  + A+FGG+ EG  D+ I+      G  +W  P     +P+
Sbjct: 27  ASPMIQNHAPFLHLADDGALICAWFGGTLEGKSDISIFTSVLAAGSNKWGEPQRLSFDPD 86

Query: 79  VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG-------I 131
               NPVLF  P N LLLF+   Q       C  R        S+  +L  G        
Sbjct: 87  HSEQNPVLFAAPGNTLLLFH-TSQPSGNQDECRIRMAEVSRDASDPTRLTAGEGLYLDLP 145

Query: 132 LGPSKNKPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIP 182
            G     P+ +         +  C     + WN       V +  D G+SWR    +   
Sbjct: 146 RGCFVRAPLTVRADGAWLLPIFRCIQRPGQKWNGSHDRAAVGISEDCGKSWR----LEDI 201

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           ++S   +   P    +  L  + R       V  +ES DGG TWS  + T + N NS I 
Sbjct: 202 DQSTGCVHMSPLVIGDEKLAAVFRRRQA-DFVYRTESADGGRTWSAPQATDVPNNNSSIA 260

Query: 243 GVKLKDGRLLLAYNT-------------------------------------VSRGVLKV 265
            ++L DGR+ +  N                                      V R  + V
Sbjct: 261 AIRLNDGRIAMICNPTNAAMSSDRRASLYDELGEDDDRPDANPDGGCVPIWGVPRAPVTV 320

Query: 266 ALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQI 313
            +S+D   S+   +T+E+                E SYP +++ +DGS+HI YTY R  I
Sbjct: 321 CISEDGAKSFPQRITIEDGPGTCLSNNSIDGHNKEMSYPWLLEGADGSLHIAYTYYRRAI 380

Query: 314 KHVVLLP 320
           K+V L P
Sbjct: 381 KYVRLAP 387


>gi|336250192|ref|YP_004593902.1| glycosyl hydrolase family protein [Enterobacter aerogenes KCTC
           2190]
 gi|334736248|gb|AEG98623.1| glycosyl hydrolase, BNR repeat protein [Enterobacter aerogenes KCTC
           2190]
          Length = 388

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 146/357 (40%), Gaps = 71/357 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGGS EG  D+ +W      G   W   +   ++P     NPV
Sbjct: 33  HAANLLPLPDGTLLCVWFGGSQEGKADISVWGARLAPGATTWSEAVKLSDDPQRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P   L L +       + +  +  ++S + G  W     L   PG       +PI 
Sbjct: 93  LFQAPDGVLWLLWTAQHAGNQDTAIVRYRQSRDGGKHWGAIATLLDEPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ NG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL  +   
Sbjct: 150 VMPNGNWLLPVFYCRTRPGEKWVGNDDVSAVKISADGGKSWRD---VAVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
               A+ +L    RS      +  S S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLADGSLVAFFRS-RWADHIWYSRSTDFGESWSAPIPTTLPNNNSSIQVTTLSNGDLA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSWH 276
           L +N +S                                    R  + VA+S D G SW 
Sbjct: 265 LVFNNMSAAGASERRASLYDEIEDDDGRKEPERTGKTAFWGAPRAPMTVAISADGGKSWP 324

Query: 277 DALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
               L            E+ L  EFSYP++ Q  DG++HI YT+ R  IK++ + P 
Sbjct: 325 WMRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGEDGNLHIAYTWYRQAIKYIRVSPQ 381


>gi|358373759|dbj|GAA90355.1| glycosyl hydrolase BNR repeat-containing protein [Aspergillus
           kawachii IFO 4308]
          Length = 406

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 147/359 (40%), Gaps = 69/359 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HAS ++ +  G  L A+FGGS EG PD+ I+L   + G   W        +   
Sbjct: 27  APTVQSHASNLLRMPNGDLLCAWFGGSLEGKPDISIYLSRLRCGESEWTEASKMTHDHTR 86

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NPVLF  P+  L L Y       + +  +K   S + G TW   + L        + 
Sbjct: 87  SEQNPVLFHTPTGNLWLLYTSQHAGDQDTAIVKYRVSPDSGTTWGGEKVLFHDTGTFIRQ 146

Query: 138 KPILLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
             ++L++G     +  C     E W  N   S + V+ D GRSW +     IP ES   +
Sbjct: 147 PVVVLDDGAWVVPVFKCRVQPGERWLGNDDISCIRVSRDQGRSWTES---EIP-ESAGCV 202

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
                   + T   + RS      V ++ S D GL WS  +PT L NPN+GI    L  G
Sbjct: 203 HMEIQRLKDGTYLGMFRS-RWADNVYLATSPD-GLVWSTPQPTSLPNPNAGICFDVLPSG 260

Query: 250 RLLLAYNTVS---------------------------------------RGVLKVALSKD 270
           R++L YN  S                                       R  L VA S D
Sbjct: 261 RVVLVYNHSSKRDALGRRQGLYDDIADGVDERKNQAAAKDGRESFWGSPRAPLCVAWSDD 320

Query: 271 DGDSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G++W        D   L    E  L  E SYP++I A DG++HI YT+ R +IK+V L
Sbjct: 321 KGNTWKHRILEEGDGYCLTNNSERKLNRELSYPSMICA-DGTIHIAYTFWRQRIKYVQL 378


>gi|421724415|ref|ZP_16163637.1| hypothetical protein KOXM_01978 [Klebsiella oxytoca M5al]
 gi|410374785|gb|EKP29444.1| hypothetical protein KOXM_01978 [Klebsiella oxytoca M5al]
          Length = 390

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 93/357 (26%), Positives = 148/357 (41%), Gaps = 71/357 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      G  +W   +    + +    NPV
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGSRQWSEAVKLSNDASRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +       + +  +  ++S + G TW +   L   PG     +    +
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGDIATLLDKPGTF--IRQPVTV 150

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L+NG     +  C +   E W  N   S ++++ D G++WR    + +P +SL  +    
Sbjct: 151 LDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADKGKTWRD---VEVP-QSLGCVHMNI 206

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
                 TL  L RS      +  S+S D G +WS  +PT+L N NS I    L  G L L
Sbjct: 207 TALPGGTLVALFRS-RWADNIYYSQSVDNGESWSVPEPTELPNNNSSIQVTTLASGELAL 265

Query: 254 AYNTVS--------------------------------------RGVLKVALSKDDGDSW 275
            +N +S                                      R  + VA+S D G +W
Sbjct: 266 VFNHMSAAGALERRASLYDEIDDGDNSRKEPEVTSGRSAFWGAPRAPMTVAISADGGKTW 325

Query: 276 HDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                L+E                EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 326 PWQRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHITP 382


>gi|107023083|ref|YP_621410.1| glycosyl hydrolase [Burkholderia cenocepacia AU 1054]
 gi|116686675|ref|YP_839922.1| BNR repeat-containing glycosyl hydrolase [Burkholderia cenocepacia
           HI2424]
 gi|105893272|gb|ABF76437.1| glycosyl hydrolase, BNR repeat protein [Burkholderia cenocepacia AU
           1054]
 gi|116652390|gb|ABK13029.1| glycosyl hydrolase, BNR repeat-containing protein [Burkholderia
           cenocepacia HI2424]
          Length = 395

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 151/356 (42%), Gaps = 76/356 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ +  +D G  L  +FGG+ EG PDV I+L     G   W +P    ++P     NPV
Sbjct: 39  HAANLHCLDNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTDEWSAPRKLSDDPTRSEQNPV 98

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L Y       + +  ++R  S + G+TW   + L   PG       +PI+
Sbjct: 99  LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGLTWGPVDTLFDRPGTF---VRQPIV 155

Query: 142 -LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
            L++G     + LC +   E W  N   S + V+ D GR+W    P  +P +S+  +   
Sbjct: 156 ALDDGTWICPIFLCRTEPGERWVGNDDVSAVMVSDDQGRTW---TPHEVP-DSVGCVHMN 211

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      V  S S D G TWS  + T L N NS I    L++G L 
Sbjct: 212 VHVLHDGSLLALYRS-RWADFVHASRSTD-GRTWSAPERTDLPNNNSSIQFTVLQNGHLA 269

Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
           L +N                                           R  + +A+S+D G
Sbjct: 270 LVFNESDASHGKERRASLYDDIDDSEDSGELRDQQASVRGTAFWGAPRAPMSLAISEDGG 329

Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            +W     LE            +    E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 330 RTWPRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQSRDGALHIAFTWFRQKIKYV 385


>gi|423123646|ref|ZP_17111325.1| hypothetical protein HMPREF9694_00337 [Klebsiella oxytoca 10-5250]
 gi|376401727|gb|EHT14333.1| hypothetical protein HMPREF9694_00337 [Klebsiella oxytoca 10-5250]
          Length = 390

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 150/360 (41%), Gaps = 77/360 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  +  +FGG+ EG  D+ +W      GR  W   +   ++P+    NP+
Sbjct: 33  HAANLLPLPDGSLMCVWFGGTQEGVADISVWGSRLLPGRRQWCDAVKLSDDPSRSEQNPL 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P N L L +       + +  +  ++S + G TW +   L   PG     +    +
Sbjct: 93  LFLDPDNVLWLLWTAQISGNQDTAIVRYRQSTDLGQTWGDIATLLDKPGTF--IRQPMTV 150

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV---IQ 190
           L+NG     +  C +   E W  N   S ++++ D G++WR    + +P+    V   I 
Sbjct: 151 LDNGNWLLPVFYCRTRPGEKWVGNDDISAVKISADKGKTWRD---VEVPHSLGCVHMNIT 207

Query: 191 PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR 250
           P+P       L  L RS      +  S+S D G +WS  +PT L N NS I    L  G 
Sbjct: 208 PLP----GGVLVALFRS-RWADNIYYSQSVDNGESWSAPEPTVLPNNNSSIQVTTLASGE 262

Query: 251 LLLAYNTVS--------------------------------------RGVLKVALSKDDG 272
           L L +N +S                                      R  + VALS D G
Sbjct: 263 LALVFNPMSAAGALERRASLYDEIDDGDSSRKEPEANGGRSAFWGAPRAPMTVALSADGG 322

Query: 273 DSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            +W     L+E                EFSYP++ Q +DG++HI YT+ R  IK+V + P
Sbjct: 323 KTWPWRRNLDEGDGYCMTNNSLDKRNREFSYPSIKQGADGTLHIAYTWFRQAIKYVHITP 382


>gi|251788284|ref|YP_003003005.1| hypothetical protein Dd1591_0644 [Dickeya zeae Ech1591]
 gi|247536905|gb|ACT05526.1| conserved hypothetical protein [Dickeya zeae Ech1591]
          Length = 391

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 149/357 (41%), Gaps = 74/357 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I+L     G  +W   +   ++P     NPV
Sbjct: 36  HAANLLHLPNGDVLCVWFGGTQEGVSDISIYLSRLVKGSEQWTPAVKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  +  ++S ++G TW E   L   PG       +PI 
Sbjct: 96  LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSLDQGYTWGEIGVLLEQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCG-SSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C     E W  N   S ++++ D G++WR+     +PN +  V   +
Sbjct: 153 VLPNGDWLLPVFYCRIQPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      +  S S DGG +WS  + T L N NS I    L +G L 
Sbjct: 210 TL-LKDGTLLALFRS-RWADFIYRSHSTDGGESWSEPEATTLPNNNSSIQVTTLDNGHLA 267

Query: 253 LAYNTVS---------------------------------------RGVLKVALSKDDGD 273
           L +N ++                                       R  + +A+S+D G 
Sbjct: 268 LVFNAMNADGATERRLSLYDEIEDEEETDAKMPDISSGRSAFWGAPRAPMTLAISEDGGK 327

Query: 274 SWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SW     LE            +    EFSYP++ Q  DG +H+ +TY R  IK+V +
Sbjct: 328 SWPWQRNLEVGDGYCMTNNSTDKRNREFSYPSIKQGPDGKLHVAFTYFRQAIKYVCV 384


>gi|398377645|ref|ZP_10535819.1| putative neuraminidase (sialidase) [Rhizobium sp. AP16]
 gi|397726508|gb|EJK86942.1| putative neuraminidase (sialidase) [Rhizobium sp. AP16]
          Length = 395

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 150/370 (40%), Gaps = 84/370 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW +P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLSPGATRWSTPEKMSDDPAK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
              NP++F  P   + L Y   Q      G + +   S + G T+ E   L   + G   
Sbjct: 95  SEQNPLIFNAPGGDVWLLYT-SQTSGNQDGSVVKCRISADGGKTFGEVRVL-CDLPGTFV 152

Query: 137 NKPILLENG------LLLC-GSSVESW--NSWGSWMEVTVDAGRSW------RKYGPIYI 181
            +PI++ +       +  C G   E W  ++  + + ++ D G+SW         G +++
Sbjct: 153 RQPIIVNSRGAWLLPVFRCVGLPGERWTGDADTACVLISRDQGKSWVMTAVPDSLGAVHM 212

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
              +L     V F+  NR  +           V  S S DGG +WS  +P  L N NS I
Sbjct: 213 NIVALEGDDLVAFYR-NRFAQ----------SVLASRSTDGGESWSAPQPMDLPNNNSSI 261

Query: 242 DGVKLKDGRLLLAYN---------------------------------------TVSRGV 262
             VKLK G + + YN                                        V R  
Sbjct: 262 QAVKLKSGVIAIVYNHSNALMSDARRQSLYDEIEGGEAIDAAPLQTLERRKAVWGVPRAP 321

Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
           L +A S D G S+   + L+            ++L  EFSYP+++Q +DG VHI YTY R
Sbjct: 322 LSLAFSSDGGTSFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVQGADGVVHIAYTYYR 381

Query: 311 TQIKHVVLLP 320
             IK+V L P
Sbjct: 382 RAIKYVRLSP 391


>gi|373852619|ref|ZP_09595419.1| hypothetical protein Opit5DRAFT_3473 [Opitutaceae bacterium TAV5]
 gi|372474848|gb|EHP34858.1| hypothetical protein Opit5DRAFT_3473 [Opitutaceae bacterium TAV5]
          Length = 535

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 87/332 (26%), Positives = 148/332 (44%), Gaps = 46/332 (13%)

Query: 18  PANSAPFKSCHA---------STIVEVDKGHFLVAYFGGSCEGAP-DVKIWLQTFKDG-- 65
           P    P ++ HA         S++     G   V ++ G   G   +  + L T  DG  
Sbjct: 31  PGYVGPPQAAHAVTNRAFQGISSMAVSPGGRLWVIWYAGVTPGEDRNNYVVLATSGDGGK 90

Query: 66  RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
            W+  ++ D +   P+  ++P ++  P   L L+         W+       +   TW+ 
Sbjct: 91  TWKEVLVVDPDGPGPVRAFDPEVWMGPDGRLRLY---------WAQAAGHDGSISGTWT- 140

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SW--------MEVTVDAGR 171
            E   P    P+ + P  + +G+++C S V S   W     +W        M V+ D GR
Sbjct: 141 LETGEPESEAPAWSAPARISDGIMMCKSVVLSTGEWALPVSTWRKTDHSAKMLVSTDQGR 200

Query: 172 SWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           +W   G  ++P +  S  + +     + +LR+L+R+  GIG    S S D G+TW    P
Sbjct: 201 TWSVRGACHVPEKDRSFDEHMLVERGDGSLRMLVRTRYGIGE---SISTDRGVTWPELVP 257

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSWHDALTLEENLA 286
           + + +P++     +L  G LLL  +        R  L   +S DDG +W   L L+E   
Sbjct: 258 SAIPHPSARFFITRLHSGNLLLVKHGPMDQQTLRSHLTAFISTDDGRTWGGGLLLDERTG 317

Query: 287 MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +  SYP   QA+DG+V+ITY ++RT  +H++ 
Sbjct: 318 V--SYPDGQQAADGTVYITYDFSRTGTRHILF 347


>gi|408388656|gb|EKJ68335.1| hypothetical protein FPSE_11343 [Fusarium pseudograminearum CS3096]
          Length = 380

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 149/359 (41%), Gaps = 74/359 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++++     L  +FGGS EG PD+ IWL   + G   W +P     + N    NPV
Sbjct: 18  HASNLLQLPDKTVLCTWFGGSQEGLPDICIWLSRLESGSSSWSTPQKVSSDENRSCQNPV 77

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPGILGPSKNKPILL 142
           LF+ P NG L      Q+     G   +KR S ++G TWSE  +L P   G    +PI++
Sbjct: 78  LFRAPHNGELWLLHTSQDAGNQDGAYILKRISSDEGKTWSEASRLLPDATGIFIRQPIVV 137

Query: 143 ENG----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPIYIPNESLSVIQPV 192
            +     L +     E  + W    ++     + D G +W+ K  P  +    ++++ PV
Sbjct: 138 NSEGAWILPVFYCRTEPGHRWIGSDDISGVFYSQDHGTTWKEKQAPDSVGAVHMNIVPPV 197

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
              +   T   + RS      V  S S + G+ W   K T L NPNSGI   +L  G + 
Sbjct: 198 ---SGQSTWVAVYRS-RWADNVYRSTSKN-GIDWEEPKATSLPNPNSGICAARLSSGHIA 252

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N  S                                      R  L + +S+D+G +
Sbjct: 253 LVFNRSSASADTLKRKGLYDDITPEDDKRPNQVTKTGKDAIWGTPRKTLTLGVSQDEGLT 312

Query: 275 WHDALTLEENLAM------------EFSYPAVIQASDGSV---HITYTYNRTQIKHVVL 318
           W + + LE+                E SYP++     G +   H+ YT++R  IK+V +
Sbjct: 313 WSERV-LEDGDGFCGTNSSSGQENRELSYPSIYIEKKGEMDVAHVAYTWHRQHIKYVQI 370


>gi|83765156|dbj|BAE55299.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391864605|gb|EIT73900.1| putative neuraminidase [Aspergillus oryzae 3.042]
          Length = 464

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 146/350 (41%), Gaps = 69/350 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L A+FGGS EG PD+ I+L   + G   W   I    +      NPV
Sbjct: 120 HASNLLLLPNGDVLCAWFGGSMEGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPV 179

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLE 143
           LF+ P+  L L Y       + S  +KR  +K  G+TW + E L P      +   I+L+
Sbjct: 180 LFRTPTGDLWLLYTSQHAGNQDSAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLD 239

Query: 144 NG---LLLCGSSVESWNSW-----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
           +G   + +    VE    W      S + V+ D G++W +     IP ES   +      
Sbjct: 240 DGAWVIPVFKCRVEPGERWLGNNDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQR 295

Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
             + +   L RS      + ++ S D GL+WS  +PT L NPN+GI    L  GR+++ Y
Sbjct: 296 LKDCSYLGLFRS-RWADHIYLATSPD-GLSWSPPQPTILPNPNAGICFDVLPSGRVVVVY 353

Query: 256 NTVS---------------------------------------RGVLKVALSKDDGDSWH 276
           N  S                                       R  L VA S D G +W 
Sbjct: 354 NHSSKLDATGRRQGLYDDIADGVDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWE 413

Query: 277 DAL-----------TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
             +             E+ L  E SYP+++   +G +HI YT+ R +IK 
Sbjct: 414 RRVLEDGDGYCMTNNSEKKLNRELSYPSMV-LDEGKIHIAYTFWRQRIKQ 462


>gi|391229608|ref|ZP_10265814.1| BNR/Asp-box repeat protein [Opitutaceae bacterium TAV1]
 gi|391219269|gb|EIP97689.1| BNR/Asp-box repeat protein [Opitutaceae bacterium TAV1]
          Length = 535

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 73/271 (26%), Positives = 128/271 (47%), Gaps = 34/271 (12%)

Query: 67  WQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
           W+  ++ D +   P+  ++P ++  P   L L+         W+       +   TW+  
Sbjct: 92  WKEVLVVDPDGPGPVRAFDPEVWMGPDGRLRLY---------WAQAAGHDGSISGTWT-L 141

Query: 125 EQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SW--------MEVTVDAGRS 172
           E   P    P+ + P  + +G+++C   V S   W     +W        M V+ D GR+
Sbjct: 142 ETGEPESEAPAWSAPARISDGIMMCKPVVLSTGEWALPVSTWRKTDHSAKMLVSADGGRT 201

Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
           W   G  ++P +  S  + +     + +LR+L+R+  GIG    S S D G+TW    P+
Sbjct: 202 WSVRGACHVPEKDRSFDEHMLVERGDGSLRMLVRTRYGIGE---SVSTDRGVTWPELVPS 258

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNT-----VSRGVLKVALSKDDGDSWHDALTLEENLAM 287
            + +P++     +L+ G LLL  ++       R  L   +S DDG +W   L L+E   +
Sbjct: 259 AIPHPSARFFITRLQSGNLLLVKHSPMDQQTLRSHLTAFISTDDGRTWGGGLLLDERTGV 318

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
             SYP   QA+DG+V+ITY ++RT  +H++ 
Sbjct: 319 --SYPDGQQAADGTVYITYDFSRTGTRHILF 347


>gi|94501941|ref|ZP_01308450.1| BNR/Asp-box repeat protein [Bermanella marisrubri]
 gi|94425935|gb|EAT10934.1| BNR/Asp-box repeat protein [Oceanobacter sp. RED65]
          Length = 388

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 139/319 (43%), Gaps = 34/319 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVP--- 80
           S H++T +E+D G+ L  ++GG+ EG  DV ++   F   +  W       +  ++    
Sbjct: 63  SVHSATAIELDNGNLLAMWYGGTREGHTDVSLYKAVFDQSKQAWIGHEKVLDRYDISQQL 122

Query: 81  ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP-- 129
                 + NPVL K P+  ++LFY +   +  W+     M  SY+ G +W   ++L    
Sbjct: 123 NRYIKKVGNPVLVKHPAGPVVLFY-VTVSLGGWATSQVNMAVSYDDGQSWHSSKRLVTSP 181

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              I    KN  ++ ++G +      E +  +   + V +  G    KY    I +   +
Sbjct: 182 FINISTLVKNDAVIYQDGTIGITGYHELFGEFSEMIRVNLQ-GEVINKYR---ITDGDYT 237

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           +   V     +  + ++  S     +V  S++ DGG TWS  +   + NPNS + G K  
Sbjct: 238 IQPTVIVQDPDHAVALIRDSSRHTEKVHYSKTSDGGKTWSDYEKLAVDNPNSAVYGFKDN 297

Query: 248 DGRLLLAYNTVSRGV------LKVALSKDDGDSWHDALTLEE-----NLAMEFSYPAVIQ 296
             RL + +N  +R V      + +A+S D+G +W      E      NL   + YP V  
Sbjct: 298 KDRLWMLFNEATRQVEFPRNTMALAVSTDNGKTWTTKHYFENPGKDLNLNATYGYPWVTV 357

Query: 297 ASDGSVHITYTYNRTQIKH 315
            + G  H+ YT +R  I H
Sbjct: 358 TTSGDYHVFYTRDRESIVH 376


>gi|456062926|ref|YP_007501896.1| BNR/Asp-box repeat-containing protein [beta proteobacterium CB]
 gi|455440223|gb|AGG33161.1| BNR/Asp-box repeat-containing protein [beta proteobacterium CB]
          Length = 415

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 143/337 (42%), Gaps = 44/337 (13%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIAD 74
            P   AP  S HA++++ +  G     +F GS EGAPDV I    F  +  RW SP +  
Sbjct: 76  LPDTGAP--SVHAASLIALKDGAVRAFWFAGSREGAPDVVINTAVFDPRASRWGSPTVVM 133

Query: 75  EEPNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYN---KGVTWS 122
           +  +           + NPV  ++    + LF+ +   +  W+G    S     +G+TW 
Sbjct: 134 DRVSAEKGLSRYIAKLGNPVPARMADGRMQLFF-VTVSIGGWAGSSISSMISDDEGLTWD 192

Query: 123 EREQLPPGILGPSKNKPILLE-------NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
             ++L   I  P  N   L++       +G L   +  E    +G ++ +        R+
Sbjct: 193 RPQRL---ISSPLINLSTLVKSPAVSFTDGRLGLPAYHEWIGRFGEFLRIEASQVIDKRR 249

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMR---SFNGIGRVCMSESCDGGLTWSYAKPT 232
                  +     IQPV F    +      R   S +   ++ +SE+ D G +W   K  
Sbjct: 250 M------SSGRGAIQPVLFADGPQEASAFFRQTRSSSQAKQIPVSETKDAGQSWVVTKDL 303

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDS--WH------DALTLEEN 284
           ++ NPNS +  + L  G  L+  N +  G  ++ +   + +S  W       D  +L  N
Sbjct: 304 EIPNPNSAVAALTLASGARLMVLNNIEAGRYRLVMVMRESNSSQWQVVQLIEDDESLVNN 363

Query: 285 LAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
              EFSYP +I A+    H+ YT+NR +IKHV    N
Sbjct: 364 QHREFSYPYLISANGEDAHLVYTWNRQKIKHVYFPAN 400


>gi|46109856|ref|XP_381986.1| hypothetical protein FG01810.1 [Gibberella zeae PH-1]
          Length = 380

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 74/359 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++++     L  +FGGS EG PD+ IWL   + G   W +P     + N    NPV
Sbjct: 18  HASNLLQLPDKTVLCTWFGGSQEGLPDICIWLSRLEPGSTSWSTPQKVSSDENRSCQNPV 77

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC--MKR-SYNKGVTWSEREQLPPGILGPSKNKPILL 142
           LF+ P NG L      Q+     G   +KR S ++G TWSE  +L P   G    +PI++
Sbjct: 78  LFRAPHNGDLWLLHTSQDAGNQDGAYILKRTSSDEGKTWSEASRLLPNATGIFIRQPIVV 137

Query: 143 ENG----LLLCGSSVESWNSWGSWMEV-----TVDAGRSWR-KYGPIYIPNESLSVIQPV 192
            +     L +     E  + W    ++     + D G +W+ K  P  +    ++++ P 
Sbjct: 138 NSEGAWILPVFYCRTEPGHRWIGSDDISGVFYSQDNGATWKEKQAPDSVGAVHMNIVPPA 197

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
               +++T  V +        V  S S + G+ W   K T L NPNSGI   +L  G + 
Sbjct: 198 ----SDQTTWVAVYRSRWADNVYRSTSKN-GIDWEEPKATSLPNPNSGICAARLSSGHIA 252

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N  S                                      R  L + +S+D+G +
Sbjct: 253 LVFNRSSASADTLKRKGLYDDITPEDDKRPNQVTKTGKDAIWGTPRKTLTLGVSQDEGLT 312

Query: 275 WHDALTLEENLAM------------EFSYPAVIQASDGSV---HITYTYNRTQIKHVVL 318
           W + + LE+                E SYP++     G     H+ YT++R  IK+V +
Sbjct: 313 WSERV-LEDGDGFCGTNSSSGQENRELSYPSIYIERKGETDVAHVAYTWHRQHIKYVQI 370


>gi|266622136|ref|ZP_06115071.1| BNR/Asp-box repeat protein [Clostridium hathewayi DSM 13479]
 gi|288866158|gb|EFC98456.1| BNR/Asp-box repeat protein [Clostridium hathewayi DSM 13479]
          Length = 389

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/335 (27%), Positives = 148/335 (44%), Gaps = 47/335 (14%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMW 82
           K+ HA  ++E+  G  L  +F G+ EG+ DV I       G   W+ P    ++P     
Sbjct: 33  KTAHAPALLELSDGSMLCVWFTGTYEGSADVSIVCSKLPAGAKSWKRPEPVSKDPERSEQ 92

Query: 83  NPVLFKLPSNGLLLFYKI------GQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGP 134
           NP LF  P   +   Y        G++  +++  ++  +S + G TW E E L     G 
Sbjct: 93  NPSLFTGPEGDVWAVYTAQLDRMPGKDNMQYTSVIRCQKSVDGGETWGEAETLF-SRQGS 151

Query: 135 SKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
              +PI +LENG       LC  S+       +  +++ D G++WR    + +P+ +  V
Sbjct: 152 FCRQPIQILENGRWIFANWLCSDSLTGLAGDPTVFQLSDDQGKTWRM---VRMPDSNGRV 208

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              V      R L   MRS      +  SES D G  W+    T L N NS I  ++LK 
Sbjct: 209 HANVVELGGGR-LAAFMRS-RSADFIYRSESQDWGENWTVPMATSLPNNNSSISAIRLKS 266

Query: 249 GRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE--------ENLA 286
           GR+ +AYN                 R  + VALS+D+G +W     +E        EN  
Sbjct: 267 GRIGIAYNPTHAANPIAGTAAWPGLRCPVAVALSEDEGLTWPVIRYMERGEGYAGPENRT 326

Query: 287 --MEFSYPAVIQASDGSVHITYT-YNRTQIKHVVL 318
              ++ YP ++Q +DG +H+ +   +R+ IK++  
Sbjct: 327 NNRQYEYPYLMQGTDGRLHLAFAGKDRSCIKYMCF 361


>gi|444351569|ref|YP_007387713.1| expressed protein [Enterobacter aerogenes EA1509E]
 gi|443902399|emb|CCG30173.1| expressed protein [Enterobacter aerogenes EA1509E]
          Length = 388

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 144/357 (40%), Gaps = 71/357 (19%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGGS EG  D+ +W      G   W       ++P     NPV
Sbjct: 33  HAANLLPLPDGTLLCVWFGGSQEGKADISVWGARLAPGAASWSEATKLSDDPQRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF+ P   L L +       + +  +  ++S + G  W     L   PG       +PI 
Sbjct: 93  LFQAPDGVLWLLWTAQHAGNQDTAIVRYRQSRDGGKHWGAIATLLDEPGTF---IRQPIS 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           ++ NG     +  C +   E W  N   S ++++ D G+SWR    + +P +SL  +   
Sbjct: 150 VMPNGNWLLPVFYCRTRPGEKWVGNDDVSAVKISADGGKSWRD---VAVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
               A+ +L    RS      +  S S D G +WS   PT L N NS I    L +G L 
Sbjct: 206 ITPLADGSLAAFFRS-RWADHIWYSRSTDFGESWSAPIPTTLPNNNSSIQVTTLSNGDLA 264

Query: 253 LAYNTVS------------------------------------RGVLKVALSKDDGDSWH 276
           L +N +S                                    R  + VA+S D G SW 
Sbjct: 265 LVFNNMSAAGASERRASLYDEIEDDDGRKEPERTGKTAFWGAPRAPMTVAISADGGKSWP 324

Query: 277 DALTL------------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
               L            E+ L  EFSYP++ Q  DG++ I YT+ R  IK+V + P 
Sbjct: 325 WMRNLDEGDGYCMTNNSEQKLNREFSYPSIKQGEDGNLQIAYTWYRQAIKYVRVSPQ 381


>gi|320103153|ref|YP_004178744.1| neuraminidase (sialidase)-like protein [Isosphaera pallida ATCC
           43644]
 gi|319750435|gb|ADV62195.1| neuraminidase (sialidase)-like protein [Isosphaera pallida ATCC
           43644]
          Length = 401

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 150/364 (41%), Gaps = 74/364 (20%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR--WQSPIIADE 75
           AN A + + HAS +VE      LVA++ GS E  A DV+I     K G+  W       +
Sbjct: 40  ANDAVYHN-HASCVVECPNSDLLVAWYRGSGERDADDVEILAARLKKGQTTWGPVFPLID 98

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS 135
            P  P  NP L  +  +G    ++      +W G +       + ++  +  P G   P 
Sbjct: 99  TPGYPDCNPALL-VGDDGSFWVFRPTILDHRWEGAL-------LKFARADACPEGDAAPV 150

Query: 136 KNKPILL-----------ENGLLLCGSSVESWNSWGSWMEVTVDAGRS---WRKYG---- 177
            N+  +L           +  L      + +    G +++  VDA R    +++ G    
Sbjct: 151 WNREGVLHLTPVGLSEAIQASLDRLPEEIRAHPRLGGYLD-EVDARRDDELYQRLGWMPR 209

Query: 178 -------------PIYIPNESLSV----------------------IQPVPFHTANRTLR 202
                        P+Y    S+S+                      IQP     ++ +L 
Sbjct: 210 CRPVILSTGRWILPLYTDTYSVSLMAYSDDRGATWNTGRMIPGIFAIQPAVVERSDGSLV 269

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--VSR 260
              R+     R+  + S DGG+TWS A  T L NP +G+D ++L  G+ +L YN   + R
Sbjct: 270 AYCRNAGAGRRILSATSNDGGVTWSEAIETTLPNPGAGVDAIRLASGKFVLVYNDTEIGR 329

Query: 261 GVLKVALSKDDGDSWHDALTLEENLAME--FSYPAVIQASDGSVHITYTYNRTQ----IK 314
             L VALS D+G +W+    LE     +  F YP+++Q  DG +H+ YT  +      I+
Sbjct: 330 HSLAVALSNDEGRTWNPPRFLERQPPDQGAFHYPSILQTRDGQIHVLYTAGKAHRGETIR 389

Query: 315 HVVL 318
           H V 
Sbjct: 390 HAVF 393


>gi|452985679|gb|EME85435.1| hypothetical protein MYCFIDRAFT_131178 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 393

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 150/387 (38%), Gaps = 90/387 (23%)

Query: 13  EEFTFPANSAPFK-----------SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL-- 59
           + FT    S PF             CHAS ++ +D G  L A+FGG+ EG PD+ I+L  
Sbjct: 12  KNFTLTTKSHPFPHSECFIPSTTPQCHASNLLILDNGDMLCAWFGGTQEGKPDISIYLSR 71

Query: 60  QTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNK 117
           +T     W        +      NPVLF+ PS  L L Y       + S  +K+  S++ 
Sbjct: 72  KTPASTSWSQASRVTFDTTRSEQNPVLFQTPSGELWLLYTSQNLGDQDSALVKKQSSHDN 131

Query: 118 GVTWSEREQLPPGILGPS-----KNKPILLENGL------LLCGSSVES-W--NSWGSWM 163
           G++WS      P IL P      +   I+L++GL        C S   + W  N   S +
Sbjct: 132 GISWST-----PTILFPEPGTFIRQPVIILDDGLTWVIPTFKCRSEPGTRWVGNDDISAL 186

Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
            ++ D G++W +     +P+ S   +        +++     RS           +   G
Sbjct: 187 RISQDEGKTWMEK---EVPH-SFGAVHMAIRQLPDKSYLAFYRS--RWADFIHKSTSRNG 240

Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------------ 259
           + WS   PT L NPN+GI    L  GR+L+ YN  S                        
Sbjct: 241 IDWSEPIPTPLPNPNAGICFDVLPSGRILVVYNHSSRVNARGRREGLYDDIAAADDLRRN 300

Query: 260 ---------------RGVLKVALSKDDGDSWH-------DALTLEENLAM----EFSYPA 293
                          R  L V  S D G  W        D   L  N  M    E SYP+
Sbjct: 301 QGSKHAGKEAFWGAPRAPLCVGWSDDGGMEWKHRVLEEGDGFCLTNNSEMKLNRELSYPS 360

Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
           ++   DG V++ +TY R  IK+V   P
Sbjct: 361 IVVGKDGGVNVAFTYWRQGIKYVRFDP 387


>gi|241258892|ref|YP_002978776.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240863362|gb|ACS61025.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 395

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++       GRW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSGRWSVPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQL--PPGILGP 134
              NP++FK P   + L Y   Q      G +   + S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFKAPDGNVWLLYT-SQTSGNQDGSVVKFRVSGDGGQTFGPVQILCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V +  D G SW+      IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGASWQMRD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     +  +      F     +  S S DGG TWS  +P +L N NS I   
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYN---------------------------------------TVSRGVLKV 265
            L DG + + YN                                        V R  L +
Sbjct: 265 ILNDGAIAMVYNHSNAGMSDARRQSLYDEIEGGDAGETTVVADTSARKAVWGVPRAPLSL 324

Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           A+S+D G S+   + L+            E+L  EFSYP+++Q  DG++HI YTY R  I
Sbjct: 325 AISRDGGKSFPHRIDLDTGDGFCLSNNSKESLNREFSYPSIVQGGDGTLHIAYTYYRRAI 384

Query: 314 KHVVLLP 320
           K+V L P
Sbjct: 385 KYVRLAP 391


>gi|326799313|ref|YP_004317132.1| neuraminidase (sialidase)-like protein [Sphingobacterium sp. 21]
 gi|326550077|gb|ADZ78462.1| neuraminidase (sialidase)-like protein [Sphingobacterium sp. 21]
          Length = 391

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 161/375 (42%), Gaps = 74/375 (19%)

Query: 11  VAEE-FTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIW--LQTFKDGR 66
           VAE    FP  +   K  H S++  +  G  L A+F GS E GA DV I    +  +   
Sbjct: 24  VAEATLIFPLQA---KHVHGSSLAVLPNGDVLAAWFYGSGERGADDVCIMGSRKRAESKT 80

Query: 67  WQSPIIADEEPNVPMWNPVLFKLPSNGL---------------LLFYKIGQEVQK----- 106
           W  P +  +   +P  NPVLF      L               +L Y++  E +      
Sbjct: 81  WSKPFLMADTKGIPDCNPVLFLNHKKQLYLAWIAVQANRWEQSILRYRVADEYEGSAAPN 140

Query: 107 --WSGCMKRSYNKGVTWSEREQL--------------PP-------GILGPSKN------ 137
             W   +    +       +E+L              PP           P+K       
Sbjct: 141 WGWQDNILLKPDSSFAQELKEKLSMLPDQQHGWAAYAPPYDEMIVTNAQDPAKRSFGWMT 200

Query: 138 --KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
             KP+LL +  +L     + +N   S + V+ D G+SWR   PI         IQP    
Sbjct: 201 RIKPLLLADNKILLPLYSDGFNC--SLIAVSEDDGKSWRPSKPI----AGRGNIQPALIR 254

Query: 196 TANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA 254
                +   MR S +   R+  S S DGG TW+ A+ T++ N  S ++ ++L DGR LL 
Sbjct: 255 RKQGDIIAYMRDSGDAPARIQQSISMDGGETWTVARKTEIPNTAS-VEVIRLLDGRWLLV 313

Query: 255 YNTVSRG--VLKVALSKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTY-- 308
            N V  G   L + LS+D+G SW  +  LE  +   +  SYP+++Q++DG +H++Y+Y  
Sbjct: 314 GNDVDDGRYQLSLYLSEDEGTSWKLSTLLEADDKKKLSCSYPSIVQSTDGFIHLSYSYHV 373

Query: 309 --NRTQIKHVVLLPN 321
             ++  IK++ + P+
Sbjct: 374 TSDKKSIKYLKISPD 388


>gi|346972642|gb|EGY16094.1| glycosyl hydrolase [Verticillium dahliae VdLs.17]
          Length = 381

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/358 (26%), Positives = 147/358 (41%), Gaps = 72/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVPMWNPV 85
           HAS ++++     L A+FGGS EG PD+ IWL  QT     W  P     +      NPV
Sbjct: 19  HASNLLQLPDKTLLCAWFGGSQEGLPDIGIWLSRQTPGSNEWSPPQKISSDTGRSCQNPV 78

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
           LF+ P  G +  +   Q+     G +   + S + GVTWS+       + G    +P ++
Sbjct: 79  LFRAPRTGHIWLFHTSQDAGNQDGALVMVRTSSDHGVTWSDPRYPFKDLKGVFVRQPLVV 138

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPV 192
           L++G     + LC S+  + W  N   S +  + D G +W  K  P  I    ++++ PV
Sbjct: 139 LKDGTWVLPVFLCRSAAGQRWIGNDDISAVLFSRDDGVTWEEKRVPGGIGAVHMNIVPPV 198

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                        RS     +V  S S D G+ W+  +P  L NPNSGI   +L  G + 
Sbjct: 199 ---RGQDQYVAFFRS-RWADKVYRSTSTD-GIDWTSPEPNSLPNPNSGICAARLASGSIA 253

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           + +N  +                                      R  L V +S D G +
Sbjct: 254 IVFNDSAAVTNMARREGLYDDITPEDDKRPNQVSVNGKSAIWGTPRKALTVGISSDQGIT 313

Query: 275 WH-------DALTL----EENLAMEFSYPAVIQASDGS---VHITYTYNRTQIKHVVL 318
           W        D   +    E+ +  E SYP++    D S   VHI +T++R  IK+V +
Sbjct: 314 WTTRVLEDGDGFCMTNNSEKKMNRELSYPSIWAEQDVSHAPVHIAFTFHRQHIKYVRI 371


>gi|343083102|ref|YP_004772397.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351636|gb|AEL24166.1| hypothetical protein Cycma_0387 [Cyclobacterium marinum DSM 745]
          Length = 389

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 82/303 (27%), Positives = 143/303 (47%), Gaps = 33/303 (10%)

Query: 29  ASTIVEVDKGHFLVAYFGGSCEGAPD------VKIWLQTFKDGRWQSPIIADEEPN--VP 80
           +S  ++ DKG F   ++ G    +PD      V +   T K   W+  ++ D +    V 
Sbjct: 61  SSLAIQEDKGEFWAVWYAGI---SPDEDENNYVVLSSSTDKGETWKETLVIDPDGGGMVR 117

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQK-------WSGCMKRSYNKGVTWSEREQLPPGILG 133
            ++P ++  PS  + +F+   Q V K       W+   + S + G +WS   +L  G++ 
Sbjct: 118 AFDPEVWVDPSGKVWVFW--AQAVDKDAKIGGVWAIIGETSVDGGTSWSAPRRLTDGVM- 174

Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
               KP++L NG  +  +S        + M V+ D G++W   G +++P E  S  +   
Sbjct: 175 --MCKPLILNNGDWVLPASTWRETDHSAKMVVSKDQGKTWEVVGGVHVPKEFRSFDEHHF 232

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               +  L  L+R+  GIG    S S D G TWS  + + + +P++     +L+ G LLL
Sbjct: 233 VEKEDGKLWALLRTKYGIGE---SYSSDSGKTWSALEESFIAHPSARFFIRRLQSGNLLL 289

Query: 254 AYN-----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
             +        R  +   +SKDDG +W   L L++ + +  SYP   Q +DG+++ITY  
Sbjct: 290 VKHGPISVKTGRSHMMAFISKDDGATWSKGLLLDQRINV--SYPDGQQTADGTIYITYDR 347

Query: 309 NRT 311
           +RT
Sbjct: 348 SRT 350


>gi|134101090|ref|YP_001106751.1| BNR repeat-containing glycosyl hydrolase [Saccharopolyspora
           erythraea NRRL 2338]
 gi|291004090|ref|ZP_06562063.1| BNR repeat-containing glycosyl hydrolase [Saccharopolyspora
           erythraea NRRL 2338]
 gi|133913713|emb|CAM03826.1| glycosyl hydrolase, BNR repeat [Saccharopolyspora erythraea NRRL
           2338]
          Length = 394

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 92/365 (25%), Positives = 140/365 (38%), Gaps = 70/365 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ ++ +  G     +F G+ EG  D+ +W      G   W  P+   ++   
Sbjct: 27  APAVQSHAANLMPLPDGDLGCVWFSGTQEGVADIGVWFSRLAAGGDTWSEPVRLSDDQTR 86

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--CMKRSYNKGVTWSEREQLPPGILGPS-- 135
              NPVLF  P   L L +       + +    ++ S + G TW     L          
Sbjct: 87  SEQNPVLFPAPGGELWLLHTAQHAGDQDTAEVRVRTSGDSGRTWGPVRTLVAATEAGGVF 146

Query: 136 -KNKPILLENGLLLCGSSVESWNSWGSW--------MEVTVDAGRSWRKYGPIYIPNESL 186
            +   ++L +G  L      +   +G W        + V+ D G +W       +P+ + 
Sbjct: 147 VRQPVVVLRSGRWLLPVFHCARPEFGKWVGDDDTSALLVSDDRGATWHHE---EVPDSTG 203

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
            V   V     + +L  L RS      +  S S DGG +WS   PT L N NS I   +L
Sbjct: 204 CVHMNV-VELDDGSLLALFRS-RWADAIHESRSHDGGRSWSAPVPTALPNNNSSIQCTRL 261

Query: 247 KDGRLLLAYNTVS--------------------------------------RGVLKVALS 268
           +DGRL L +N  S                                      R  L +ALS
Sbjct: 262 RDGRLALVFNASSAADATERRTSLYDEIDDNGITGSPSTQGGPSRAFWGAPRAPLTLALS 321

Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            D G +W     +E            + L  E SYP+V Q +DG +HI +TY+R  +KHV
Sbjct: 322 SDGGRTWPVRRDVETGDGYCMTNNSRDGLNRELSYPSVAQTADGVLHIAFTYHRRAVKHV 381

Query: 317 VLLPN 321
            + P+
Sbjct: 382 RIDPD 386


>gi|325003240|ref|ZP_08124352.1| BNR repeat-containing glycosyl hydrolase [Pseudonocardia sp. P1]
          Length = 402

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 144/375 (38%), Gaps = 88/375 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HA+ ++E+  G     +FGG+ EG PD+ +     + G  RW  P+   ++P  
Sbjct: 35  APDVQNHAANLMELPGGALGCVWFGGTQEGVPDIGVRYSRLEPGSDRWTGPVALSDDPER 94

Query: 80  PMWNPVLFKLPSNGLLLFY--KIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPS-- 135
              NP+LF  PS  + L +  ++          ++ S + G TW  R  L P   GP   
Sbjct: 95  SEQNPLLFVAPSGTVWLLWTAQVAGNQDTAEVRVRTSGDGGRTWDARRTLFPA--GPGGG 152

Query: 136 ---KNKPILLENGLLL-----CGSSVESWNSWGSWME--------VTVDAGRSWRKYGPI 179
              +   ++  +G  L     C +  E     G W+         V+ D G +WR+    
Sbjct: 153 VFVRQPVVVTRSGRWLLPVWHCVTPPE-----GRWVGDLDTSAVMVSDDDGATWRE---T 204

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            +P+ +  V   V     + +L  L RS      V  S S D G TWS  +P +L N NS
Sbjct: 205 AVPDSTGQVHMNV-VARPDGSLVALYRSRRADA-VHRSVSTDDGGTWSAPEPVELPNNNS 262

Query: 240 GIDGVKLKDGRLLLAYNT------------------------------------------ 257
            +  V L DGRL L  N                                           
Sbjct: 263 SVQVVGLADGRLALVGNPSSRADATARRASLYDEISDSGDVGTAGGAAAADALREGAFWG 322

Query: 258 VSRGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHIT 305
            SR  + +A+S D G +W     LE                 EFSYP +  A+DG + I 
Sbjct: 323 ASRAPMTLAISDDGGRTWPVRRDLETGDGFCLVNDSRGKRNREFSYPTIRAAADGGLEIA 382

Query: 306 YTYNRTQIKHVVLLP 320
           YT  R  I+HV L P
Sbjct: 383 YTVYRQAIRHVHLTP 397


>gi|302405557|ref|XP_003000615.1| glycosyl hydrolase [Verticillium albo-atrum VaMs.102]
 gi|261360572|gb|EEY23000.1| glycosyl hydrolase [Verticillium albo-atrum VaMs.102]
          Length = 381

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 148/358 (41%), Gaps = 72/358 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWL--QTFKDGRWQSPIIADEEPNVPMWNPV 85
           HAS ++++     L A+FGGS EG PD+ IWL  QT     W  P     +      NPV
Sbjct: 19  HASNLLQLPNKTLLCAWFGGSQEGLPDISIWLSRQTPGSNEWSPPQKISSDTGRSCQNPV 78

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPGILGPSKNKP-IL 141
           LF+ P  G +  +   Q+     G +   + S + GVTWSE       + G    +P ++
Sbjct: 79  LFRAPRTGHIWLFHTSQDAGNQDGALVMVRTSSDHGVTWSEPSYPFKDLKGVFVRQPLVV 138

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPV 192
           L++G     + LC S+  + W  N   S +  + D G +W  K  P  I    ++++   
Sbjct: 139 LKDGTWVLPVFLCRSAAGQRWIGNDDISAVLFSRDDGVTWEEKRVPGGIGAVHMNIVP-- 196

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
           PF   ++ +      +    +V  S S D G+ W+  +P  L NPNSGI   +L  G + 
Sbjct: 197 PFRGQDQYVAFFRSRW--ADKVYRSTSTD-GIDWASPEPISLPNPNSGICAARLPSGSIA 253

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           + +N  +                                      R  L V +S D G +
Sbjct: 254 IVFNKSAAVPNMTRREGLYDDITPEGDKRPNQVSVNGRSAIWGTPRKALTVGISSDQGIT 313

Query: 275 WH-------DALTL----EENLAMEFSYPAVIQASDGS---VHITYTYNRTQIKHVVL 318
           W        D   +    E+    E SYP++    D S   VHI +T++R  IK+V +
Sbjct: 314 WKTRVLEDGDGFCMTNNSEKKANRELSYPSIWTEQDVSHAPVHIAFTFHRQHIKYVRI 371


>gi|170735438|ref|YP_001774552.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
           [Burkholderia cenocepacia MC0-3]
 gi|169821476|gb|ACA96057.1| glycosyl hydrolase BNR repeat-containing protein [Burkholderia
           cenocepacia MC0-3]
          Length = 395

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 150/356 (42%), Gaps = 76/356 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ +  +D G  L  +FGG+ EG PDV I+L     G   W +P    ++P     NPV
Sbjct: 39  HAANLHCLDNGDLLCVWFGGTQEGIPDVSIYLSRLAAGTDEWSAPRKLSDDPTRSEQNPV 98

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L Y       + +  ++R  S + G+TW   + L   PG       +PI+
Sbjct: 99  LFTAPDGRLWLIYTAQLSGHQNTAIVRRRVSTDGGLTWGPVDTLFDRPGTF---VRQPIV 155

Query: 142 -LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
            L++G     + LC +   E W  N   S + V+ D GR+W  +    +P  S+  +   
Sbjct: 156 ALDDGTWICPIFLCRTEPGERWVGNDDVSAVMVSDDQGRTWTLH---EVP-ASVGCVHMN 211

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + +L  L RS      V  S S D G TWS  + T L N NS I    L++G L 
Sbjct: 212 VHVLHDGSLLALYRS-RWADFVHASRSTD-GRTWSAPERTDLPNNNSSIQFTVLQNGHLA 269

Query: 253 LAYNT----------------------------------------VSRGVLKVALSKDDG 272
           L +N                                           R  + +A+S+D G
Sbjct: 270 LVFNESDASHGKERRASLYDDIDDSEDSGELRDQQASVRGTAFWGAPRAPMSLAISEDGG 329

Query: 273 DSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            +W     LE            +    E+SYP+++Q+ DG++HI +T+ R +IK+V
Sbjct: 330 RTWPRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQSRDGALHIAFTWFRQKIKYV 385


>gi|338209583|ref|YP_004653630.1| neuraminidase (sialidase)-like protein [Runella slithyformis DSM
           19594]
 gi|336303396|gb|AEI46498.1| neuraminidase (sialidase)-like protein [Runella slithyformis DSM
           19594]
          Length = 393

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 156/375 (41%), Gaps = 75/375 (20%)

Query: 10  LVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCE-GAPDVKIWLQTFKDGR-- 66
           +V     FP      K  H S++V +  G FL  +F GS E  A DVKI     K G   
Sbjct: 24  IVRSTLIFPGQE---KHVHGSSLVNLPNGDFLSVWFYGSGERTADDVKIMGARLKKGETV 80

Query: 67  WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR-----SYNKGV-- 119
           W  P    + P++P  NPVLF L   G L    I  +  +W   + R      Y K    
Sbjct: 81  WSEPFEMADTPHLPDCNPVLF-LNQQGKLFLVWIAVQANQWEQSILRVKTSTDYTKSGPP 139

Query: 120 --TWSEREQLPPG-------------------------------ILGPSKN--------- 137
              W +   L P                                I+  +K+         
Sbjct: 140 VWNWQDNILLKPDERFANEVAEKFKKLPENTAGWAGYARKYDDMIIEAAKDAPKRSIGWM 199

Query: 138 ---KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPF 194
              KP+L+ENG ++     + +N   S M ++ D G +W+   P+         IQP   
Sbjct: 200 TRIKPMLMENGRIVLPLYSDGYNF--SLMAISDDFGTTWKPSLPLV----GRGPIQPALA 253

Query: 195 HTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
              N  L   +R S +   RV  SES D G +W+ +  T + N  + ++ + L+DG    
Sbjct: 254 KRKNGNLVAYLRDSGDEPTRVHYSESSDKGESWTASVKTDIPN-TASVELLTLQDGTWAF 312

Query: 254 AYNTVSRGVLKVAL--SKDDGDSWHDALTLEENLAME--FSYPAVIQASDGSVHITYTYN 309
             N +  G  +V+L  S D+G +W+    LE++   +  +SYP +IQ +DG +HITY+ +
Sbjct: 313 LGNDIDDGRYRVSLYLSDDEGKTWNRKTRLEDHPRDKGGYSYPCLIQTADGLLHITYSAH 372

Query: 310 ----RTQIKHVVLLP 320
                  IK+VV+ P
Sbjct: 373 ADNKNKSIKYVVVDP 387


>gi|398793175|ref|ZP_10553649.1| putative neuraminidase (sialidase) [Pantoea sp. YR343]
 gi|398211242|gb|EJM97863.1| putative neuraminidase (sialidase) [Pantoea sp. YR343]
          Length = 395

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 146/358 (40%), Gaps = 76/358 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L  +FGG+ EG  D+ +W    + G  +W       ++P     NPV
Sbjct: 36  HASNLLPLPNGDLLCVWFGGTQEGIADISVWCSRLESGSTQWSEAQKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L +   +   + +  ++  +S + G  W     L   PG     +   ++
Sbjct: 96  LFLDPEQVLWLLWTAQKSGNQDTAIVRYRQSRDFGRNWGAINTLLDQPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L NG     +  C +     W  N   S ++++ D G+SWR    + +P ESL  +    
Sbjct: 154 LPNGNWLLPVFYCRTQPGVKWVGNDDISAVKISSDKGQSWRD---VAVP-ESLGCVHMNI 209

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L  L RS      +  S S DGG +WS    T+L N NS I    L++G L L
Sbjct: 210 TLLQDGSLLALYRS-RWADFIYQSRSTDGGESWSAPLATELPNNNSSIQVTTLRNGHLAL 268

Query: 254 AYNTVS-------------------------------------------RGVLKVALSKD 270
            +N +S                                           R  + +A+S D
Sbjct: 269 VFNAMSAKDASERRLSLYDEIEDEEDDGDVAVAAEPVVHSGRTAFWGAPRAPMTLAISAD 328

Query: 271 DGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            G SW     L+E             L  EFSYP++ Q+ DG++H+ YTY R  IK+V
Sbjct: 329 GGKSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSDDGALHVAYTYFRQAIKYV 386


>gi|242238139|ref|YP_002986320.1| hypothetical protein Dd703_0687 [Dickeya dadantii Ech703]
 gi|242130196|gb|ACS84498.1| conserved hypothetical protein [Dickeya dadantii Ech703]
          Length = 393

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 149/359 (41%), Gaps = 76/359 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ I++     G  +W       E+P     NPV
Sbjct: 36  HAANLLHLPNGDLLCVWFGGTQEGVADISIYMSRLPQGSQQWTPAEKLSEDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P   L L Y   +   + +  ++  +S + G TW +   L   PG       +PI 
Sbjct: 96  LFLAPDGVLWLLYTAQKSGNQDTAIVRYRQSTDLGHTWGDIGVLLEQPGTF---IRQPIT 152

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L NG     +  C +   E W  N   S ++++ D G++WR+     +PN +  V   +
Sbjct: 153 VLPNGNWLLPVFYCRTRPGEKWVGNDDISAVKISSDQGKTWRES---VVPNSTGCVHMNI 209

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                + TL  L RS      + +S S DGG +WS    T L N NS I    L++G L 
Sbjct: 210 TL-LQDGTLLALYRS-RWADAIYLSRSTDGGESWSDPVATDLPNNNSSIQVTTLRNGHLA 267

Query: 253 LAYNTVS-----------------------------------------RGVLKVALSKDD 271
           L +N ++                                         R  + VA+S+D 
Sbjct: 268 LVFNHMNADGATERRVSLYDEIEDEEDEGQNAVMPEITSERSAFWGAPRAPMTVAISEDG 327

Query: 272 GDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           G SW     +E            E L  EFSYP++ Q  D  +H+ +TY R  IK+V +
Sbjct: 328 GRSWPWQRNVEIGDGYCMTNNSTEKLNREFSYPSIKQGPDDKLHMAFTYFRQAIKYVCV 386


>gi|432494493|ref|ZP_19736311.1| hypothetical protein A173_01658 [Escherichia coli KTE214]
 gi|431028009|gb|ELD41054.1| hypothetical protein A173_01658 [Escherichia coli KTE214]
          Length = 344

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 96/344 (27%), Positives = 141/344 (40%), Gaps = 72/344 (20%)

Query: 41  LVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY 98
           +  +F G+ EG  D+ +W      G  +W   +    +      NPVLF  P N L L +
Sbjct: 1   MCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPVLFLAPDNVLWLLW 60

Query: 99  KIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPI-LLENG-----LLL 148
                  + +  +  ++S + G TW E   L   PG       +PI +L+NG     +  
Sbjct: 61  TAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPITVLDNGNWLLPVFY 117

Query: 149 CGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
           C +   E W  N   S ++++ D G SWR    + +P +SL  +        N TL  L 
Sbjct: 118 CRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMNITMLHNGTLVALF 173

Query: 206 RSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------ 259
           RS      + +S S D G +WS  + T+L N NS I    L  G L L YN +S      
Sbjct: 174 RS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELALVYNAMSAAGAVE 232

Query: 260 -------------------------------RGVLKVALSKDDGDSWHDALTLE------ 282
                                          R  + VA+S D G SW     L+      
Sbjct: 233 RRASLYDEIDDGDDSRKEPTAVGRSAFWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYC 292

Query: 283 ------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                 E L  EFSYP++ Q+ DG++HI YT+ R  IK+V + P
Sbjct: 293 MTNNSLEKLNREFSYPSIKQSPDGTLHIAYTWWRQAIKYVRISP 336


>gi|150376129|ref|YP_001312725.1| hypothetical protein Smed_3984 [Sinorhizobium medicae WSM419]
 gi|150030676|gb|ABR62792.1| conserved hypothetical protein [Sinorhizobium medicae WSM419]
          Length = 398

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 87/373 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  ++ G  + A+FGG+ EG  D+ I+      G  RW  P     + + 
Sbjct: 28  SPTVQNHAAFLHRLEDGSLVCAWFGGTLEGKSDISIFASVLAPGAERWGEPQRLSFDADH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +   ER+   P  L  ++ + 
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPAGNQDECRIR-----MAAIERDPDDPTKLDAAEGRY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + ++ D G  WR    
Sbjct: 142 LDLPRGCFIRAPLVVRDDGAWLLPIFRCIQRPGQKWNGSHDNAAVGISTDGGERWR---- 197

Query: 179 IYIPNESLSVIQ--PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
           +    +S+  +   PVP            R  + + RV   ES DGG TWS   PT + N
Sbjct: 198 LADLEDSIGCVHMSPVPVGGDGYAAFFRRRQADQVYRV---ESADGGRTWSLPAPTDVPN 254

Query: 237 PNSGIDGVKLKDGRLLLAYNTVS------------------------------------- 259
            NS I  ++L DGRL +  N V+                                     
Sbjct: 255 NNSSIAVIRLSDGRLAMICNPVNAALSPDRRASLYDELGEDDGRPDADPSGGCVPIWGVP 314

Query: 260 RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYT 307
           R  + V LS D G ++ + + +E            +    E SYP +++  DGS+HI YT
Sbjct: 315 RAPVSVCLSDDGGRTFPERIVIENGPGTCLSNNSTDGHNKEMSYPWLLEGDDGSLHIAYT 374

Query: 308 YNRTQIKHVVLLP 320
           ++R  IK+V L P
Sbjct: 375 FHRRAIKYVRLPP 387


>gi|116249553|ref|YP_765391.1| hypothetical protein pRL90099 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115254201|emb|CAK03814.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 395

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLTPGSDRWSVPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++FK P   + L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFKAPDGKVWLLY-TSQTSGNQDGSVVKCRISGDGGQTFGPVQILCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V V  D G SW+      IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLVSRDGGASWQMRD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     +  +      F     +  S S DGG TWS  +P +L N NS I   
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
            L DG + + YN                                        V R  L +
Sbjct: 265 ILNDGAIAMVYNHSNAAMSDARRQSLYDEIEGGEAEETAVIADSSARKAVWGVPRAPLSL 324

Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           A+S+D G S+   + L+            ++L  EFSYP+++Q  DG++HI YTY R  I
Sbjct: 325 AISRDGGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIVQGGDGTLHIAYTYYRRAI 384

Query: 314 KHVVLLP 320
           K+V L P
Sbjct: 385 KYVRLAP 391


>gi|317046612|ref|YP_004114260.1| hypothetical protein Pat9b_0378 [Pantoea sp. At-9b]
 gi|316948229|gb|ADU67704.1| conserved hypothetical protein [Pantoea sp. At-9b]
          Length = 394

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 147/357 (41%), Gaps = 75/357 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ +W      G  +W       ++P     NPV
Sbjct: 36  HAANLLPLPNGDVLCVWFGGTQEGIADISVWSSRLAKGSNQWSEAEKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L +   +   + +  ++  +S + G +WS  + L   PG     +   ++
Sbjct: 96  LFLDPQQVLWLLWTAQKSGNQDTAIVRYRQSRDFGRSWSAIDTLLDQPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L NG     +  C +     W  N   S ++++ D G+SWR    + +P +SL  +    
Sbjct: 154 LPNGNWLLPVFYCRTQPGVKWVGNDDVSAVKISSDQGKSWRD---VAVP-DSLGCVHMNI 209

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L  L RS      +  S S DGG +WS  + T L N NS I    L +G L L
Sbjct: 210 TLLQDGSLLALYRS-RWADFIYQSRSRDGGESWSAPQATALPNNNSSIQVTTLHNGHLAL 268

Query: 254 AYNTVS------------------------------------------RGVLKVALSKDD 271
            +N +S                                          R  + +A+S D 
Sbjct: 269 VFNAMSAKDASERRLSLYDEIEDEEEGDVAVAAEPVVHSGRTAFWGAPRAPMTLAISADG 328

Query: 272 GDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           G SW     L+E             L  EFSYP++ Q+ DG++HI YTY R  IK+V
Sbjct: 329 GQSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSEDGALHIAYTYFRQAIKYV 385


>gi|418298483|ref|ZP_12910321.1| hypothetical protein ATCR1_13188 [Agrobacterium tumefaciens
           CCNWGS0286]
 gi|355536396|gb|EHH05669.1| hypothetical protein ATCR1_13188 [Agrobacterium tumefaciens
           CCNWGS0286]
          Length = 398

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/371 (23%), Positives = 139/371 (37%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HAS +  +D G  L A+FGG+ EG  D+ I+      G   W  P     +P  
Sbjct: 28  SPMIQNHASFLHLLDDGTLLCAWFGGTLEGKSDISIFASVLTPGSTHWGPPQRLSHDPAH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P +G+L  +   Q       C  R     +    R+   P  L   + + 
Sbjct: 88  SEQNPVLFTAP-DGVLWLFHTAQPSGNQDECRIR-----MARVTRDPTAPEKLASEEGRF 141

Query: 140 ILLENGLLL-------------------CGSSVESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +                        + WN     + + ++ D G +W+    
Sbjct: 142 LDLPQGCFIRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDNGATWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +    +S   +   P    N       R       V  +ES DGG +WS  +PT + N N
Sbjct: 198 LQELQKSTGCVHMSPVAVTNGHYAAFFRRRQA-DFVYRTESADGGRSWSKPQPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  ++L DGRL +  N ++                                     R 
Sbjct: 257 SSIAVIRLGDGRLAMICNPINAAQSSDRRASLYDELGEEDDRPDADPSGGCVPVWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYN 309
            + + LS DDG S+   + +E+                E SYP +++A DG++H +YTY+
Sbjct: 317 PVSICLSNDDGRSFATRILIEDGPGTCLSNDSTDGRNQEMSYPWLLEAPDGTLHASYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLAP 387


>gi|424877428|ref|ZP_18301074.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392521575|gb|EIW46302.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 395

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/367 (24%), Positives = 146/367 (39%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGAERWSVPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P   + L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFNAPDGKIWLLYT-SQTSGNQDGSVVKCRISDDGGQTFGPVQILCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V +  D G +W+ +    IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGANWQMHD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     +  +      F     +  S S DGG TWS  +PT+L N NS I   
Sbjct: 207 IGAVHMNILPLGGDEMIAFYRNRF--AENILSSRSSDGGATWSAPEPTKLPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
            L DG + + YN                                        V R  L +
Sbjct: 265 ILNDGAIAMVYNHSNAGTSDARRQSLYDEIEAGEAGETAVVADNSTRKAVWGVPRAPLSL 324

Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           A+S+D G S+   + L+            ++L  EFSYP+++Q  DG++HI YTY R  I
Sbjct: 325 AISRDGGKSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIVQGGDGALHIAYTYYRRAI 384

Query: 314 KHVVLLP 320
           K++ L P
Sbjct: 385 KYISLAP 391


>gi|407918096|gb|EKG11384.1| hypothetical protein MPH_11545 [Macrophomina phaseolina MS6]
          Length = 388

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 151/353 (42%), Gaps = 72/353 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++ +D G  L ++FGGS EG  D+ I+L   K G  +W++P+   ++P+    NPV
Sbjct: 35  HASNLLLLDDGELLCSWFGGSQEGCSDISIYLSRLKPGSQKWEAPLQVSDDPSRSEQNPV 94

Query: 86  LFKLP-SNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKNKPILL 142
           LF+ P SN L + Y   +   +    ++R  S ++G TWS   QL P I    +   ++L
Sbjct: 95  LFQAPDSNELWVIYTAQEAGHQNKAIIRRRVSTDRGATWSPATQLFPEIGAFVRQPIVVL 154

Query: 143 ENG-----LLLCGSSVE-SW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPF 194
           +NG     +  C +     W  N   S +  + D G +W++     +PN +  V   +  
Sbjct: 155 DNGTWVLPIWYCRAPDGFQWTGNDDISAVRYSNDRGATWKEQA---VPNSTGCVHMNI-R 210

Query: 195 HTANRTLRVLMRS--FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
              + T     RS   + + R   S+  D    WS  +P  L NPNSGI    L  G+L 
Sbjct: 211 RGPDGTYLAFFRSRWADNVYRSTSSDCID----WSPPQPIGLPNPNSGICFDVLPSGKLA 266

Query: 253 LAYNTVS--------------------------------------RGVLKVALSKDDGDS 274
           L +N  S                                      R  L   +S+D G +
Sbjct: 267 LVFNNSSATPDMRRREGLYDDITPEAEKLKNQPAVDGKEAIWGTPRSALSFGISEDFGLT 326

Query: 275 WH-------DALTLEEN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           W        D   +  N    +  E SYP++    DG +HI YTY R +IK+V
Sbjct: 327 WRFKTIEDTDGYCMTNNSRDKINREASYPSIATGEDGWIHIAYTYFRQRIKYV 379


>gi|218461765|ref|ZP_03501856.1| putative glycosyl hydrolase protein [Rhizobium etli Kim 5]
          Length = 395

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/368 (24%), Positives = 139/368 (37%), Gaps = 80/368 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSDRWSDPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK-RSYNKGVTWSER---EQLPPGILGPS 135
              NP++F  P   + L Y       + +     R+ N       R    + P  +  P+
Sbjct: 95  SEQNPLIFNAPDGKIWLLYTSQTSATRMARLSNFRTSNDAANIPARFNPLRQPRYLRPPA 154

Query: 136 KNKPILLENGLL----LCGSSVESWNSWGSWMEVTVDAGRS--------WRKYGPIYIPN 183
             +     + LL      G     W+       V +   R         W   G +++  
Sbjct: 155 DCRQRPAADWLLPIFRCVGLEGRRWSGDADTAAVLISRDRGANWQMVDIWDSLGAVHMNI 214

Query: 184 ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
             L   + V F+  NR    ++ S           S DGG +WS  +PT+L N NS I  
Sbjct: 215 LPLGGSEMVAFYR-NRFAETILSS----------RSADGGESWSPPEPTELPNNNSSIQA 263

Query: 244 VKLKDGRLLLAYNT---------------------------------------VSRGVLK 264
             L DG + + YN                                        V R  L 
Sbjct: 264 TVLDDGGIAMVYNHSNATMSDARRQSLYDEIEGDVAGESAAIVADAGRKAVWGVPRAPLS 323

Query: 265 VALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQ 312
           +A+S+D G ++   + L+            ++L  EFSYP+VIQ SDG++H+ YTY R  
Sbjct: 324 LAISRDGGRTFPQRMDLDTGDGFCLSNNSKDSLNREFSYPSVIQGSDGTLHVAYTYYRRA 383

Query: 313 IKHVVLLP 320
           IK+V L P
Sbjct: 384 IKYVRLAP 391


>gi|398798882|ref|ZP_10558179.1| putative neuraminidase (sialidase) [Pantoea sp. GM01]
 gi|398099705|gb|EJL89957.1| putative neuraminidase (sialidase) [Pantoea sp. GM01]
          Length = 395

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 149/362 (41%), Gaps = 84/362 (23%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ +W    + G  +W       ++P     NP+
Sbjct: 36  HAANLLPLPNGDVLCVWFGGTQEGIADISVWCSRLESGSTQWSEAQKLSDDPTRSEQNPL 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL---------PPGILGP 134
           LF  P   L L +   +   + +  ++  +S++ G +W   + L          P ++ P
Sbjct: 96  LFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDLGHSWGAIDTLLDQPGTFIRQPMVVLP 155

Query: 135 SKN--KPILL---ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
           + N   P+     + G+   G      N   S ++++ D G+SWR    + +P +SL  +
Sbjct: 156 NDNWLLPVFYCRTQPGVKWVG------NDDISAVKISSDRGKSWRD---VVVP-DSLGCV 205

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
                   + +L  L RS      +  S S DGG +WS  + T+L N NS I    L +G
Sbjct: 206 HMNITLLQDGSLLALYRS-RWADFIYQSRSTDGGESWSAPRATELPNNNSSIQVTTLHNG 264

Query: 250 RLLLAYNTVS-------------------------------------------RGVLKVA 266
            L L +N +S                                           R  + +A
Sbjct: 265 HLALVFNAMSAKDARERRVSLYDEIEDEEDDGDAAVAAEPQVHSGRSAFWGAPRAPMTLA 324

Query: 267 LSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
           +S D G SW     L+E             L  EFSYP++ Q+ DG++H+ YTY R  IK
Sbjct: 325 ISADGGKSWPWQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQSEDGALHVAYTYFRQAIK 384

Query: 315 HV 316
           +V
Sbjct: 385 YV 386


>gi|408377280|ref|ZP_11174883.1| hypothetical protein QWE_06783 [Agrobacterium albertimagni AOL15]
 gi|407749239|gb|EKF60752.1| hypothetical protein QWE_06783 [Agrobacterium albertimagni AOL15]
          Length = 396

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 90/371 (24%), Positives = 141/371 (38%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
           +P    HAS +     G  + A+FGGS EG  D+ I+      G+  W +P     +P+ 
Sbjct: 28  SPMVQNHASFLHRQADGSLICAWFGGSLEGKSDISIYASVLVAGQNAWGAPQRLSSDPDH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF+ P   LLLF+   Q       C  R     V   +     P  L  ++ + 
Sbjct: 88  SEQNPVLFQAPDGRLLLFH-TSQPSGNQDECRIRMSEISVDAED-----PTRLRATEGRY 141

Query: 140 ILLENG-LLLCGSSVESWNSW--------------------GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    ++ +  +W                     + + ++ D G +WR    
Sbjct: 142 LDLPRGSFVRAPVAIRADGTWLLPIFRCIPRPGQKWNGSHDTAAIGISEDGGATWR---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +   ++S+  +   P    + T+    R       V  +ES D G TWS   PT L N N
Sbjct: 198 LEELSQSIGCVHMSPVALGDTTMAAFFRRRQA-DFVYRTESHDAGRTWSAPAPTDLPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  + L DGR+ +  N V+                                     R 
Sbjct: 257 SSIAAIALDDGRVAIICNPVNAAASLDRRTSLYDELGEEDDRPEADPTGGCAPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYN 309
            + V +S D G ++   + +E+                E SYP ++   DGS+HI YTY+
Sbjct: 317 PVTVCISSDGGLTFPQRIQIEDGPGTCLSNDSTDGRNKEMSYPWLLADQDGSLHIAYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLAP 387


>gi|86359911|ref|YP_471801.1| glycosyl hydrolase [Rhizobium etli CFN 42]
 gi|86284013|gb|ABC93074.1| putative glycosyl hydrolase protein [Rhizobium etli CFN 42]
          Length = 395

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 92/371 (24%), Positives = 148/371 (39%), Gaps = 86/371 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++       GRW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMRDISIYMSRLPPGSGRWSEPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF-- 151

Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
            + + ++ + G     +  C G   + W+       V +  D G SW+        G ++
Sbjct: 152 VRQQIVVNDRGDWLLPIFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 211

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +    L   + + F+  NR    ++ S           S DGG TWS  +PT+L N NS 
Sbjct: 212 MNILPLGGCEMIAFYR-NRFAETILSS----------RSADGGETWSPPEPTELPNNNSS 260

Query: 241 IDGVKLKDGRLLLAYNT---------------------------------------VSRG 261
           I    L +G + + YN                                        V R 
Sbjct: 261 IQATVLDNGGIAMVYNHSNATMSDARRQSLYDEIEGDECGENAAVVADAGRKAVWGVPRA 320

Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
            L +A+S+D G ++   + L+            ++L  EFSYP+VI+ SDG++H+ YTY 
Sbjct: 321 PLSLAISRDGGRTFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSVIEGSDGTLHVAYTYY 380

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 381 RRAIKYVRLAP 391


>gi|171463784|ref|YP_001797897.1| BNR/Asp-box repeat-containing protein [Polynucleobacter necessarius
           subsp. necessarius STIR1]
 gi|171193322|gb|ACB44283.1| BNR/Asp-box repeat protein [Polynucleobacter necessarius subsp.
           necessarius STIR1]
          Length = 413

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 84/320 (26%), Positives = 143/320 (44%), Gaps = 38/320 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVP--- 80
           S HA++++ +  G   V +F GS EGA DV I+   +      W +P +  +  +     
Sbjct: 73  SVHAASMIALKDGAVRVFWFAGSREGAADVAIYNSVYDPHSTNWSAPTVVMDRVSAEKGL 132

Query: 81  ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGI 131
                 + NPV  +L    L LF+ +   +  W+G       S ++G+TWS  ++L   I
Sbjct: 133 LRYIAKLGNPVPTRLVDGRLQLFF-VTVSIGGWAGSSISAITSDDEGLTWSSPQRL---I 188

Query: 132 LGPSKNKPILLEN-GLLLCGS--SVESWNSW-GSWME-VTVDAGRSWRKYGPIYIPNESL 186
             P  N   L+++ G++       + +++ W G + E + VDAGR   K       +   
Sbjct: 189 SSPLLNLSTLVKSPGVMFVDGLMGMPAYHEWVGRFGEFLRVDAGRVIDKRRM----SSGR 244

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
             IQP+ F    +      R     G   ++ +S + + G  W  ++   + NPNS + G
Sbjct: 245 GAIQPLVFVNDAQDASAFFRQTRSAGLPKQIPVSYTQNAGQNWHQSEDLAIANPNSAVAG 304

Query: 244 VKLKDGRLLLAYNTVSRGVLKVAL--SKDDGDSWHDALTLEENLAM------EFSYPAVI 295
           V LK G  +L  N +  G  ++ L  S      W     LE++ A+      EFSYP +I
Sbjct: 305 VILKSGTRILVLNDIEYGRHRLVLMMSSPKNGQWQTVEVLEDDEALPDIQRKEFSYPYLI 364

Query: 296 QASDGSVHITYTYNRTQIKH 315
                  H+ YT++R +I+H
Sbjct: 365 TVDGEDAHLVYTWDRKKIRH 384


>gi|121605966|ref|YP_983295.1| BNR/Asp-box repeat-containing protein [Polaromonas
           naphthalenivorans CJ2]
 gi|120594935|gb|ABM38374.1| BNR/Asp-box repeat protein [Polaromonas naphthalenivorans CJ2]
          Length = 461

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 93/370 (25%), Positives = 155/370 (41%), Gaps = 74/370 (20%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RW--QSPIIAD 74
           A+S+P ++ HA++IVE+  G     +F GS EGA DV I     +    RW  ++ ++  
Sbjct: 83  ASSSPGQAVHAASIVELRDGRLRAVWFSGSREGAGDVVIKTSVMEPESLRWSEETTLLDR 142

Query: 75  EEPNVPMW-------NPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSER 124
           +     +W       NPV+ ++P + L+L + +   V  W+G      RS + G++WS  
Sbjct: 143 QHLQQGLWRYVKKIGNPVIARMPDDSLVL-WMVNVSVGGWAGSSITWLRSTDDGMSWSLP 201

Query: 125 EQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
            +L       I   +K  PI  ++G +      E  + +   + ++ +     +      
Sbjct: 202 RRLVTSPFLNISTLAKGAPIFYQDGQIGLPVYHEFVSKFAEILRISPEG----QVLDKTR 257

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           IP    S +QPV   ++ R  ++ MRS +    V  SE+ D G TWS    T   NP+S 
Sbjct: 258 IPRSQTS-LQPVVLVSSARQAQIYMRSGHSTA-VMASETVDAGKTWSATHATAWPNPDSA 315

Query: 241 IDGV-------------KLKDGRLLLAYNTVSRGVLKVALS------------------- 268
           + GV             K K+  +L    T + G  + A+                    
Sbjct: 316 LAGVVTATSQQWLALNPKPKNREILALLQTSAAGSFEGAIPWVVESSPTPEIRLPVSDYE 375

Query: 269 -----------------KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
                            + D  S    L        EFSYP ++Q+ DG +H+ YT++R+
Sbjct: 376 RLLGDELQKQGASQAQIQADVASAKRQLCGSATCLQEFSYPYLLQSRDGYLHLVYTWHRS 435

Query: 312 QIKHVVLLPN 321
           +IKHV L P 
Sbjct: 436 RIKHVRLDPR 445


>gi|222082211|ref|YP_002541576.1| glycosyl hydrolase [Agrobacterium radiobacter K84]
 gi|221726890|gb|ACM29979.1| glycosyl hydrolase protein [Agrobacterium radiobacter K84]
          Length = 395

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 146/371 (39%), Gaps = 86/371 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW +P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLFPGATRWSAPEKMSDDPVK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
              NP++F  P   + L Y   Q      G + +   S + G T+ E   L   + G   
Sbjct: 95  SEQNPLIFNAPGGDVWLLYT-SQTSGNQDGSVVKCRISADGGKTFGEVSVL-CDLPGTFV 152

Query: 137 NKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSW------RKYGPIY 180
            +PI++ NG       +  C G   E W        V +  D G+SW         G ++
Sbjct: 153 RQPIIV-NGRGAWLLPVFRCIGLLGERWTGDADTAGVLISRDQGKSWVMTAVPDSLGAVH 211

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +   +L     V F+  NR  +           V  S S DGG +WS  +P  L N NS 
Sbjct: 212 MNIVALEGDDLVAFYR-NRFAQ----------SVLASRSTDGGESWSAPQPMDLPNNNSS 260

Query: 241 IDGVKLKDGRLLLAYN---------------------------------------TVSRG 261
           I  VKLK   + + YN                                        V R 
Sbjct: 261 IQAVKLKSCVIAIVYNHSNALMSDARRQSLYDEIEGGEAIDAAPLQTLERRKAVWGVPRA 320

Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
            L +A S D G S+   + L+            ++L  EFSYP+++Q +DG VHI YTY 
Sbjct: 321 PLSLAFSSDGGTSFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVQGADGVVHIAYTYY 380

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 381 RRAIKYVRLSP 391


>gi|149177682|ref|ZP_01856283.1| hypothetical protein PM8797T_27502 [Planctomyces maris DSM 8797]
 gi|148843500|gb|EDL57862.1| hypothetical protein PM8797T_27502 [Planctomyces maris DSM 8797]
          Length = 393

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 82/301 (27%), Positives = 140/301 (46%), Gaps = 38/301 (12%)

Query: 44  YFGGSCEGAPDVKIWLQTFKDGR-WQSPIIADEEPN-VPMWNPVLFKLPSNGLLLFYKIG 101
           Y GG+ EG  +  + + +  DG+ W  P +  + P  V  ++P ++  PS  + +F+   
Sbjct: 73  YAGGTGEGELNYVVLVTSGDDGKTWSGPKLVIDPPGPVRAYDPAIWHDPSGRMWVFW--A 130

Query: 102 QEVQKWSG-------CMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLL------ 148
           Q  + W G           S  +   W+E  +L  GI+    NKP +L NG  L      
Sbjct: 131 QSYRWWDGRSGVWAVTTDDSDQENPKWTEPRRLCNGIM---MNKPTVLSNGDWLLPVAIW 187

Query: 149 ---CGSSVESW----NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL 201
                 S+E         G  + ++ D G++++  G   +P  +    + +     + +L
Sbjct: 188 KQPAKESIEHRFDLPEERGGNIFISKDQGKTFQLLGQTDVPERTFD--EHMIVERKDNSL 245

Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY----NT 257
             L+R+  GIG    S S DGG TWS  K + + + N+     +LK G LL+      + 
Sbjct: 246 WTLVRTRYGIGE---SISADGGKTWSKGKASDIPHVNARFFIRRLKSGNLLMVRHNPADK 302

Query: 258 VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
            SR  L   LS+DDG +W   L L+E   +  SYP  +Q+ DG+++I Y + RT+ K ++
Sbjct: 303 KSRRDLTAFLSQDDGKTWQGGLLLDERSGV--SYPDGVQSDDGTIYIIYDFARTRDKKIL 360

Query: 318 L 318
           +
Sbjct: 361 M 361


>gi|213586273|ref|ZP_03368099.1| hypothetical protein SentesTyph_35378 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-0664]
          Length = 210

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 105/211 (49%), Gaps = 37/211 (17%)

Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS------------- 187
           +  NG  +   S+ES   W ++++ + D G+ W        P+ ++S             
Sbjct: 1   MASNGAWIAPGSIESPERWRAFVDRSSDEGKHWNISFVPLEPDNAISGTNVALWDGVKKG 60

Query: 188 ---------------VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
                          VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T
Sbjct: 61  MLWECCLENLLRWDGVIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARAT 118

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAM 287
            L N NSGID V ++DG L+LA N V+     R  L +  S+D+G+SW   L LE +   
Sbjct: 119 FLPNNNSGIDLVSMQDGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG- 177

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 178 EYSYPAII-SKGGVVHITYTWNRKNIVYCRL 207


>gi|389626781|ref|XP_003711044.1| glycosyl hydrolase [Magnaporthe oryzae 70-15]
 gi|351650573|gb|EHA58432.1| glycosyl hydrolase [Magnaporthe oryzae 70-15]
 gi|440466568|gb|ELQ35828.1| glycosyl hydrolase [Magnaporthe oryzae Y34]
 gi|440487602|gb|ELQ67382.1| glycosyl hydrolase [Magnaporthe oryzae P131]
          Length = 396

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 152/368 (41%), Gaps = 92/368 (25%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HAS ++ +  G  L A+FGGS EG  D+ IWL     G   W SP+   ++P     NPV
Sbjct: 33  HASNLLLLPSGELLCAWFGGSMEGISDIYIWLSRLAPGAKEWTSPLQVSKDPGRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL----------PPGIL 132
           LF    +G L      QE    + C+ R   S + G TW E E+L          P  +L
Sbjct: 93  LFLDERSGNLWCIYTAQEHGAQNKCLVRRRISSDLGKTWGEAEELFSDRGIFVRQPIVVL 152

Query: 133 ----GPSKNKPIL------LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIP 182
               GP K+  +L       E G+   G      N   S +  + D+G++W++     +P
Sbjct: 153 KDKNGPGKDVWVLPVFHCRAEEGVEWVG------NDDVSAVRYSEDSGKTWKES---MVP 203

Query: 183 NESLSV---IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
               +V   I+P P +          R  + I  +  S+ C   + W+  +PT+L NPNS
Sbjct: 204 ESRGAVHMNIRPDPSNPGKFVGVYRSRWADNI-YLSRSDDC---INWTKPEPTELPNPNS 259

Query: 240 GID-GVKLKDGRLLLAYNT--------------------------------------VSR 260
           GI  GV  K G+L + YN                                         R
Sbjct: 260 GICLGVLPKSGKLAMVYNDSRATPDMVQRTGLYDDIETDGPKLKKQAVIGDKAAIWGTPR 319

Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
             + +ALS D G +W +   LE            E L  E SYP++  A D ++HI +T+
Sbjct: 320 SRMTLALSSDGGRTWPEKRLLEEGDGFCLTNNSKEKLNRELSYPSIFVADDETMHIAFTF 379

Query: 309 NRTQIKHV 316
            R +IK+V
Sbjct: 380 WRQRIKYV 387


>gi|424920580|ref|ZP_18343943.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM597]
 gi|392849595|gb|EJB02117.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM597]
          Length = 394

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 143/366 (39%), Gaps = 77/366 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLRDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSEPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G ++     L   PG    
Sbjct: 95  SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSVVKCRISDDGGKSFGPVRILCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V +  D G SW+      IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMRD---IPDT 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     +  +      F     +  S S DGG TWS  +PT+L N NS I   
Sbjct: 207 IGAVHMNILPLDGDDMIAFYRNRF--AESILSSRSSDGGETWSAPEPTELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
            L D  + + YN                                       V R  L +A
Sbjct: 265 TLNDRAIAMVYNHSNASTSDARRQSLYDEIEGGETGEAPVVADIGRKAVWGVPRAPLSLA 324

Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
           +S+D G S+   + L+            ++L  EFSYP++IQ SDG++HI YTY R  IK
Sbjct: 325 ISRDSGKSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIIQGSDGALHIAYTYYRRAIK 384

Query: 315 HVVLLP 320
           +V L P
Sbjct: 385 YVRLAP 390


>gi|445227594|ref|ZP_21404370.1| hypothetical protein SEE10_003303 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444866898|gb|ELX91610.1| hypothetical protein SEE10_003303 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
          Length = 158

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++
Sbjct: 24  VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 81

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           DG L+LA N V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G V
Sbjct: 82  DGTLILALNLVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 139

Query: 303 HITYTYNRTQIKHVVL 318
           HITYT+NR  I +  L
Sbjct: 140 HITYTWNRKNIVYCRL 155


>gi|338811147|ref|ZP_08623378.1| expressed protein [Acetonema longum DSM 6540]
 gi|337276830|gb|EGO65236.1| expressed protein [Acetonema longum DSM 6540]
          Length = 386

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 88/334 (26%), Positives = 144/334 (43%), Gaps = 47/334 (14%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
           + + HA  ++E+  G  L  +F G+ EG+ D+ I           W   +    +     
Sbjct: 33  YPTAHAPAMLELPNGDLLCCWFAGTYEGSADIHIICSILPKNSPAWLPTVDISGDSTRSE 92

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK--------RSYNKGVTWSEREQLPPGILG 133
            NP LF  P + +   Y    + Q+    M+        +S + G TW   E + P   G
Sbjct: 93  QNPSLFYGPDHAVWAMYTAQLDRQEGKDNMQYTAMVRCQKSTDGGKTWGPYETVFPE-EG 151

Query: 134 PSKNKPI-LLENGLL-----LCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
               +PI +L NG       LC  S +  +   S   ++ D G++WR    + +P +S  
Sbjct: 152 TFCRQPIQILSNGRWIFGNWLCTDSKDGLSDDPSVFRISDDEGKTWRM---VMMP-KSNG 207

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P      N  L   MR+     R+  SES D G TWS   PT L N NS I  ++L+
Sbjct: 208 HVHPNVVELENGHLVAFMRNREA-HRIHRSESFDWGETWSKPAPTPLPNNNSSISALRLQ 266

Query: 248 DGRLLLAYNTVS--------------RGVLKVALSKDDGDSW--------HDALTLEENL 285
            GR+ +AYN                 R  + VALS+D G ++         +    +EN 
Sbjct: 267 SGRIAIAYNPTCTPNPQSGKAAWPGLRCPVAVALSEDGGLTFPIVRWMERGEGFIGDENK 326

Query: 286 A--MEFSYPAVIQASDGSVHITY-TYNRTQIKHV 316
               ++ YP ++Q+ DG++H+ Y ++ R  +K+V
Sbjct: 327 TNNRQYEYPYLMQSVDGALHLAYASHTRAGVKYV 360


>gi|438144458|ref|ZP_20875457.1| hypothetical protein SEEP9120_19452, partial [Salmonella enterica
           subsp. enterica serovar Pullorum str. ATCC 9120]
 gi|434939238|gb|ELL46096.1| hypothetical protein SEEP9120_19452, partial [Salmonella enterica
           subsp. enterica serovar Pullorum str. ATCC 9120]
          Length = 151

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++
Sbjct: 17  VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 74

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           DG L+LA N V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G V
Sbjct: 75  DGTLILALNLVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 132

Query: 303 HITYTYNRTQIKHVVL 318
           HITYT+NR  I +  L
Sbjct: 133 HITYTWNRKNIVYCRL 148


>gi|417383909|ref|ZP_12149461.1| hypothetical protein LTSEJOH_3060 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
 gi|417531553|ref|ZP_12186235.1| hypothetical protein LTSEURB_2815 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
 gi|353610544|gb|EHC63469.1| hypothetical protein LTSEJOH_3060 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
 gi|353664663|gb|EHD03025.1| hypothetical protein LTSEURB_2815 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
          Length = 188

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++
Sbjct: 54  VIQPTLWESSPGHIHMLLRSTRGA--IFRSDSIDYGATWSVARATSLPNNNSGIDLVSMQ 111

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           DG L+LA N V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G V
Sbjct: 112 DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 169

Query: 303 HITYTYNRTQIKHVVL 318
           HITYT+NR  I +  L
Sbjct: 170 HITYTWNRKNIVYCRL 185


>gi|209552288|ref|YP_002284203.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|209539400|gb|ACI59332.1| putative glycosyl hydrolase protein [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 394

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 142/366 (38%), Gaps = 77/366 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSEPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G ++     L   PG    
Sbjct: 95  SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSVVKCRISDDGGKSFGPVRILCESPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V +  D G SW+      IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMRD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   V     +  +      F     +  S S DGG TWS  +P +L N NS I   
Sbjct: 207 IGAVHMNVLPLGGDDMIAFYRNRF--AESILSSRSSDGGETWSAPEPAELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
            L DG + + YN                                       V R  L +A
Sbjct: 265 VLNDGAIAMVYNHSNASTSDARRQSLYDEIESGDAGEAPVVADIGRKAVWGVPRAPLSLA 324

Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
           +S+D G S+   + L+            ++L  EFSYP++IQ  DG++HI YTY R  IK
Sbjct: 325 ISRDSGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIIQGGDGALHIAYTYYRRAIK 384

Query: 315 HVVLLP 320
           +V L P
Sbjct: 385 YVRLAP 390


>gi|424889567|ref|ZP_18313166.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2012]
 gi|393171785|gb|EJC71830.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2012]
          Length = 394

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 144/366 (39%), Gaps = 77/366 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++        RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPDAERWSEPEKMSDDPQK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G ++     L   PG    
Sbjct: 95  SEQNPLIFNAPDGKTWLLYT-SQTSGNQDGSLVKCRISDDGGKSFGPVSVLCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +PI++ NG       +  C G + + W+       V +  + G SW+ +    IP+ 
Sbjct: 152 -VRQPIVV-NGRGDWLLPVFRCVGLNGQRWSGDADTAAVLISRNGGESWQMHD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     +  +      F     +  S S DGG TWS   PT+L N NS I   
Sbjct: 207 IGAVHMNILPLGGDEMVAFYRNRF--AESILSSRSSDGGETWSAPAPTELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT--------------------------------------VSRGVLKVA 266
            L DG + + YN                                       V R  L +A
Sbjct: 265 VLNDGAIAMVYNHSNATMSDARRQSLYDEIEGDEAGEATVVAHIGRNAVWGVPRAPLSLA 324

Query: 267 LSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
           +S+D G S+   + L+            ++L  EFSYP++I+  DG++HI YTY R  IK
Sbjct: 325 ISRDGGKSFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSIIEGGDGTLHIAYTYYRRAIK 384

Query: 315 HVVLLP 320
           +V L P
Sbjct: 385 YVRLAP 390


>gi|449138247|ref|ZP_21773538.1| hypothetical protein RE6C_05593 [Rhodopirellula europaea 6C]
 gi|448883183|gb|EMB13725.1| hypothetical protein RE6C_05593 [Rhodopirellula europaea 6C]
          Length = 152

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 58/146 (39%), Positives = 76/146 (52%), Gaps = 28/146 (19%)

Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS- 259
           L++L RS     R+  S S DGG TWS    T L NPNSG D V L+DGR +L YN    
Sbjct: 8   LQILCRSQEK--RIVESWSKDGGKTWSTLTATDLPNPNSGTDAVTLQDGRQVLIYNHSEG 65

Query: 260 ------------RGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVI 295
                       R +L +A+S  DG +W   LTLE                 E+SYPA+I
Sbjct: 66  MVRRKDAPDLKPRRILNLAISS-DGKTWKPVLTLENETGPHPKDADRRRHFGEYSYPAII 124

Query: 296 QASDGSVHITYTYNRTQIKHVVLLPN 321
           Q SDG +++ YTYNR  +KH ++ P+
Sbjct: 125 QTSDGMLNMVYTYNREGVKHAIVDPS 150


>gi|398351148|ref|YP_006396612.1| hypothetical protein USDA257_c12640 [Sinorhizobium fredii USDA 257]
 gi|390126474|gb|AFL49855.1| hypothetical protein USDA257_c12640 [Sinorhizobium fredii USDA 257]
          Length = 397

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 85/364 (23%), Positives = 148/364 (40%), Gaps = 71/364 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ +++   + G  RW +P    ++P  
Sbjct: 35  SPCVQNHAANLSFLPDGTLTCVWFGGTMEGMGDISVYMSRLEPGARRWTNPEKMSDDPQK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NP++F  P   + L +       +    +KR  S + G ++   E L   I G    
Sbjct: 95  SEQNPLIFTAPDGRVWLLFTSQTSGHQDGAVVKRRISEDGGKSFGPTEVLC-DIPGTFVR 153

Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
           +PI++         +  C G +   W        V +  D G++W       +P+ + +V
Sbjct: 154 QPIIVNGAGDWLLPIFRCMGEAGRRWTGAIDRAAVLISRDQGKAWTM---AEVPDSTGAV 210

Query: 189 -IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            +  VP    N  L    R+      V  S+S D G  W+  +P  L N NS I  +++K
Sbjct: 211 HMSIVPVE--NGALVAFYRNRFAT-HVLRSQSSDAGTHWTAPEPVDLPNNNSSIQAIRMK 267

Query: 248 DGRLLLAYNT--------------------------------------VSRGVLKVALSK 269
           +G + + YN                                       V R  L +A S 
Sbjct: 268 NGAIAMVYNHSNASMSDARRHSLYDEIEGEGDSGSAEAVASARPAVWGVPRAPLSLAFST 327

Query: 270 DDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
           D G ++   + L+            ++L  EFSYP++++ ++G +H+ YTY R  IK+V 
Sbjct: 328 DGGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYVR 387

Query: 318 LLPN 321
           L P+
Sbjct: 388 LDPD 391


>gi|417100343|ref|ZP_11960093.1| putative glycosyl hydrolase protein [Rhizobium etli CNPAF512]
 gi|327192232|gb|EGE59197.1| putative glycosyl hydrolase protein [Rhizobium etli CNPAF512]
          Length = 411

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/371 (25%), Positives = 146/371 (39%), Gaps = 86/371 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 51  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEK 110

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 111 SEQNPLIFNAPDGKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF-- 167

Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
            + + ++ + G     +  C G   + W+       V +  D G SW+        G ++
Sbjct: 168 VRQQIVVNDRGDWLLPVFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 227

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +    L     V F+  NR    ++ S           S DGG  WS  +PT+L N NS 
Sbjct: 228 MNILPLGGGDMVAFYR-NRFAETILSS----------RSADGGENWSPPEPTELPNNNSS 276

Query: 241 IDGVKLKDGRLLLAYNT---------------------------------------VSRG 261
           I    L +G + + YN                                        V R 
Sbjct: 277 IQATVLDNGGIAMVYNHSNADMSDARRQSLYDEIEGDEAGEHAAVVADAGRKAVWGVPRA 336

Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYN 309
            L +A+S+D G ++   L L+            ++L  EFSYP+VIQ SDG++H+ YTY 
Sbjct: 337 PLSLAISRDGGRTFPHHLDLDTGDGFCLSNNSKDSLNREFSYPSVIQGSDGTLHVAYTYY 396

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 397 RRAIKYVRLAP 407


>gi|417349594|ref|ZP_12128222.1| Putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
 gi|353572464|gb|EHC36104.1| Putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
          Length = 188

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 81/136 (59%), Gaps = 9/136 (6%)

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V + 
Sbjct: 54  VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATSLPNNNSGIDLVSMP 111

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           DG L+LA N V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G V
Sbjct: 112 DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SEGGVV 169

Query: 303 HITYTYNRTQIKHVVL 318
           HITYT+NR  I +  L
Sbjct: 170 HITYTWNRKNIVYCRL 185


>gi|36958681|gb|AAQ87149.1| Hypothetical protein RNGR00124 [Sinorhizobium fredii NGR234]
          Length = 399

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 84/365 (23%), Positives = 143/365 (39%), Gaps = 71/365 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ +++     G  RW  P    ++P  
Sbjct: 35  SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLAPGGRRWTDPEKMSDDPAK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NP++F  P   + L +       +    +KR  S + G T+   E L   I G    
Sbjct: 95  SEQNPLIFAAPDGRIWLLFTSQTSGNQDGAVVKRRISEDGGKTFGATEVLC-DIPGTFVR 153

Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
           +PI++         +  C G +   W        V +  D GR+W       +P+   +V
Sbjct: 154 QPIIVNAAGDWLLPVFRCIGEAGRRWTGDVDRAAVLISRDEGRTWTMQ---EVPDSIGAV 210

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              +     +  +      F     V  S+S D G TW+  +   L N NS I  +++K+
Sbjct: 211 HMNLVAADGDAMIAFYRNRF--ATHVLRSQSSDAGHTWTAPEAVDLPNNNSSIQAIRMKN 268

Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
           G + + YN                                         V R  L +A S
Sbjct: 269 GAIAMVYNHSNASMSEARRHSLYDEIEGDGEELGGAAEAVASGRPAVWGVPRAPLSLAFS 328

Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            DDG ++   + L+            ++L  EFSYP++++ ++G +H+ YTY R  IK+V
Sbjct: 329 TDDGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 388

Query: 317 VLLPN 321
            L P+
Sbjct: 389 RLDPS 393


>gi|213024134|ref|ZP_03338581.1| hypothetical protein Salmonelentericaenterica_17126 [Salmonella
           enterica subsp. enterica serovar Typhi str. 404ty]
          Length = 139

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 9/136 (6%)

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           VIQP  + ++   + +L+RS  G   +  S+S D G TWS A+ T L N NSGID V ++
Sbjct: 5   VIQPTLWESSPGHIHMLLRSTRG--AIFRSDSIDYGATWSVARATFLPNNNSGIDLVSMQ 62

Query: 248 DGRLLLAYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           DG L+LA N V+     R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G V
Sbjct: 63  DGTLILALNPVNGNWGKRYPLSLIASQDNGESWLPLLDLESDHG-EYSYPAII-SKGGVV 120

Query: 303 HITYTYNRTQIKHVVL 318
           HITYT+NR  I +  L
Sbjct: 121 HITYTWNRKNIVYCRL 136


>gi|119897038|ref|YP_932251.1| hypothetical protein azo0747 [Azoarcus sp. BH72]
 gi|119669451|emb|CAL93364.1| conserved hypothetical secreted protein [Azoarcus sp. BH72]
          Length = 426

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/359 (25%), Positives = 147/359 (40%), Gaps = 75/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--W--QSPIIADEEPNV--- 79
            HA+++  +  G     +FGG  EG  +VK++  T   G   W  Q  I   E+ +    
Sbjct: 63  VHAASVAVLPDGRLFATWFGGEREGGTEVKVYAATRNPGEAGWGRQHAIATPEQTSADVG 122

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP--- 129
                M NPV F  P   L + Y +   +  W+     + RS + G TW    +L     
Sbjct: 123 RLVRKMGNPVAFVTPRGELWVVY-VSVTLGGWATSHLNLLRSPDLGTTWLPARRLAATPF 181

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             +    K  P+  +NG +      E    +   + V  D G    K   + + +   S 
Sbjct: 182 FNLSTLVKGFPVFFDNGDVGLPVYHEMAGKFAELL-VLSDEGEVRHK---VRMDHGRRS- 236

Query: 189 IQPVPFHTANRTLRVLMR--SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
           +QPV      +    LMR    +G  R   SE+ DGG +WS  +PT L NPNS +  ++L
Sbjct: 237 LQPVVLVEDEQRAVALMRYGGEHGPFRAWRSETADGGRSWSAVEPTTLANPNSALAALRL 296

Query: 247 KDGRLLLAYNTVSRGVLKVAL--SKDDGDSW--------------HDA------LTLEEN 284
            DGRLL   N      L+++L  S+D G +W              H+A        LE +
Sbjct: 297 DDGRLLAVANDTEDERLRLSLLVSEDGGRNWRSIHRFEDKQDFAGHEAGGEVFRARLEAD 356

Query: 285 LA---------------------------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           +A                            ++ YP +++A+DG  H+ YT+NR+ ++H+
Sbjct: 357 VADLGPGPAAADLVRVAERNLCRNQGACGWQYDYPYLVRAADGDFHLVYTWNRSFVRHI 415


>gi|227819951|ref|YP_002823922.1| glycosyl hydrolase [Sinorhizobium fredii NGR234]
 gi|227338950|gb|ACP23169.1| putative glycosyl hydrolase protein [Sinorhizobium fredii NGR234]
          Length = 385

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/365 (23%), Positives = 143/365 (39%), Gaps = 71/365 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ +++     G  RW  P    ++P  
Sbjct: 21  SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLAPGGRRWTDPEKMSDDPAK 80

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NP++F  P   + L +       +    +KR  S + G T+   E L   I G    
Sbjct: 81  SEQNPLIFAAPDGRIWLLFTSQTSGNQDGAVVKRRISEDGGKTFGATEVLCD-IPGTFVR 139

Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
           +PI++         +  C G +   W        V +  D GR+W       +P+   +V
Sbjct: 140 QPIIVNAAGDWLLPVFRCIGEAGRRWTGDVDRAAVLISRDEGRTWTMQ---EVPDSIGAV 196

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              +     +  +      F     V  S+S D G TW+  +   L N NS I  +++K+
Sbjct: 197 HMNLVAADGDAMIAFYRNRF--ATHVLRSQSSDAGHTWTAPEAVDLPNNNSSIQAIRMKN 254

Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
           G + + YN                                         V R  L +A S
Sbjct: 255 GAIAMVYNHSNASMSEARRHSLYDEIEGDGEELGGAAEAVASGRPAVWGVPRAPLSLAFS 314

Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            DDG ++   + L+            ++L  EFSYP++++ ++G +H+ YTY R  IK+V
Sbjct: 315 TDDGRTFPRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 374

Query: 317 VLLPN 321
            L P+
Sbjct: 375 RLDPS 379


>gi|440714059|ref|ZP_20894645.1| hypothetical protein, secreted [Rhodopirellula baltica SWK14]
 gi|436441048|gb|ELP34323.1| hypothetical protein, secreted [Rhodopirellula baltica SWK14]
          Length = 376

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 152/343 (44%), Gaps = 49/343 (14%)

Query: 10  LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
           L A++ TF   P    P ++ HA T         + V +G  L A  Y G +     +  
Sbjct: 19  LQAQDSTFLAPPKYIGPPQTIHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 78

Query: 57  IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
           + L T  DG   W+  ++ D + + P+  ++P L+  P   L L +   Q V        
Sbjct: 79  VVLSTSGDGGETWEEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 134

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
                GV + E E   P +  P   +   + +G+++C   V S   W     +W E    
Sbjct: 135 ----AGVWFLEIED--PDVEQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 188

Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
               V+VD G++W + G   +P +  +  + +     + ++ +L R+  GIG    S S 
Sbjct: 189 ARMIVSVDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDGSIWLLARTKYGIGE---SVST 245

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
           D G TW    P+++ +P++     +L  G LLL  +        R  L   +S DDG +W
Sbjct: 246 DRGKTWPELTPSKIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDDGKNW 305

Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
              L L+E   +  SYP   Q +DG +HI Y YNRT+ +H++L
Sbjct: 306 KGGLMLDERAGV--SYPDGQQDADGVIHIIYDYNRTKDRHILL 346


>gi|385792411|ref|YP_005825387.1| hypothetical protein [Francisella cf. novicida Fx1]
 gi|328676557|gb|AEB27427.1| BNR/Asp-box repeat protein [Francisella cf. novicida Fx1]
          Length = 372

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 133/304 (43%), Gaps = 27/304 (8%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F V ++    E AP+ KI +   +   G+WQ     P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-VTWYSSDQETAPNTKIVVAVAEKVAGKWQFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
             +E           + NP+++       L+F          S  +  S + G TWS+ E
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPE 171

Query: 126 Q-LPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
            N+ ++ +QP     +      L R  +  I ++  +E+ D GL+WS  KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKKIYTNETSDSGLSWSKVKPTQLDNPDSG 286

Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
           I  +K+++G +LLAYN  T SR  L +AL  D+   W +  T    +  E SYPA     
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLALKADNSQQWRNIYTFPNKIKGELSYPAFTLYQ 345

Query: 299 DGSV 302
           D  +
Sbjct: 346 DNII 349


>gi|424875893|ref|ZP_18299552.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393163496|gb|EJC63549.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 395

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 143/367 (38%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSERWSVPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++FK P   + L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFKAPDGKVWLLY-TSQTSGNQDGSVVKCRISGDGGQTFGPVQILCNNPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G + + W+       V +  D G SW+      IP+ 
Sbjct: 152 --VRQQIVVNGKGDWLLPVFRCVGLNGQRWSGDADTAAVLMSRDGGVSWQMRD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +        +      F     +  S S D G TWS  +P +L N NS I   
Sbjct: 207 IGAVHMNILPLGGGEMIAFYRNRF--AENILSSRSSDSGETWSAPEPAELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYN---------------------------------------TVSRGVLKV 265
            L DG + + YN                                        V R  L +
Sbjct: 265 ILNDGAIAMVYNHSNAAMSDARRQSLYDEIEGGEAGETAVFANSSVRKAVWGVPRAPLSL 324

Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           A+S+D G S+   + L+            ++L  EFSYP+++Q  DG++HI YTY R  I
Sbjct: 325 AISRDGGKSFPHRIDLDSGDGFCLSNNSKDSLNREFSYPSIVQGGDGTLHIAYTYYRRTI 384

Query: 314 KHVVLLP 320
           K+V L P
Sbjct: 385 KYVRLAP 391


>gi|145250895|ref|XP_001396961.1| glycosyl hydrolase [Aspergillus niger CBS 513.88]
 gi|134082486|emb|CAK97293.1| unnamed protein product [Aspergillus niger]
 gi|350636350|gb|EHA24710.1| hypothetical protein ASPNIDRAFT_40608 [Aspergillus niger ATCC 1015]
          Length = 384

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 148/358 (41%), Gaps = 71/358 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           AP    HAS ++ +  G    A+FGGS EG PD+ I+L   + G   W        +   
Sbjct: 27  APTVQSHASNLLRMPNGDLFCAWFGGSLEGKPDISIYLSRLRCGESEWTEASKMTHDDTR 86

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NPVLF  P+  L L Y       + +  +K   S + G TW + E++     G    
Sbjct: 87  SEQNPVLFHSPTGELWLLYTSQHSGDQDTAIVKYRVSNDSGATWGD-EKVLFHDTGTFIR 145

Query: 138 KPI-LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
           +P+ +LE+G     +  C     E W  N   S + V+ D GR+W +     IP ES   
Sbjct: 146 QPVAVLEDGAWVVPVFKCRVQPGERWLGNDDISCIRVSRDQGRTWTES---EIP-ESTGC 201

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           +        + +   + RS      V ++ S D G+ WS  +PT L NPN+GI    L  
Sbjct: 202 VHMEIQRLKDGSYLGMFRS-RWADNVYLATSPD-GVVWSAPQPTSLPNPNAGICFDVLPS 259

Query: 249 GRLLLAYNTVS---------------------------------------RGVLKVALSK 269
           GR++L YN  S                                       R  L VA S 
Sbjct: 260 GRVVLVYNHSSKLDALGRRQGLYDDIADGVDERKNQASAKDGRESFWGSPRAPLCVAWSD 319

Query: 270 DDGDSWH-------DALTL----EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           D G +W        D   L    E+ L  E SYP+++ A +G +H+ YT+ R +IK+V
Sbjct: 320 DKGKTWKHRILEDGDGYCLTNNSEKKLNRELSYPSMVCA-EGIIHVAYTFWRQRIKYV 376


>gi|118497095|ref|YP_898145.1| BNR/Asp-box repeat-containing protein [Francisella novicida U112]
 gi|194323392|ref|ZP_03057169.1| BNR/Asp-box repeat protein [Francisella novicida FTE]
 gi|208778888|ref|ZP_03246234.1| BNR/Asp-box repeat protein [Francisella novicida FTG]
 gi|254372460|ref|ZP_04987949.1| hypothetical protein FTCG_00020 [Francisella tularensis subsp.
           novicida GA99-3549]
 gi|118423001|gb|ABK89391.1| BNR/Asp-box repeat protein [Francisella novicida U112]
 gi|151570187|gb|EDN35841.1| hypothetical protein FTCG_00020 [Francisella novicida GA99-3549]
 gi|194322247|gb|EDX19728.1| BNR/Asp-box repeat protein [Francisella tularensis subsp. novicida
           FTE]
 gi|208744688|gb|EDZ90986.1| BNR/Asp-box repeat protein [Francisella novicida FTG]
 gi|313661724|gb|ADR71924.1| KdoH1 [Francisella novicida U112]
          Length = 372

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/305 (28%), Positives = 135/305 (44%), Gaps = 29/305 (9%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAIAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            N+ ++ +QP  VP  +    L +  +  + I R+  +E+ D GL+WS  KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285

Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GI  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  EFSYPA    
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGEFSYPAFTLY 344

Query: 298 SDGSV 302
            D  +
Sbjct: 345 QDNII 349


>gi|427404018|ref|ZP_18894900.1| hypothetical protein HMPREF9710_04496 [Massilia timonae CCUG 45783]
 gi|425717257|gb|EKU80222.1| hypothetical protein HMPREF9710_04496 [Massilia timonae CCUG 45783]
          Length = 405

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 78/317 (24%), Positives = 137/317 (43%), Gaps = 40/317 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP-IIADEEP------- 77
           S HAS++  +     L  ++ GS E  PDVK++  T+ +G W +P ++A  E        
Sbjct: 91  SAHASSMTPLPGNRMLAFWWAGSRESGPDVKVYASTWANGAWSAPWMVASRESLGAALGF 150

Query: 78  -NVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILG 133
               + NPV +    +G +  Y +   +  W+        S ++G ++  R  LP     
Sbjct: 151 GVRRIGNPVAWT-DRDGKIHLYVVATGLGGWAASRIAHIVSNDQGASFRVRRVLP----- 204

Query: 134 PSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW-------RKYGPIYIPN--E 184
                P+   + L+       S   W  W+ V  + G  +        +  P  +    +
Sbjct: 205 ---TSPLFNTSALVRTSPLALSDGGW--WLPVYFEIGNKYPMLMAFDAQGNPTSLARIGQ 259

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           S S +QP     +   +R L+R  +   R+  + S DGG +W       L N N+ +  +
Sbjct: 260 STSTLQPAIVAVSKTEVRALLRDDSDARRMQHAASLDGGASWQDLPALDLPNHNTSMAAI 319

Query: 245 KLKDGRLLLAYNTV-----SRGVLKVALSKDDGDSWHDALTLEENLAM-EFSYPAVIQAS 298
           +L  G+ L+ +N       +R  L++++S DD  +W     + +  A  EFSYP + Q  
Sbjct: 320 RLSSGQFLMLHNHAADDASARSRLRLSIS-DDMRTWRTVADIAKGKAGDEFSYPTLQQVG 378

Query: 299 DGSVHITYTYNRTQIKH 315
           D  +H+TYT+ R  I H
Sbjct: 379 D-ELHVTYTWQRQAIAH 394


>gi|378763488|ref|YP_005192104.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
 gi|365183116|emb|CCE99965.1| conserved hypothetical protein [Sinorhizobium fredii HH103]
          Length = 399

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 82/364 (22%), Positives = 144/364 (39%), Gaps = 71/364 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ +++     +  RW  P    ++P  
Sbjct: 35  SPCVQNHAANLAFLPDGTLTCVWFGGTMEGMGDISVYMSRLEPRARRWTEPEKMSDDPAK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSKN 137
              NP++F  P   + L +       +    +KR  S + G ++   E L   I G    
Sbjct: 95  SEQNPLIFTAPDGRIWLLFTSQTSGNQDGAVVKRRISDDGGKSFGATEVLC-DIPGTFVR 153

Query: 138 KPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLSV 188
           +PI++ +       +  C G +   W        V +  D GR+W       +P +S+  
Sbjct: 154 QPIIVNSAGDWLLPVFRCIGEADRRWTGAVDRAAVLISRDQGRTWTMQ---EVP-DSIGA 209

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           +          T+    R+      V  S+S D GL W+  +  +L N NS I  +++K+
Sbjct: 210 VHMNVVRADGATMVAFYRNRFATN-VLRSQSSDAGLNWTAPEAVELPNNNSSIQAIRMKN 268

Query: 249 GRLLLAYNT----------------------------------------VSRGVLKVALS 268
           G + + YN                                         V R  L +A S
Sbjct: 269 GAIAMVYNHSNASMSDARRHSLYDEIESDGGEDNGSTEMVASARPAVWGVPRAPLSLAFS 328

Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            D G ++   + L+            ++L  EFSYP++++ ++G +H+ YTY R  IK+V
Sbjct: 329 TDGGRTFSRRIDLDTGDGYCLSNNSKDSLNREFSYPSIVEDAEGRLHVAYTYFRRAIKYV 388

Query: 317 VLLP 320
            L P
Sbjct: 389 RLDP 392


>gi|409437210|ref|ZP_11264340.1| Glycosyl hydrolase protein [Rhizobium mesoamericanum STM3625]
 gi|408751113|emb|CCM75496.1| Glycosyl hydrolase protein [Rhizobium mesoamericanum STM3625]
          Length = 395

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 141/363 (38%), Gaps = 74/363 (20%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G   W  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLATGAACWSQPEKMSDDPTK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
              NP++F  P   + L Y   Q      G + +   S + G T+ +   L   I G   
Sbjct: 95  SEQNPLIFNAPDGKVWLLYT-SQTSGNQDGSIVKCRISEDGGKTFGDVRVLC-DIPGTFV 152

Query: 137 NKPILLENG------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLS 187
            +PI++         +  C G   E W        V +  DAG+SW       +P+   +
Sbjct: 153 RQPIIVNKAGIWLLPVFRCVGLPGERWTGDVDTAGVLISHDAGKSWVM---TEVPDSIGA 209

Query: 188 V-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
           V +  VP    N       R    I     S S DGG  WS  +P  L N NS I   +L
Sbjct: 210 VHMNIVPIDGDNLVAFYRNRFAESI---LSSRSSDGGAHWSAPRPIDLPNNNSSIQATRL 266

Query: 247 KDGRLLLAYNT---------------------------------------VSRGVLKVAL 267
           ++G + + YN                                        V R  L +AL
Sbjct: 267 ENGMIAMVYNHSNASTSDARRQSLYDEIEGDEETKVAPVQTSERRKAVWGVPRAPLSLAL 326

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S D G S+   + L+            +++  EFSYP++IQ +DG VH+ YTY R  IK+
Sbjct: 327 SGDGGVSFPRRVDLDTGDGYCLSNNSKDSVNREFSYPSIIQGADGVVHVAYTYYRRAIKY 386

Query: 316 VVL 318
           V L
Sbjct: 387 VRL 389


>gi|213024184|ref|ZP_03338631.1| hypothetical protein Salmonelentericaenterica_17386 [Salmonella
           enterica subsp. enterica serovar Typhi str. 404ty]
          Length = 130

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 64/123 (52%), Gaps = 4/123 (3%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPV 85
           +CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W +P           WNPV
Sbjct: 3   TCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWTTPQRVAAREGEAHWNPV 62

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGIL---GPSKNKPILL 142
           LF  PS+ L LFYK+G  V  W      S ++G TWS    L    +   GP KNK +L 
Sbjct: 63  LF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLVNDDILPRGPVKNKLLLA 121

Query: 143 ENG 145
            NG
Sbjct: 122 SNG 124


>gi|254373922|ref|ZP_04989404.1| BNR/Asp-box repeat protein [Francisella novicida GA99-3548]
 gi|151571642|gb|EDN37296.1| BNR/Asp-box repeat protein [Francisella novicida GA99-3548]
          Length = 372

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 86/305 (28%), Positives = 135/305 (44%), Gaps = 29/305 (9%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            N+ ++ +QP  VP  +    L +  +  + I R+  +E+ D GL+WS  KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285

Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GI  +K+++G +LLAYN  T SR  L +AL  D+   W +  T    +  E SYPA    
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLALKADNSQQWRNIYTFPNKIKGELSYPAFTLY 344

Query: 298 SDGSV 302
            D  +
Sbjct: 345 QDNII 349


>gi|373853235|ref|ZP_09596034.1| hypothetical protein Opit5DRAFT_4089 [Opitutaceae bacterium TAV5]
 gi|372472762|gb|EHP32773.1| hypothetical protein Opit5DRAFT_4089 [Opitutaceae bacterium TAV5]
          Length = 374

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 134/307 (43%), Gaps = 44/307 (14%)

Query: 44  YFGGSCEGAPDVKIWLQTFKDGR-WQSPIIADEEPN--VPMWNPVLFKLPSNGLLLFY-- 98
           Y GG  E   +  + +++   GR W  P +  + P   V  ++PVL+  PS  L +F+  
Sbjct: 51  YTGGHTECEANHVVVVRSRDGGRTWSEPYLVIDPPARYVRAFDPVLWLDPSGRLWIFWAQ 110

Query: 99  ---KIGQEVQKWSGCMKRSYNKGVTWSEREQLP----PGILGPSK-------NKPILLEN 144
              ++G+      G +  +   GV W  R   P    P    P +       NKP +  N
Sbjct: 111 TFVRVGEGAPH--GGVVHNGRSGV-WFIRTDTPDAGQPAWTSPRRIANGVMMNKPTVRSN 167

Query: 145 GLLLCGSSVESWNSWGSWM-----------EVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           G  L  +S+  W   G  +            V+ D G ++   G   +P   +   + + 
Sbjct: 168 GEWLLPTSI--WTEHGPLLPELAHERYPNITVSTDGGETFSLRGGADVPCRCID--EHML 223

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L +L+R+  GIG+   S S DGG TW    PT + +P +     +L  GRLLL
Sbjct: 224 VERRDGSLWMLVRTLYGIGQ---SVSVDGGATWRPGWPTDIASPCTRFHIRRLASGRLLL 280

Query: 254 --AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
              YN   R  L  +LS DDG +WH  L L+E    E SYP      DG +H+ Y   R 
Sbjct: 281 LHHYNFTGRSHLTASLSDDDGRTWHGRLLLDER--AEVSYPDAGITPDGRIHVIYDRERF 338

Query: 312 QIKHVVL 318
           + + ++L
Sbjct: 339 KAREILL 345


>gi|412991244|emb|CCO16089.1| BNR repeat-containing glycosyl hydrolase [Bathycoccus prasinos]
          Length = 574

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 134/322 (41%), Gaps = 43/322 (13%)

Query: 38  GHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLF-------K 88
           G  L+A+F G  EG  +V I +   K G  +W       +  +    NPVL        K
Sbjct: 185 GDVLLAWFSGKSEGGDNVGIAVSKLKKGETKWSQAKFVSKAQHRSAQNPVLMRNNNPNDK 244

Query: 89  LPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGVTWSEREQLPPGILGPSKNKPILLENG 145
             S  + + +      +       RS    + G TWS+   +        KN  I   NG
Sbjct: 245 EDSKHVRIVHSSQAAYEGQGTSDMRSLFSTDGGETWSKPVVVDHKNGAFPKNSAIRALNG 304

Query: 146 LLL-----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRT 200
            L+       S     ++  S ++ + D G SW     + +P    +++QP      N  
Sbjct: 305 DLILPMYYTPSGFFDASTQYSELQTSKDGGESWHPL--VEMPGTRGNLVQPTIVRLNNND 362

Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR---LLLAYNT 257
           L    RS      +  S+S + G TW+ AKPT+  N N+GI  +KL+D     L+L ++ 
Sbjct: 363 LLSFFRSRTA-DFIYASKSTNDGKTWTEAKPTKFPNNNAGIQMIKLRDSSKDVLVLVFDN 421

Query: 258 VSRGV--LKVALSKDDGDSWHDALTLEENLAM----------------EFSYPAVIQAS- 298
            + G   L +ALS+D G SW     LE ++                  E+SYP++ +   
Sbjct: 422 CNEGRFPLSIALSEDGGKSWDHIRDLEPDVDFDHEQPNRDKHTNDGDGEYSYPSIQEDGI 481

Query: 299 -DGSVHITYTYNRTQIKHVVLL 319
            DG +H+ YT+ R  IK+   +
Sbjct: 482 HDGLIHVAYTFRRETIKYSAFM 503


>gi|421587605|ref|ZP_16032986.1| putative glycosyl hydrolase protein [Rhizobium sp. Pop5]
 gi|403707873|gb|EJZ22747.1| putative glycosyl hydrolase protein [Rhizobium sp. Pop5]
          Length = 395

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 93/372 (25%), Positives = 146/372 (39%), Gaps = 88/372 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGSKRWSEPEKMTDDPQK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK-RSYNKGVTWSEREQLPPGILGPSKN- 137
              NP++F  P     L Y       +    +K R  + G     R   P  IL  S   
Sbjct: 95  SEQNPLIFNAPDGKTWLLYTSQTSGNQDGSVVKCRISDDG----GRSFGPVRILCDSPGT 150

Query: 138 --KPILLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKY------GPI 179
             +  ++ NG       +  C G + + W+       V +  D G SW+ +      G +
Sbjct: 151 FVRQQIVVNGRGEWLLPVFRCVGLNGQRWSGDADTAAVLISRDGGASWQMHDIADSIGAV 210

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
           ++    L   + + F+  NR    ++ S           S DGG TWS  +PT+L N NS
Sbjct: 211 HMNILPLGGDEMIAFYR-NRFAESILSS----------RSSDGGRTWSAPEPTELPNNNS 259

Query: 240 GIDGVKLKDGRLLLAYNT---------------------------------------VSR 260
            I    L DG + + YN                                        V R
Sbjct: 260 SIQATVLNDGGIAMVYNHSNAAMSDARRLSLYDEIEGDDAGETAAAVADIRRKAIWGVPR 319

Query: 261 GVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTY 308
             L +A+S+D G S+   + L+            ++L  EFSYP+VIQ+ D ++H+ YTY
Sbjct: 320 APLSLAISRDGGRSFPYRIDLDTGDGHCLSNNSKDSLNREFSYPSVIQSGDRTLHVAYTY 379

Query: 309 NRTQIKHVVLLP 320
            R  IK++ L P
Sbjct: 380 YRRAIKYMRLDP 391


>gi|424892701|ref|ZP_18316281.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|424893072|ref|ZP_18316652.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|393183982|gb|EJC84019.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2297]
 gi|393184353|gb|EJC84390.1| putative neuraminidase (sialidase) [Rhizobium leguminosarum bv.
           trifolii WSM2297]
          Length = 395

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 145/367 (39%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLAPGAERWSEPDKMSDDPQK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE--REQLPPGILGPSKN 137
              NP++F  P   + L Y       + SG    S  K  T  +  +   P  +L  S  
Sbjct: 95  SEQNPLIFNAPDGKIWLLY-----TSQTSGNQDGSVVKCRTSDDGGKSFGPVSVLCDSPG 149

Query: 138 KPI---LLENG-------LLLC-GSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNE 184
             +   L+ NG       +  C G + + W+       V +  D G SWR +    IP+ 
Sbjct: 150 TFVRQPLVVNGKGDWLLPVFRCVGRNGQRWSGDADTAAVLISRDGGASWRMHD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +     ++ +      F     +  S S D G TWS  +PT+L N NS I   
Sbjct: 207 IGAVHMNILPLGGDKMVAFYRNRF--AESILSSRSSDSGETWSAPEPTELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYNT---------------------------------------VSRGVLKV 265
            L DG + + YN                                        V R  L +
Sbjct: 265 ILNDGAIAMVYNHSNASTSDARRQSLYDEIEGGETKEFTVVAADIGRKAVWGVPRAPLSL 324

Query: 266 ALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
           A+S+D G S+   + L+            ++L  EFSYP++I+  DG++HI YTY R  I
Sbjct: 325 AISRDGGRSFPHRVDLDTGDGFCLSNNSKDSLNREFSYPSIIEGDDGTLHIAYTYYRRAI 384

Query: 314 KHVVLLP 320
           K+V L P
Sbjct: 385 KYVRLAP 391


>gi|213580823|ref|ZP_03362649.1| hypothetical protein SentesTyph_06247 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-0664]
          Length = 138

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 4/136 (2%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G  V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSGVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 129 PGIL---GPSKNKPIL 141
              +   GP KNK +L
Sbjct: 123 NDDILPRGPVKNKLLL 138


>gi|418406167|ref|ZP_12979487.1| hypothetical protein AT5A_03055 [Agrobacterium tumefaciens 5A]
 gi|358008080|gb|EHK00403.1| hypothetical protein AT5A_03055 [Agrobacterium tumefaciens 5A]
          Length = 398

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/371 (23%), Positives = 138/371 (37%), Gaps = 83/371 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HAS +  +  G  L A+FGG+ EG  D+ I+      G  RW +P     +P  
Sbjct: 28  SPTIQNHASFLHLLQDGTLLCAWFGGTLEGKSDISIFASALTPGASRWGAPQRLSNDPAH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPV+F  P   L LF+   Q       C  R     +    R+   P  L     + 
Sbjct: 88  SEQNPVIFTAPDGTLWLFH-TAQPSGNQDECRIR-----MARVVRDPATPEKLSAEDGRF 141

Query: 140 ILLENGLLL-------------------CGSSVESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +                        + WN     + + ++ D G +W+    
Sbjct: 142 LDLPKGCFVRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISKDNGLTWQ---- 197

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
           +     S   +   P   AN       R       V  +ES DGG +W+   PT + N N
Sbjct: 198 LQELAGSTGCVHMSPVAGANGRYSAFFRRRQA-DFVYRTESTDGGRSWTEPLPTDVPNNN 256

Query: 239 SGIDGVKLKDGRLLLAYNTVS-------------------------------------RG 261
           S I  ++L  GRL +  N V+                                     R 
Sbjct: 257 SSIAAIRLAHGRLAMICNPVNAAQSSDRRASLYDELGEEDDRPDADPSGGCVPIWGVPRA 316

Query: 262 VLKVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYN 309
            + + LS DDG S+   + +E+               +E SYP +++A+DG++H +YTY+
Sbjct: 317 PVSICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLEAADGTLHASYTYH 376

Query: 310 RTQIKHVVLLP 320
           R  IK+V L P
Sbjct: 377 RRAIKYVRLAP 387


>gi|291612790|ref|YP_003522947.1| BNR/Asp-box repeat protein [Sideroxydans lithotrophicus ES-1]
 gi|291582902|gb|ADE10560.1| BNR/Asp-box repeat protein [Sideroxydans lithotrophicus ES-1]
          Length = 466

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 139/378 (36%), Gaps = 87/378 (23%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEE 76
           A+S      HAS+++E+  G     +F GS EGA DV I    F   R  W +  I    
Sbjct: 80  ASSRQHVQVHASSLIELKDGRIRAFWFSGSREGASDVTINTAVFDPARDEWGAEQIVASR 139

Query: 77  PNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSER 124
            +           + NPV  +     L LFY +   +  W+G       S + G TWS  
Sbjct: 140 SSTQRALHRYVSKLGNPVAGRAADGTLRLFY-VTVSLGGWAGSSITEMTSADDGETWSAP 198

Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
            +L   I  P        K  P L  +G +      ES++ +   + +  D     ++  
Sbjct: 199 RRL---ITSPFINISTLVKGTPFLYADGSMGLPVYHESFSKFAEILHLDADGDVMDKQ-- 253

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES---CDGGLTWSYAKPTQL 234
              +       +QPV    ++    VL R + G     ++ S    DGG  W   + +  
Sbjct: 254 --RLARAGQGTLQPVVLVKSSNDALVLTR-YAGRDNPHLARSLTTADGGRDWGAVEKSPF 310

Query: 235 LNPNSGIDGVKLKDGRLLLAYNTVSRG--VLKVALSKDDGDSWHDALTLEENLAM----- 287
            NP++ +    L DGRLL   N   +G   L + LS D G SW +   LEE   +     
Sbjct: 311 PNPDAALSATALPDGRLLAVLNHQEQGRDSLSLMLSADGGHSWKELHRLEEMRVLRDKKL 370

Query: 288 -----------------------------------------------EFSYPAVIQASDG 300
                                                          EFSYP +IQA +G
Sbjct: 371 DETQCLHIVRGLLVNSETRLAKAPAATLDEYVDSAKARVRADGGCNFEFSYPYLIQARNG 430

Query: 301 SVHITYTYNRTQIKHVVL 318
             H++YT+NR  IKHV  
Sbjct: 431 DFHLSYTWNRVFIKHVTF 448


>gi|332716574|ref|YP_004444040.1| hypothetical protein AGROH133_12331 [Agrobacterium sp. H13-3]
 gi|325063259|gb|ADY66949.1| hypothetical protein AGROH133_12331 [Agrobacterium sp. H13-3]
          Length = 398

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 142/369 (38%), Gaps = 79/369 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HAS +  +  G  L A+FGG+ EG  D+ I+      G  RW +P     +P  
Sbjct: 28  SPMIQNHASFLHLLQDGTLLCAWFGGTLEGKSDISIFASALTPGASRWGAPQRLSNDPAH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL----------PP 129
              NPV+F  P   L LF+   Q       C  R     +  +  E+L          P 
Sbjct: 88  SEQNPVIFTAPDGTLWLFH-TAQPSGNQDECRIRMARVVLDPAIPEKLSTEDGRFLDLPK 146

Query: 130 GILGPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
           G        P+ + +       +  C     + WN     + + ++ D G +W+    + 
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCLQRPGQKWNGSHDTAALGISKDNGLTWQ----LQ 199

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
               S   +   P   AN       R       V  +ES DGG +W+  + T + N NS 
Sbjct: 200 ELAGSTGCVHMSPVAGANGRYSAFFRRRQA-DFVYRTESTDGGRSWAEPQLTDVPNNNSS 258

Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
           I  ++L DGRL +  N ++                                     R  +
Sbjct: 259 IAAIRLADGRLAMICNPINAAQSSDRRASLYDELGEEDDRPDADPSGGCVPIWGVPRAPV 318

Query: 264 KVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYNRT 311
            + LS DDG S+   + +E+               +E SYP +++A+DG++H +YTY+R 
Sbjct: 319 SICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLEAADGTLHASYTYHRR 378

Query: 312 QIKHVVLLP 320
            IK+V L P
Sbjct: 379 AIKYVRLAP 387


>gi|365133288|ref|ZP_09342672.1| hypothetical protein HMPREF1032_00468 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363616098|gb|EHL67552.1| hypothetical protein HMPREF1032_00468 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 347

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 144/326 (44%), Gaps = 39/326 (11%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCE--GAP---DVKIWLQTFKDG--RWQSPIIADEEPNVP 80
           H  ++   D G+ L  + GG  +  G P   D K W+   + G   W +P     +    
Sbjct: 22  HMPSLTLADNGNLLAVWNGGFLQWNGDPMGRDAKDWVSVLEPGAKEWSNPDAVGCDIRYC 81

Query: 81  MWNPVLFKLPSNGLLLFYK--IGQEVQKWSGC-------MKRSYNKGVTWSEREQLPPGI 131
             +PV  K     ++L +   +  EV   + C        +++ + G TW   +  P GI
Sbjct: 82  CHDPVFLKNKKGEIILLFAKFLDTEVNFTTWCNGRDELWTRKTKDGGRTWEPAQ--PAGI 139

Query: 132 L-GPSKNKPILLENG-LLLCGSSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPNESLS 187
             G + N  +LL++G ++   +S E  + +   + + +  D G +W K GPI   ++   
Sbjct: 140 QSGHASNDSVLLDDGTIVFASTSSELPDKYFGAVRIYLSHDDGETWEK-GPILSADDGNL 198

Query: 188 VIQPVPFHTANRTLRVLMRSF---NGIGRVCMS------ESCDGGLTWSYAKPTQLLNPN 238
           + +P      + T+R+  R+     G G    S      ES DGG TW+   PT +LN  
Sbjct: 199 IREPALCLRPDGTIRMFTRTCPGSTGWGAGVKSLVSYTAESRDGGKTWTQPVPTTILNNE 258

Query: 239 SGIDGVKLKDGRLLLAYNTV------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           S ID +      +L+AYN         R  L +A SKD+G +W + + L      +   P
Sbjct: 259 SKIDVISWDKDTILMAYNDTPVADWHERSPLTLAYSKDEGKTWKNLIELAAAPGNKCQ-P 317

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
           A+ +  DG +++ Y    T I+H+VL
Sbjct: 318 AMCRDRDGRLNVVYMNRHTAIEHLVL 343


>gi|424908423|ref|ZP_18331800.1| 4-hydroxythreonine-4-phosphate dehydrogenase [Rhizobium
           leguminosarum bv. viciae USDA 2370]
 gi|392844454|gb|EJA96976.1| 4-hydroxythreonine-4-phosphate dehydrogenase [Rhizobium
           leguminosarum bv. viciae USDA 2370]
          Length = 716

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 140/369 (37%), Gaps = 79/369 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
           +P    HAS +  +D G  L A+FGG+ EG  D+ I+        GRW +P     +   
Sbjct: 28  SPMIQNHASFLHLLDDGVLLCAWFGGTLEGKSDISIFASVLTPGAGRWGAPQRLSHDTAR 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----------LPP 129
              NPVLF  P   L LF+   Q       C  R        SE E+          LP 
Sbjct: 88  SEQNPVLFTAPDGALWLFH-TAQPSGNQDECRIRMAKIHRNRSEPEKLISEDGRFLDLPK 146

Query: 130 GILGPSKNKPILLENG---LLLCGSSV----ESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
           G        P+ + +    LL     V    + WN     + + ++ D G +W       
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDDGETWTLQ---E 200

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +P +S+  +   P    +       R       V  +ES DGG +WS   PT + N NS 
Sbjct: 201 LP-DSIGCVHMSPVAGTDGQHSAFFRRRQA-DFVYRTESKDGGRSWSRPHPTDVPNNNSS 258

Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
           I   +L  GRL +  N V+                                     R  +
Sbjct: 259 IAVTRLGSGRLAMICNPVNAAQSSDRRASLYDELGEDDGRPDADPSGGCVPVWGVPRAPV 318

Query: 264 KVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRT 311
            + LS DDG S+   + +E            +   +E SYP +++  DG++H +YTY+R 
Sbjct: 319 TICLSDDDGRSFPTRILIENGPGTCLSNDSTDGRNLEMSYPWLLETPDGALHASYTYHRR 378

Query: 312 QIKHVVLLP 320
            IK+V L P
Sbjct: 379 AIKYVRLEP 387


>gi|308185695|ref|YP_003929826.1| sialidase [Pantoea vagans C9-1]
 gi|308056205|gb|ADO08377.1| Sialidase precursor (Neuraminidase) [Pantoea vagans C9-1]
          Length = 398

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)

Query: 12  AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
           A++     +S P    HA+ ++ +  G  L  +FGG+ EG  D+ +W      G  RW  
Sbjct: 22  AQQIAMLPSSCPQN--HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
                ++P     NPVLF  P   L L +   +   + +  +  ++S++ G +W E + L
Sbjct: 80  AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139

Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
              PG     +   ++L NG     +  C +   E W   GS+    ++++ D G+SWR 
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
              + +P+ +  V   V     + +L  L RS      +  S S D G +WS  +P  L 
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
           N NS I    L++G L L +N +S                                    
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDDEEDGDVAVAAEPVVHSGH 310

Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
                  R  + +A+S D G SW     L+E             L  EFSYP++ Q  +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370

Query: 301 SVHITYTYNRTQIKHV 316
            +HI YT+ R  IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386


>gi|304396598|ref|ZP_07378479.1| conserved hypothetical protein [Pantoea sp. aB]
 gi|304356107|gb|EFM20473.1| conserved hypothetical protein [Pantoea sp. aB]
          Length = 398

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)

Query: 12  AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
           A++     +S P    HA+ ++ +  G  L  +FGG+ EG  D+ +W      G  RW  
Sbjct: 22  AQQIAMLPSSCPQN--HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
                ++P     NPVLF  P   L L +   +   + +  +  ++S++ G +W E + L
Sbjct: 80  AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139

Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
              PG     +   ++L NG     +  C +   E W   GS+    ++++ D G+SWR 
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
              + +P+ +  V   V     + +L  L RS      +  S S D G +WS  +P  L 
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
           N NS I    L++G L L +N +S                                    
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDDEEDGDVAVAAEPVVHSGR 310

Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
                  R  + +A+S D G SW     L+E             L  EFSYP++ Q  +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370

Query: 301 SVHITYTYNRTQIKHV 316
            +HI YT+ R  IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386


>gi|408788538|ref|ZP_11200256.1| hypothetical protein C241_21817 [Rhizobium lupini HPC(L)]
 gi|408485633|gb|EKJ93969.1| hypothetical protein C241_21817 [Rhizobium lupini HPC(L)]
          Length = 398

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 138/369 (37%), Gaps = 79/369 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPIIADEEPNV 79
           +P    HAS +  +D G  L A+FGG+ EG  D+ I+        GRW +P     +   
Sbjct: 28  SPMIQNHASFLHLLDDGALLCAWFGGTLEGKSDISIFASVLTPGAGRWGAPQRLSHDTAH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----------LPP 129
              NPV F  P   L LF+   Q       C  R        SE E+          LP 
Sbjct: 88  SEQNPVFFTAPDGALWLFH-TAQPSGNQDECRIRMAKIHRNRSEPEKLISEDGRFLDLPK 146

Query: 130 GILGPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIY 180
           G        P+ + +       +  C     + WN     + + ++ D G +W       
Sbjct: 147 GCF---IRAPLRIRDDGAWLLPIFRCVQRPGQKWNGSHDTAALGISTDDGETWTLQ---E 200

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +P +S+  +   P    +       R       V  +ES DGG +WS   PT + N NS 
Sbjct: 201 LP-DSIGCVHMSPLAGTDGQHSAFFRRRQA-DFVYRAESKDGGRSWSRPHPTDVPNNNSS 258

Query: 241 IDGVKLKDGRLLLAYNTVS-------------------------------------RGVL 263
           I   +L  GRL +  N V+                                     R  +
Sbjct: 259 IAVTRLGSGRLAMICNPVNAAQSSDRRASLYDELGEDDGRPDADPSGGCVPVWGVPRAPV 318

Query: 264 KVALSKDDGDSWHDALTLEENLA------------MEFSYPAVIQASDGSVHITYTYNRT 311
            + LS DDG S+   + +E+               +E SYP +++  DG++H +YTY+R 
Sbjct: 319 TICLSDDDGRSFPTRILIEDGPGTCLSNDSTDGRNLEMSYPWLLETPDGALHASYTYHRR 378

Query: 312 QIKHVVLLP 320
            IK+V L P
Sbjct: 379 AIKYVRLEP 387


>gi|152979613|ref|YP_001345242.1| glycosyl hydrolase BNR repeat-containing glycosyl hydrolase
           [Actinobacillus succinogenes 130Z]
 gi|150841336|gb|ABR75307.1| glycosyl hydrolase BNR repeat-containing protein [Actinobacillus
           succinogenes 130Z]
          Length = 395

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 81/362 (22%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPV 85
           HA+ +  +  G  L  +FGG+ EG  D+  +    K G   W   +   ++P     NPV
Sbjct: 36  HAANLFPLPNGDLLCTWFGGTQEGISDISAYFSRLKKGSDTWTPAVKLSDDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQLPPGILGPSK----NKP 139
            F  P N L + Y       + +  ++  +S + G TW   E L   +  P+K     +P
Sbjct: 96  FFLDPDNVLWILYTAQISGNQDTAIVRYRKSTDFGETWGPIEVL---LEDPNKGVFIRQP 152

Query: 140 IL-LENG-----LLLCGSS-----VESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
           I+ L+NG     +  C +      V S+++  S + ++ D G++WR    + +PN +  V
Sbjct: 153 IVVLDNGNWLLPVFYCIARPGEKWVGSYDT--SAVMISSDKGKTWRS---VDVPNSTGCV 207

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
              +     + +L  L RS      +  S S D G TWS  K   L N N+ I    L +
Sbjct: 208 HMNI-LKLKDGSLYALYRS-RWADYIYESRSTDNGETWSEPKTLPLPNNNASIQADVLDN 265

Query: 249 GRLLLAYNTVS----------------------------------------RGVLKVALS 268
           G + L +N  S                                        R  + +A+S
Sbjct: 266 GDIALVFNNSSAKDAKERRLSLYDEIEDESKEQKKEAELVEGQRNAFWGAPRAPMSLAIS 325

Query: 269 KDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            D+G +W     L+            E L  E SYP++ Q  DG +HI YT+ R  IK+V
Sbjct: 326 TDNGATWPYIRNLDEGDGYCMSNNSREQLNRELSYPSIKQGLDGKLHIAYTFYRMAIKYV 385

Query: 317 VL 318
            +
Sbjct: 386 CV 387


>gi|187931351|ref|YP_001891335.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. mediasiatica FSC147]
 gi|187712260|gb|ACD30557.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           mediasiatica FSC147]
          Length = 372

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            N+ ++ +QP  VP  +    L +  +  + I R+  +E+ D GL+WS  KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMLSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285

Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GI  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  E SYP     
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 344

Query: 298 SDGSV 302
            D  +
Sbjct: 345 QDNII 349


>gi|440760708|ref|ZP_20939811.1| hypothetical protein F385_3741 [Pantoea agglomerans 299R]
 gi|436425461|gb|ELP23195.1| hypothetical protein F385_3741 [Pantoea agglomerans 299R]
          Length = 398

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 82/376 (21%)

Query: 12  AEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQS 69
           A++     +S P    HA+ ++ +  G  L  +FGG+ EG  D+ +W      G  RW  
Sbjct: 22  AQQIAMLPSSCP--QNHAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLTKGSDRWSD 79

Query: 70  PIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL 127
                ++P     NPVLF  P   L L +   +   + +  +  ++S++ G +W E + L
Sbjct: 80  AQKLSDDPGRSEQNPVLFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDMGKSWGEIKTL 139

Query: 128 --PPGILGPSKNKPILLENG-----LLLCGSSV-ESWNSWGSW----MEVTVDAGRSWRK 175
              PG     +   ++L NG     +  C +   E W   GS+    ++++ D G+SWR 
Sbjct: 140 LDKPGTF--IRQPIVVLPNGNWLLPVFYCLTQPGEKWV--GSYDVSAVKISEDKGQSWRD 195

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
              + +P+ +  V   V     + +L  L RS      +  S S D G +WS  +P  L 
Sbjct: 196 ---VSVPDSTGCVHMNVTL-LDDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEPLSLP 250

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS------------------------------------ 259
           N NS I    L++G L L +N +S                                    
Sbjct: 251 NNNSSIQVTTLRNGHLALVFNAMSAEGASERRLSLYDEIEDEEENGDVAIAAEPMVHSGR 310

Query: 260 -------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPAVIQASDG 300
                  R  + +A+S D G SW     L+E             L  EFSYP++ Q  +G
Sbjct: 311 TAFWGAPRAPMTLAISTDGGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKQGVNG 370

Query: 301 SVHITYTYNRTQIKHV 316
            +HI YT+ R  IK+V
Sbjct: 371 ELHIAYTWYRQAIKYV 386


>gi|420243404|ref|ZP_14747334.1| putative neuraminidase (sialidase) [Rhizobium sp. CF080]
 gi|398061196|gb|EJL52995.1| putative neuraminidase (sialidase) [Rhizobium sp. CF080]
          Length = 397

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/372 (25%), Positives = 139/372 (37%), Gaps = 85/372 (22%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
           +P    HAS +  +  G  + A+FGGS EG  D+ I+      G   W  P     +   
Sbjct: 28  SPMVQNHASFLHLLGDGALICAWFGGSLEGKSDISIFASVLVPGSDAWGPPQRLSFDDAH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
              NPVLF  P   L LF+   Q       C  R     +    R++  P  L       
Sbjct: 88  SEQNPVLFTAPDGRLWLFH-TSQPSGNQDECRIR-----MAEIRRDRSDPARLTSEAGAY 141

Query: 140 ILLENGLLLCGSSV-------------------ESWNSW--GSWMEVTVDAGRSWRKYGP 178
           + L  G  +    V                   + WN     + + ++ DAG++WR    
Sbjct: 142 LDLPQGCFVRAPLVVRDDGAWLLPIFRCIQRPGQKWNGSHDTAAIGISTDAGKTWRLED- 200

Query: 179 IYIPNESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
             +P  +  V + P              R  + + R   +ES DGG +WS  + T + N 
Sbjct: 201 --LPQSTGCVHMSPAALQAGGHAALFRRRQADFVFR---TESTDGGRSWSVPEATDVPNN 255

Query: 238 NSGIDGVKLKDGRLLLAYNTVS-------------------------------------R 260
           NS I  ++LKDGRL +  N VS                                     R
Sbjct: 256 NSSIAVIRLKDGRLAMICNPVSAAQSGDRRASLYDELGENDDRPDADPTGGCVPVWGVPR 315

Query: 261 GVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTY 308
             + V +S D G S+   + +E+                E SYP +++A DG++HI YTY
Sbjct: 316 APVCVCISDDGGLSFPVRILIEDGPGTCLSNDSTDGRNKEMSYPWLLEAPDGTLHIAYTY 375

Query: 309 NRTQIKHVVLLP 320
           +R  IK+V L P
Sbjct: 376 HRRAIKYVSLAP 387


>gi|62260730|gb|AAX77930.1| unknown protein [synthetic construct]
          Length = 407

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 79  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 137

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 138 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 197

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 198 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 253

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            N+ ++ +QP  VP  +    L +  +  + I R+  +E+ D GL+WS  KPTQL NP+S
Sbjct: 254 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 311

Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GI  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  E SYP     
Sbjct: 312 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 370

Query: 298 SDGSV 302
            D  +
Sbjct: 371 QDNII 375


>gi|56707548|ref|YP_169444.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. tularensis SCHU S4]
 gi|110670019|ref|YP_666576.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. tularensis FSC198]
 gi|134302518|ref|YP_001122488.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. tularensis WY96-3418]
 gi|254370071|ref|ZP_04986077.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis FSC033]
 gi|254874366|ref|ZP_05247076.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis MA00-2987]
 gi|379716810|ref|YP_005305146.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
           tularensis TIGB03]
 gi|379725414|ref|YP_005317600.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
           tularensis TI0902]
 gi|385794167|ref|YP_005830573.1| hypothetical protein NE061598_02225 [Francisella tularensis subsp.
           tularensis NE061598]
 gi|421752393|ref|ZP_16189421.1| hypothetical protein B345_08411 [Francisella tularensis subsp.
           tularensis AS_713]
 gi|421754259|ref|ZP_16191237.1| hypothetical protein B344_08353 [Francisella tularensis subsp.
           tularensis 831]
 gi|421755032|ref|ZP_16191986.1| hypothetical protein B343_02241 [Francisella tularensis subsp.
           tularensis 80700075]
 gi|421757985|ref|ZP_16194850.1| hypothetical protein B342_08454 [Francisella tularensis subsp.
           tularensis 80700103]
 gi|421759820|ref|ZP_16196647.1| hypothetical protein B341_08391 [Francisella tularensis subsp.
           tularensis 70102010]
 gi|424675141|ref|ZP_18112053.1| hypothetical protein B229_08357 [Francisella tularensis subsp.
           tularensis 70001275]
 gi|56604040|emb|CAG45032.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis SCHU S4]
 gi|110320352|emb|CAL08415.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis FSC198]
 gi|134050295|gb|ABO47366.1| hypothetical protein FTW_1675 [Francisella tularensis subsp.
           tularensis WY96-3418]
 gi|151568315|gb|EDN33969.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis FSC033]
 gi|254840365|gb|EET18801.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           tularensis MA00-2987]
 gi|282158702|gb|ADA78093.1| hypothetical protein NE061598_02225 [Francisella tularensis subsp.
           tularensis NE061598]
 gi|377826863|gb|AFB80111.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
           tularensis TI0902]
 gi|377828487|gb|AFB78566.1| BNR Asp-box repeat protein [Francisella tularensis subsp.
           tularensis TIGB03]
 gi|409085109|gb|EKM85261.1| hypothetical protein B344_08353 [Francisella tularensis subsp.
           tularensis 831]
 gi|409085376|gb|EKM85520.1| hypothetical protein B345_08411 [Francisella tularensis subsp.
           tularensis AS_713]
 gi|409089120|gb|EKM89173.1| hypothetical protein B343_02241 [Francisella tularensis subsp.
           tularensis 80700075]
 gi|409090199|gb|EKM90222.1| hypothetical protein B341_08391 [Francisella tularensis subsp.
           tularensis 70102010]
 gi|409091370|gb|EKM91370.1| hypothetical protein B342_08454 [Francisella tularensis subsp.
           tularensis 80700103]
 gi|417434396|gb|EKT89355.1| hypothetical protein B229_08357 [Francisella tularensis subsp.
           tularensis 70001275]
          Length = 372

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 133/305 (43%), Gaps = 29/305 (9%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKHIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
            N+ ++ +QP  VP  +    L +  +  + I R+  +E+ D GL+WS  KPTQL NP+S
Sbjct: 228 TNDGVN-LQPTVVPL-SKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDS 285

Query: 240 GIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           GI  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  E SYP     
Sbjct: 286 GIAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPY 344

Query: 298 SDGSV 302
            D  +
Sbjct: 345 QDNII 349


>gi|32473537|ref|NP_866531.1| hypothetical protein RB5143 [Rhodopirellula baltica SH 1]
 gi|32398217|emb|CAD78312.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 479

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 151/343 (44%), Gaps = 49/343 (14%)

Query: 10  LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
           L A++ TF   P    P ++ HA T         + V +G  L A  Y G +     +  
Sbjct: 122 LQAQDSTFLAPPKYIGPPQTIHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 181

Query: 57  IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
           + L T  DG   W+  ++ D + + P+  ++P L+  P   L L +   Q V        
Sbjct: 182 VVLSTSGDGGETWEEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 237

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
                GV + E E   P +  P   +   + +G+++C   V S   W     +W E    
Sbjct: 238 ----AGVWFLEIED--PDMGQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 291

Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
               V+ D G++W + G   +P +  +  + +     ++++ +L R+  GIG    S S 
Sbjct: 292 ARMIVSDDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDQSIWLLARTKYGIGE---SVST 348

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
           D G TWS   P+ + +P++     +L  G LLL  +        R  L   +S D G +W
Sbjct: 349 DRGKTWSELAPSTIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDHGRNW 408

Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
              L L+E   +  SYP   Q +DG +HI Y YNRT+ +H++L
Sbjct: 409 KGGLMLDERAGV--SYPDGQQDADGVIHIIYDYNRTKDRHILL 449


>gi|417349595|ref|ZP_12128223.1| hypothetical protein SeGA_2676 [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
 gi|353572465|gb|EHC36105.1| hypothetical protein SeGA_2676 [Salmonella enterica subsp. enterica
           serovar Gaminara str. A4-567]
          Length = 144

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 59/115 (51%), Gaps = 1/115 (0%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +   F  CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTEFFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS+
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSK 117


>gi|254368714|ref|ZP_04984727.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           holarctica FSC022]
 gi|157121635|gb|EDO65805.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           holarctica FSC022]
          Length = 372

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 83/304 (27%), Positives = 130/304 (42%), Gaps = 27/304 (8%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E AP+ KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETAPNTKIVVAVAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKYIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
            N+ ++ +QP     +      L R  +  I R+  +E+ D GL+WS  KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDSG 286

Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
           I  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  E SYP      
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPYQ 345

Query: 299 DGSV 302
           D  +
Sbjct: 346 DNII 349


>gi|417302934|ref|ZP_12090011.1| hypothetical protein RBWH47_00392 [Rhodopirellula baltica WH47]
 gi|327540766|gb|EGF27333.1| hypothetical protein RBWH47_00392 [Rhodopirellula baltica WH47]
          Length = 511

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 90/343 (26%), Positives = 151/343 (44%), Gaps = 49/343 (14%)

Query: 10  LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
           L A++++F   P    P ++ HA T         + V +G  L A  Y G +     +  
Sbjct: 154 LQAQDYSFLASPKYIGPPQTVHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPSEDANNY 213

Query: 57  IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFYKIGQEVQKWSGCMK 112
           + L T  DG   W   ++ D + + P+  ++P L+  P   L L +   Q V        
Sbjct: 214 VVLSTSGDGGETWNEVLVVDPDEDGPVRAYDPELWIAPDGKLRLIW--AQAVGHLGTI-- 269

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWG----SWME---- 164
                GV + E E   P +  P   +   + +G+++C   V S   W     +W E    
Sbjct: 270 ----AGVWFLEIED--PDVEQPVHGEAKRVTDGIMMCKPLVLSTGEWALPASTWRETDES 323

Query: 165 ----VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC 220
               V+ D G++W + G   +P +  +  + +     ++++ +L R+  GIG    S S 
Sbjct: 324 ARMIVSDDQGKTWAQRGGCNVPKDVRAFDEHMFVERNDQSIWLLARTKYGIGE---SVST 380

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSW 275
           D G TWS   P+ + +P++     +L  G LLL  +        R  L   +S D G +W
Sbjct: 381 DRGKTWSELAPSTIAHPSARFFIRRLDSGNLLLVKHGPIEERTGRSHLMAFVSIDHGKNW 440

Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
              L L+E   +  SYP   Q +DG +HI Y YNRT+ +H++L
Sbjct: 441 KGGLMLDERAGV--SYPDGQQDTDGVIHIIYDYNRTKDRHILL 481


>gi|67458777|ref|YP_246401.1| hypothetical protein RF_0385 [Rickettsia felis URRWXCal2]
 gi|67004310|gb|AAY61236.1| unknown [Rickettsia felis URRWXCal2]
          Length = 142

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 21/127 (16%)

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALS 268
           +  S S + G+ WS  +P  LLNP+S ID + L D  LLL YN V     SR +L VA S
Sbjct: 16  IHRSISINNGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLTYNRVINNRKSRNILSVATS 75

Query: 269 KDDGDSWHDALTLEEN------------LAMEFSYPAVIQASDGS-VHITYTYNRTQIKH 315
            D+G +WH  +T+  +            L+ E+SYP +I + D + +H+ YT+NR  +KH
Sbjct: 76  YDEGLNWH-PITIRNSIYPEGDIEYSNILSEEYSYPTIIMSPDNNEIHVIYTFNRINLKH 134

Query: 316 VV--LLP 320
            V  LLP
Sbjct: 135 KVFQLLP 141


>gi|421613255|ref|ZP_16054341.1| hypothetical protein RBSH_04149 [Rhodopirellula baltica SH28]
 gi|408495849|gb|EKK00422.1| hypothetical protein RBSH_04149 [Rhodopirellula baltica SH28]
          Length = 376

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 151/336 (44%), Gaps = 35/336 (10%)

Query: 10  LVAEEFTF---PANSAPFKSCHAST--------IVEVDKGHFLVA--YFGGSCEGAPDVK 56
           L A++ +F   P    P ++ HA T         + V +G  L A  Y G +     +  
Sbjct: 19  LQAQDSSFLAPPKYVGPPQTVHAVTNRAFTGIPSLAVSRGGRLWATWYAGKTPREDANNY 78

Query: 57  IWLQTFKDG--RWQSPIIADEEPNVPM--WNPVLFKLPSNGLLLFY-----KIGQEVQKW 107
           + L T  +G   W+  ++ D + N P+  ++P L+  P   L L +      +G     W
Sbjct: 79  VVLSTSGNGGETWEEVLVVDPDENGPVRAYDPELWIAPDGKLRLIWAQAVGHLGTIAGVW 138

Query: 108 SGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
              ++    +   + E +++  GI+     KP++L  G     +S        + M V+V
Sbjct: 139 FLEIEDPDVEQPVYGEAKRVTDGIM---MCKPLVLSTGEWALPASTWRETDESARMIVSV 195

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W + G   +P +  +  + +     + ++ +L R+  GIG    S S D   TW 
Sbjct: 196 DQGKTWAQRGGCNVPKDVRAFDEHIFVERNDGSIWLLARTKYGIGE---SVSTDRRKTWP 252

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYN-----TVSRGVLKVALSKDDGDSWHDALTLE 282
              P+++ +P++     +L  G LLL  +        R  L   +S DDG++W   L L+
Sbjct: 253 ELTPSKIAHPSARFFIRRLASGNLLLVKHGPIEKKTGRSHLMAFVSNDDGETWKGGLMLD 312

Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           E   +  SYP   Q ++G +HI Y YNRT+ +H++ 
Sbjct: 313 ERAGV--SYPDGQQDTNGVIHIVYDYNRTKDRHILF 346


>gi|340029880|ref|ZP_08665943.1| hypothetical protein PaTRP_14293 [Paracoccus sp. TRP]
          Length = 395

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 140/367 (38%), Gaps = 78/367 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
           +P    HA+ +   D G  + A+FGG+ EG  D+ I       G   W        +P+ 
Sbjct: 28  SPMVQNHAAFLSLADDGALICAWFGGTLEGKSDISIHASVLLPGADAWGPAQRLSHDPDH 87

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSERE----QLPPGIL 132
              NPV+F  P+ GL LF+   Q       C   M   +  GV  +  E     LP G  
Sbjct: 88  SEQNPVIFAAPNGGLWLFH-TSQPSGNQDQCRIRMAALHRNGVAITAEEGRFLDLPRGCF 146

Query: 133 GPSKNKPILLENG------LLLC-GSSVESWNSW--GSWMEVTVDAGRSWRKYGPIYIPN 183
                 P+ L         +  C     + WN     + M V+ D G+SWR      +P+
Sbjct: 147 ---VRAPVRLRGDGAWLLPIFRCVQRPGQKWNGSHDTAAMGVSTDGGQSWRLED---VPD 200

Query: 184 ESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
              SV + PV             R  + + R   S+S +GG +W+    T + N NS I 
Sbjct: 201 SVGSVHMSPVDLGEGRMAAFYRRRQSDFVHR---SDSANGGHSWAAPVATDVPNNNSSIA 257

Query: 243 GVKLKDGRLLLAYNTVS-------------------------------------RGVLKV 265
            V L+DGR+ +  N V+                                     R  + +
Sbjct: 258 AVTLRDGRIAMICNPVNAAMSPDRRTSLYDELGEDHDRPDADPTGGCVPVWGVPRAPVAI 317

Query: 266 ALSKDDGDSWHDALTLEENLAM------------EFSYPAVIQASDGSVHITYTYNRTQI 313
            +S D G ++     +E+                E SYP +++  DG++H+ YTY+R  I
Sbjct: 318 CVSSDGGQTFPRRYLVEDGPGTCTSNDSTDGRNKEMSYPWLLEGPDGTLHLAYTYHRRAI 377

Query: 314 KHVVLLP 320
           K++ L P
Sbjct: 378 KYLQLQP 384


>gi|398806469|ref|ZP_10565376.1| hypothetical protein PMI15_04242 [Polaromonas sp. CF318]
 gi|398088078|gb|EJL78649.1| hypothetical protein PMI15_04242 [Polaromonas sp. CF318]
          Length = 372

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 143/361 (39%), Gaps = 76/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG---RW--QSPIIADEEPNVPMW 82
           HA+++VE+  G     +F GS EGA DV I      D    RW  +S +   +     +W
Sbjct: 2   HAASLVELRDGRLRAVWFSGSREGAGDVTI-QSAVTDATGLRWGAESTLFDRQALQQGLW 60

Query: 83  -------NPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPPG-- 130
                  NPV+ + P +G L  + +   +  W+G      RS ++G +WS   +L     
Sbjct: 61  RYVKKIGNPVIARAP-DGSLNLWMVNVSLGGWAGSSITWARSTDEGASWSAPRRLVTSPF 119

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    K  P++ ENG        E    +G  + ++       +    + IP    S 
Sbjct: 120 LNISTLVKGTPVIFENGETGLPVYHEFVAKFGEVLRISPQG----QVVDKVRIPGSRTS- 174

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           +QPV   +  +  +V MRS N    +  S + D G TW  A  +   NP+S + G     
Sbjct: 175 LQPVVLVSGPQKAQVYMRSGNATA-LMASSTTDAGKTWLPAHASAWPNPDSALAGTMTNS 233

Query: 249 GRLLLAYNTVSRGVLKVAL------------------------SKDDGDSWHDALTLE-- 282
           G   LA N   R    +AL                        ++     +   LT E  
Sbjct: 234 GTQWLALNPAPRNRETLALLQTATGGSFDGVTPWIVESSANPQARTSMGDYERLLTQELK 293

Query: 283 -----------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
                                  ++ + EFSYP ++Q+ DG +H+ YT++RT+IKHV L 
Sbjct: 294 ARGASDGEAQAYVASARRQLCGGQSCSQEFSYPYLLQSRDGYLHLLYTWHRTRIKHVRLD 353

Query: 320 P 320
           P
Sbjct: 354 P 354


>gi|67901886|ref|XP_681199.1| hypothetical protein AN7930.2 [Aspergillus nidulans FGSC A4]
 gi|40740394|gb|EAA59584.1| hypothetical protein AN7930.2 [Aspergillus nidulans FGSC A4]
 gi|259480678|tpe|CBF73543.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 333

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 132/336 (39%), Gaps = 72/336 (21%)

Query: 50  EGAPDVKIWLQT--FKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKW 107
           EG PD+ I+     F +  W   I    +      NPV+F+ PS  L L Y   Q   + 
Sbjct: 2   EGKPDISIYFSRLPFGNRTWSEAIKVTHDETRSEQNPVMFQHPSGELWLLYTSQQGGNQD 61

Query: 108 SGCMKR--SYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLCGS---SVESWNSW- 159
           S  +KR  S++ G  WS+   +   PG     +   I+L+NG+ +  +    VE    W 
Sbjct: 62  SAVVKRTISFDDGNAWSDPTVIFDEPGTF--IRQPVIILKNGIFVIPTFKCRVELGAKWI 119

Query: 160 ----GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
                S +  + D G++W +  PI    ES   +              L RS        
Sbjct: 120 GNDDISAIRTSDDQGQTWSEI-PI---AESTGCVHMEIQRLKTGEYLALYRS--RWADFI 173

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---------------- 259
            S +   GL W+  KPT L NPN+GI    L  GR+++ YN  S                
Sbjct: 174 YSSTSPDGLAWTAPKPTSLPNPNAGICFDVLPSGRVVVVYNHSSRKNALGRREGLYDEII 233

Query: 260 -----------------------RGVLKVALSKDDGDSWH-------DALTL----EENL 285
                                  R  L VA S D+G +W        D   L    E+ L
Sbjct: 234 EGKDSRPNQKPRPDGKEAFWGAPRAPLSVAWSDDEGRTWQHRTLEEGDGYCLTNNSEQKL 293

Query: 286 AMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
             E SYP+++   DGS+HI +T+ R  IK+V +  N
Sbjct: 294 NRELSYPSMVVGEDGSIHIAFTFWRQTIKYVQIGEN 329


>gi|326430587|gb|EGD76157.1| hypothetical protein PTSG_00864 [Salpingoeca sp. ATCC 50818]
          Length = 376

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 147/342 (42%), Gaps = 39/342 (11%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGS-CEGAPDVKIWLQTFKDG--RW 67
           V+  + F  N++   S H S I  +  G  + A+ G S  EG  D  I+  + KDG   W
Sbjct: 39  VSHHWVFQTNASIVYS-HMSMIEALPDGTLVAAFQGSSLSEGCNDQAIFFTSSKDGGVTW 97

Query: 68  QSPIIADEEPNVPMWNPVL-FKLPSNGLLLFYKIGQEVQKWS----GCMKRSY------- 115
            +P +  +   + +W PVL F   +N L LF+    + Q       G  KRSY       
Sbjct: 98  -APAVVAQNDTLALWGPVLRFDKATNKLWLFFSASTKYQNRDKPGCGSFKRSYPGGEVRY 156

Query: 116 ----NKGVTWSEREQLPP-----GILGPSKNKPILLENGLLLCGSSVESWNSW-----GS 161
               + G TW+    + P      +   + N  I+   G  +     E    +      S
Sbjct: 157 VTSTDGGATWTNPVVILPFEARGQVSKVTANHMIITATGRWMLPFWQEPRTVYDKGLSAS 216

Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
            + V+ D G +W   G I+  N S  +I+       + +L  L R+    G +  S S D
Sbjct: 217 GVLVSDDNGTTWTTRGSIH--NNSTWLIENTLHQFQDSSLIQLFRT--QAGYIYESWSTD 272

Query: 222 GGLTWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYN--TVSRGVLKVALSKDDGDSWHDA 278
            G TW+ A P+ LLNPNS +   +   +G  +LAYN    +R  L +A S   G +W   
Sbjct: 273 NGNTWTTAMPSSLLNPNSKVCLTRGSVNGSAILAYNPSKTARTPLVLAQSSLTGHTWTTT 332

Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            +L+ +  M F YP  +  + G V  TY+ N ++   + + P
Sbjct: 333 ASLDNDPNMSFEYPTPLLHA-GRVFTTYSANSSKGIKLAISP 373


>gi|381405652|ref|ZP_09930336.1| sialidase [Pantoea sp. Sc1]
 gi|380738851|gb|EIB99914.1| sialidase [Pantoea sp. Sc1]
          Length = 398

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 146/360 (40%), Gaps = 76/360 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ ++ +  G  L  +FGG+ EG  D+ +W    + G  RW       E+P     NPV
Sbjct: 36  HAANLLPLPDGDLLCVWFGGTQEGIADISVWCSRLQKGSDRWSEAQKLSEDPTRSEQNPV 95

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCM--KRSYNKGVTWSEREQL--PPGILGPSKNKPIL 141
           LF  P   L L +   +   + +  +  ++S++ G +W   + L   PG     +   ++
Sbjct: 96  LFLDPDQVLWLLWTAQKSGNQDTAIVRYRQSHDLGRSWGAIDTLLDKPGTF--IRQPIVV 153

Query: 142 LENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
           L NG     +  C +   E W  N   S ++++ D GRSWR    + +P+ +  V   V 
Sbjct: 154 LPNGNWLLPVFYCLTPPGEKWVGNYDVSAVKISEDKGRSWRD---VSVPDSTGCVHMNVT 210

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
               + +L  L RS      +  S S D G +WS  +   L N NS I    L++G L L
Sbjct: 211 L-LEDGSLLALYRS-RWADHIYQSRSFDQGESWSAPEALSLPNNNSSIQVTTLRNGHLAL 268

Query: 254 AYNTVS-------------------------------------------RGVLKVALSKD 270
            +N +S                                           R  + +A+S D
Sbjct: 269 VFNAMSAEGASERRLSLYDEIEDDEEDGEVAVAAEAVVHSGRTAFWGAPRAPMTLAISTD 328

Query: 271 DGDSWHDALTLEE------------NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            G SW     L+E             L  EFSYP++ +   G +HI YT+ R  IK+V +
Sbjct: 329 GGKSWPLQRNLDEGDGYCMTNNSQQKLNREFSYPSIKEGLHGELHIAYTWYRQAIKYVRI 388


>gi|417511439|ref|ZP_12176059.1| hypothetical protein LTSESEN_3058 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. A4-543]
 gi|353643034|gb|EHC87321.1| hypothetical protein LTSESEN_3058 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. A4-543]
          Length = 138

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 65/136 (47%), Gaps = 7/136 (5%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSEDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS------ 122
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS      
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFITSSDRGFTWSTPAPLV 122

Query: 123 EREQLPPGILGPSKNK 138
             + LP G +  S+ K
Sbjct: 123 NDDILPRGPVARSRKK 138


>gi|399039745|ref|ZP_10735254.1| putative neuraminidase (sialidase) [Rhizobium sp. CF122]
 gi|398062158|gb|EJL53939.1| putative neuraminidase (sialidase) [Rhizobium sp. CF122]
          Length = 395

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 142/363 (39%), Gaps = 74/363 (20%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G   W  P    ++   
Sbjct: 35  SPCIQNHAANLAFLPDGTLTCVWFGGTMEGMGDISIYMSRLAPGATHWSQPEKMSDDSAK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSK 136
              NP++F  P   + L Y   Q      G + +   S + G T+     L   I G   
Sbjct: 95  SEQNPLIFNAPDGNVWLLY-TSQTSGNQDGSVVKCRISEDGGKTFGVVRVLC-DIPGTFV 152

Query: 137 NKPILLENG------LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
            +PI++         +  C G   E W  +   + + ++ D G+SW       +P+   +
Sbjct: 153 RQPIIVNKTGSWLLPVFRCIGLPGERWTGDVDTAGVLISRDQGKSWAM---AEVPDSIGA 209

Query: 188 V-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
           V +  VP    N       R       V  S S DGG  WS  +P  L N NS I   KL
Sbjct: 210 VHMNIVPIDDDNLVAFYRNRFAES---VLYSRSSDGGEHWSAPRPIGLPNNNSSIQATKL 266

Query: 247 KDGRLLLAYNT---------------------------------------VSRGVLKVAL 267
           ++G + + YN                                        V R  L +A 
Sbjct: 267 ENGMIAMVYNHSNALTSDARRQSLYDEIEGDEETKVAPAPTVERRKAVWGVPRAPLSLAF 326

Query: 268 SKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           S+D G S+   + L+            +++  EFSYP++++ +DG+VH+ YTY R  IK+
Sbjct: 327 SRDGGASFPRRIDLDTGDGYCLSNNSKDSVNREFSYPSIVEGADGAVHVAYTYYRRAIKY 386

Query: 316 VVL 318
           V L
Sbjct: 387 VRL 389


>gi|89255875|ref|YP_513237.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. holarctica LVS]
 gi|115314364|ref|YP_763087.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. holarctica OSU18]
 gi|156501859|ref|YP_001427924.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
           subsp. holarctica FTNF002-00]
 gi|254367239|ref|ZP_04983267.1| bNR/Asp-box repeat protein [Francisella tularensis subsp.
           holarctica 257]
 gi|290953330|ref|ZP_06557951.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
           subsp. holarctica URFT1]
 gi|422938340|ref|YP_007011487.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. holarctica FSC200]
 gi|423050201|ref|YP_007008635.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
           subsp. holarctica F92]
 gi|89143706|emb|CAJ78905.1| BNR/Asp-box repeat protein [Francisella tularensis subsp.
           holarctica LVS]
 gi|115129263|gb|ABI82450.1| possible neuraminidase [Francisella tularensis subsp. holarctica
           OSU18]
 gi|134253057|gb|EBA52151.1| bNR/Asp-box repeat protein [Francisella tularensis subsp.
           holarctica 257]
 gi|156252462|gb|ABU60968.1| putative neuraminidase with BNR/Asp-box repeat [Francisella
           tularensis subsp. holarctica FTNF002-00]
 gi|407293491|gb|AFT92397.1| BNR/Asp-box repeat-containing protein [Francisella tularensis
           subsp. holarctica FSC200]
 gi|421950923|gb|AFX70172.1| BNR/Asp-box repeat-containing neuramidase [Francisella tularensis
           subsp. holarctica F92]
          Length = 372

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 82/304 (26%), Positives = 129/304 (42%), Gaps = 27/304 (8%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPII 72
           AN   FK  HAS++  +D   F + ++    E A + KI +   +   G+W      P++
Sbjct: 53  ANLKYFKYNHASSMTTIDNKLF-ITWYSSDQETATNTKIVVAVAEKVAGKWHFNEIKPVM 111

Query: 73  ADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE-R 124
             +E           + NP+++       L+F          S  +  S + G TWS+ +
Sbjct: 112 NRQEFQSIFKKYIHHLGNPIIYSQAKRLWLVFTSSSGGWVTSSLNIMYSDDLGKTWSQPK 171

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
             L   IL  S   +   I L+N         E  N  G W     D    +       +
Sbjct: 172 TILSSNILNFSTLTRGAAIELDNNRFAIPVYKEFNNLNGRWFVFNKDGELIFVSE----M 227

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
            N+ ++ +QP     +      L R  +  I R+  +E+ D GL+WS  KPTQL NP+SG
Sbjct: 228 TNDGVN-LQPTVVPLSKTHALALYRQMHSPIKRIYTNETSDSGLSWSKVKPTQLDNPDSG 286

Query: 241 IDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
           I  +K+++G +LLAYN  T SR  L +A   D+   W +  T    +  E SYP      
Sbjct: 287 IAAIKIQNG-ILLAYNNATDSRADLSLAFKADNSQQWRNIYTFPNKIKGELSYPTFTPYQ 345

Query: 299 DGSV 302
           D  +
Sbjct: 346 DNII 349


>gi|384220758|ref|YP_005611924.1| hypothetical protein BJ6T_70870 [Bradyrhizobium japonicum USDA 6]
 gi|354959657|dbj|BAL12336.1| hypothetical protein BJ6T_70870 [Bradyrhizobium japonicum USDA 6]
          Length = 401

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 71/361 (19%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ ++ +  G     +FGG+ EG  D+ ++      G  +W       ++   
Sbjct: 36  SPCVQNHAANLMPLANGDLACVWFGGTQEGMSDISVYFSRLARGATQWSPAEKLSDDSGR 95

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR--SYNKGVTWSEREQLPPGILGPSK- 136
              NPVLF  P   L L +       + +  ++R  S + G +W   E L P   G    
Sbjct: 96  SEQNPVLFPAPDGTLWLMWTAQLAGNQDTALIRRRLSRDNGKSWGPIETLFPEQRGRGTF 155

Query: 137 -NKPIL-LENG-----LLLCGSSV-ESWN--SWGSWMEVTVDAGRSWRKYGPIYIPNESL 186
             +PI+ L+NG     +  C S+    WN     S ++++ D+G +WR    + +P  S 
Sbjct: 156 IRQPIVALDNGDWLLPVFYCHSTPGRKWNGDEDTSAVKISSDSGATWRD---VDVPG-SR 211

Query: 187 SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
             +        + +L  L RS      +  S S D G TWS    T L N NS I   +L
Sbjct: 212 GCVHMCIARLDDSSLIALYRS-RWADTIYQSRSTDHGRTWSAPVQTVLPNNNSSIQVTRL 270

Query: 247 KDGRLLLAYNTVS----------------------------------------RGVLKVA 266
            +G L L +N  S                                        R  + +A
Sbjct: 271 GNGHLALVFNDSSARDATGRRLSLYDEIEDDLPPVLAGDGKRTDERTAFWGAPRAPMTLA 330

Query: 267 LSKDDGDSWH-------DALTLEEN----LAMEFSYPAVIQASDGSVHITYTYNRTQIKH 315
           +S D G +W        D   +  N       E SYP+V + +DG++H+ +T++R  IK+
Sbjct: 331 VSADGGRTWSKRNVETGDGYCMTNNSRDQTNRELSYPSVKETADGAIHMAFTFHRRAIKY 390

Query: 316 V 316
           V
Sbjct: 391 V 391


>gi|452747977|ref|ZP_21947766.1| hypothetical protein B381_09501 [Pseudomonas stutzeri NF13]
 gi|452008126|gb|EME00370.1| hypothetical protein B381_09501 [Pseudomonas stutzeri NF13]
          Length = 387

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 140/363 (38%), Gaps = 82/363 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
            HAS++  +  G  +  +F GS EGA DV++           W +       +S   A +
Sbjct: 23  VHASSVTSLPDGGLMAVWFAGSREGAADVQVRGARFDAIAAEWGEELVLATRESTQKATQ 82

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP--- 129
           +    + NPV+   P N L LFY +   V  W+G       S + G TWS   QL     
Sbjct: 83  KHIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSAINAMYSDDLGQTWSPPRQLITTPF 141

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNES 185
             I    +N P+  ++G +      E    +  ++ ++ +    G+S    G     N+S
Sbjct: 142 LNISTLVRNAPVFHQDGTIGLPVYHEFLGKFAEYLYISPNGEVIGKSRISKG-----NDS 196

Query: 186 LS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           L   + P+    A   LR    S +   RV  S + D G TWS   P    NPNS +  V
Sbjct: 197 LQPTVVPMDERHAVAMLRYAGASHH---RVLASRTADAGRTWSEPYPIDPANPNSALAAV 253

Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE------------------- 283
              +  LL+A N +  G  ++ L + D   D W   + L+E                   
Sbjct: 254 ATPNHGLLVALNDLEDGRFRLRLMETDSSLDLWKPVIDLDESPDPEGNPFTPEVYREIIG 313

Query: 284 --------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQI 313
                               NL            E+ YP  I++SDG  H+ Y++N T I
Sbjct: 314 DKFRLSSGPRRLSLVDEFLTNLDQRVCKTHGCDFEYEYPYFIRSSDGLYHLVYSWNNTFI 373

Query: 314 KHV 316
           KHV
Sbjct: 374 KHV 376


>gi|417383912|ref|ZP_12149463.1| hypothetical protein LTSEJOH_3062 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
 gi|353610535|gb|EHC63463.1| hypothetical protein LTSEJOH_3062 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
          Length = 133

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 62/131 (47%), Gaps = 1/131 (0%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP 128
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS    L 
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWSTPAPLV 122

Query: 129 PGILGPSKNKP 139
              + P +  P
Sbjct: 123 NDDILPRRAVP 133


>gi|218508465|ref|ZP_03506343.1| putative glycosyl hydrolase protein [Rhizobium etli Brasil 5]
          Length = 340

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 138/349 (39%), Gaps = 86/349 (24%)

Query: 44  YFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIG 101
           +FGG+ EG  D+ I++     G  RW  P    ++P     NP++F  P     L Y   
Sbjct: 2   WFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEKSEQNPLIFNAPDGKTWLLY-TS 60

Query: 102 QEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGPSKNKPILLENG-----LLLC-G 150
           Q      G + +   S + G T+   + L   PG     + + ++ + G     +  C G
Sbjct: 61  QTSGNQDGSIVKCRISDDGGKTFGPVQILCDSPGTF--VRQQIVVNDRGDWLLPIFRCVG 118

Query: 151 SSVESWNSWGSWMEVTV--DAGRSWR------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
              + W+       V +  D G SW+        G +++    L     V F+  NR   
Sbjct: 119 LEGQRWSGDADTAAVLISRDGGGSWQMRDIPDSIGAVHMNILPLGGGDMVAFYR-NRFAE 177

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------ 256
            ++ S           S DGG TWS  +PT+L N NS I    L +G + + YN      
Sbjct: 178 TILSS----------RSADGGETWSPPEPTELPNNNSSIQATVLDNGGIAMVYNHSNATT 227

Query: 257 ---------------------------------TVSRGVLKVALSKDDGDSWHDALTLE- 282
                                             V R  L +A+S+D G ++   + L+ 
Sbjct: 228 SDARRQSLYDEIEGDEAGENAAVVADAGRKAVWGVPRAPLSLAISRDGGTTFPHRIDLDT 287

Query: 283 -----------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
                      ++L  EFSYP+VI+ SDG++H+  TY R  IK+V L P
Sbjct: 288 GDGFCLSNNSKDSLNREFSYPSVIEGSDGTLHVACTYYRRAIKYVRLAP 336


>gi|238482243|ref|XP_002372360.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220700410|gb|EED56748.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 332

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 90/331 (27%), Positives = 135/331 (40%), Gaps = 69/331 (20%)

Query: 50  EGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKW 107
           EG PD+ I+L   + G   W   I    +      NPVLF+ P+  L L Y       + 
Sbjct: 2   EGKPDISIYLSRLRSGEQSWSEAIRMTHDNTRSEQNPVLFRTPTGDLWLLYTSQHAGNQD 61

Query: 108 SGCMKRSYNK--GVTWSEREQLPPGILGPSKNKPILLENG---LLLCGSSVESWNSW--- 159
           S  +KR  +K  G+TW + E L P      +   I+L++G   + +    VE    W   
Sbjct: 62  SAIVKRRVSKDDGITWGKEEVLFPDSGIFIRQPAIVLDDGAWVIPVFKCRVEPGERWLGN 121

Query: 160 --GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
              S + V+ D G++W +     IP ES   +        + +   L RS      + ++
Sbjct: 122 NDISCIRVSRDEGQTWTES---VIP-ESTGCVHMEIQRLKDCSYLGLFRS-RWADHIYLA 176

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------------ 259
            S DG L+WS  +PT L NPN+GI    L  GR+++ YN  S                  
Sbjct: 177 TSPDG-LSWSPPQPTILPNPNAGICFDVLPSGRVVVVYNHSSKLDATGRRQGLYDDIADG 235

Query: 260 ---------------------RGVLKVALSKDDGDSWHDAL-----------TLEENLAM 287
                                R  L VA S D G +W   +             E+ L  
Sbjct: 236 VDERRDQSSTQDGRESFWGAPRAPLCVAWSDDSGKTWERRVLEDGDGYCMTNNSEKKLNR 295

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           E SYP+++   +G +HI YT+ R +IK+V L
Sbjct: 296 ELSYPSMV-LDEGKIHIAYTFWRQRIKYVQL 325


>gi|417531554|ref|ZP_12186236.1| hypothetical protein LTSEURB_2816 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
 gi|353664664|gb|EHD03026.1| hypothetical protein LTSEURB_2816 [Salmonella enterica subsp.
           enterica serovar Urbana str. R8-2977]
          Length = 137

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 57/114 (50%), Gaps = 1/114 (0%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQ 68
            LV  +   P +      CHAST+V +  G  + A+F G  EG+ D  IWL  ++   W 
Sbjct: 4   ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLREGSKDTAIWLSRYEHNIWT 63

Query: 69  SPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWS 122
           +P           WNPVLF  PS+ L LFYK+G +V  W      S ++G TWS
Sbjct: 64  TPQRVAAREGEAHWNPVLF-YPSDKLWLFYKVGSDVHVWKTWFIISSDRGFTWS 116


>gi|392421090|ref|YP_006457694.1| hypothetical protein A458_10170 [Pseudomonas stutzeri CCUG 29243]
 gi|390983278|gb|AFM33271.1| hypothetical protein A458_10170 [Pseudomonas stutzeri CCUG 29243]
          Length = 387

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 140/363 (38%), Gaps = 82/363 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
            HAS++  +  G  +  +F GS EGA DV++           W +       +S   A +
Sbjct: 23  VHASSVTSLPDGGLMAVWFAGSREGAADVQVRGARFDAIAAEWGEELVLATRESTQKATQ 82

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP--- 129
           +    + NPV+   P N L LFY +   +  W+G       S + G TWS   QL     
Sbjct: 83  KHIRKLGNPVIALAPDNRLWLFY-VSVSIGGWAGSAINAMYSDDLGETWSTPRQLITTPF 141

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVD---AGRSWRKYGPIYIPNES 185
             I    +N P+  ++G +      E    +  ++ ++ +    G+S    G     N+S
Sbjct: 142 LNISTLVRNAPVFHQDGTIGLPVYHEFLGKFAEYLYISPNGEVTGKSRISKG-----NDS 196

Query: 186 LS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
           L   + P+    A   LR    S +   RV  S + D G TWS   P    NPNS +  V
Sbjct: 197 LQPTVVPMDEQHAVAMLRYAGASHH---RVLASRTADAGRTWSKPYPIDPANPNSALAAV 253

Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE------------------- 283
              +  LL+A N +  G  ++ L + D   D W   + L+E                   
Sbjct: 254 ATPNHGLLVALNDLEDGRFRLRLMETDSSLDLWKPVIDLDESPDPEGNPFTPEVYREIIG 313

Query: 284 --------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQI 313
                               NL            E+ YP  I+++DG  H+ Y++N T I
Sbjct: 314 DKFRLSSGPRRLSLVDEFLTNLDKRVCKTHGCDFEYEYPYFIRSADGLYHLVYSWNNTFI 373

Query: 314 KHV 316
           KHV
Sbjct: 374 KHV 376


>gi|357638395|ref|ZP_09136268.1| BNR/Asp-box repeat protein [Streptococcus urinalis 2285-97]
 gi|418416853|ref|ZP_12990052.1| hypothetical protein HMPREF9318_00800 [Streptococcus urinalis
           FB127-CNA-2]
 gi|357586849|gb|EHJ56257.1| BNR/Asp-box repeat protein [Streptococcus urinalis 2285-97]
 gi|410874671|gb|EKS22602.1| hypothetical protein HMPREF9318_00800 [Streptococcus urinalis
           FB127-CNA-2]
          Length = 310

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 130/292 (44%), Gaps = 51/292 (17%)

Query: 67  WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFY------KIGQEVQKWSGCM--KRSYNKG 118
           W    I  +  N    NP  F+ P   + L Y      + G++  +++  +  ++S ++G
Sbjct: 2   WSEAEIVSKGENRSEQNPAFFRSPDGEIWLIYTSQLSRQEGKDNMQFTSIIMVQKSSDEG 61

Query: 119 VTWSEREQLPPGILGPSKNKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSW 173
           VTW + E L P     S+    +L NG       +C  S E   +  +   V+ D G++W
Sbjct: 62  VTWGDPEVLFPEEGTFSRQAIQILSNGRWIFATWICEDSPEGLTNDPTEFRVSDDTGQTW 121

Query: 174 RKYGPIYIPNES----LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
           +    + +P+ +     ++I+  P H     L   MRS      V +SES D G +WS  
Sbjct: 122 KI---VRMPDSAGRVHANLIEVEPGH-----LVAFMRS-RFADNVYISESFDYGDSWSVP 172

Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSW 275
           + T L N N+ I  +KL+ G + LAYN  +              R  + V++++D G +W
Sbjct: 173 EKTVLSNNNASISAIKLQTGEIALAYNDNAIENPEFGKVAWPGLRNPVAVSITEDFGKTW 232

Query: 276 HDALTLE--ENLA--------MEFSYPAVIQASDGSVHITYTY-NRTQIKHV 316
                 E  EN           ++ YP + Q+ +G +H+ Y Y NR  IK+V
Sbjct: 233 PIGRVFEQAENFIGAENRTNNKQYEYPTLYQSQNGDLHLVYAYRNRLCIKYV 284


>gi|320107998|ref|YP_004183588.1| hypothetical protein AciPR4_2821 [Terriglobus saanensis SP1PR4]
 gi|319926519|gb|ADV83594.1| hypothetical protein AciPR4_2821 [Terriglobus saanensis SP1PR4]
          Length = 412

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 87/348 (25%), Positives = 139/348 (39%), Gaps = 63/348 (18%)

Query: 24  FKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNVPM 81
           F S HA+ ++ +  G  L   F G+ EG  +V I +     G   W+  ++ D E     
Sbjct: 57  FSSSHAANLLLLRNGDILCFSFSGTREGDSNVAIVVSRLPKGSQTWEPAVLVDREEGKSY 116

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKW---SGCMKRSYNKGV-TWSEREQLPPGILGPSKN 137
            NPV  +  ++G +  Y   Q   K    S  +K   + G  TWS+ E +        + 
Sbjct: 117 QNPVPIE-DAHGRIWLYNTSQSANKGQADSQVLKVYSDDGAKTWSKPEVVFDKAGSYLRQ 175

Query: 138 KPILLENGLLL-------CGSSVESWNSWGSWMEVTVDAGRSWRK---YGPIYIPNESLS 187
             ++ ENG LL               ++  S ++V+ DAG++WR+    G   +   S+ 
Sbjct: 176 PTVMGENGELLLPIYYSTSSGITNGADTNYSVVDVSKDAGKTWRECRVTGSNGLVQMSIV 235

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P  +    R+     R  + I R   +  C+    W+    T L N N+ I    LK
Sbjct: 236 KLRPKSYVAFFRS-----RYADFIYRSTSTNGCE----WTAPTKTVLPNNNASIQATLLK 286

Query: 248 DGRLLLAYN----------------TVSRGVLKVALSKDDGDSWHDALTLEENLAM---- 287
           DG + +A++                T  R  L VALS D G +W     LE   A     
Sbjct: 287 DGHIAIAFDNTQGKSPEKLAAHEPQTGPRAPLSVALSSDAGLTWTSVRDLEIPDAAVRAS 346

Query: 288 -----------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
                            E+SYP+++Q   G + + YTY R  IK V+ 
Sbjct: 347 VVNSPGVGDTIQFPGEEEYSYPSILQNPSGQILVAYTYRRVAIKAVLF 394


>gi|339058255|ref|ZP_08648757.1| BNR/Asp-box repeat protein [gamma proteobacterium IMCC2047]
 gi|330720539|gb|EGG98821.1| BNR/Asp-box repeat protein [gamma proteobacterium IMCC2047]
          Length = 387

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/325 (26%), Positives = 147/325 (45%), Gaps = 33/325 (10%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
           A+S   +  H++ +V++  G+    ++GG+ EGA DV I+      +   W    +  E 
Sbjct: 54  ASSGVTREVHSAVMVKLGNGNLRAFWYGGAREGAKDVSIYTAELDLQSNEWLDERVLVER 113

Query: 77  PNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSER 124
                        + NPV F+     L LF+ +   V  W+G       S + G +W+  
Sbjct: 114 HATAQQLGIYLRKIGNPVTFRDGDERLWLFF-VSVSVGGWAGSTINFMISDDDGQSWTTP 172

Query: 125 EQLPPGIL----GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           ++L    L       K +P L  +G +      E    +G  + ++  +G    K   + 
Sbjct: 173 KRLIASPLFNLSTLVKGEPFLYTDGSIGLPVYHEMIGKFGELIHLS-SSGEVLDK---VR 228

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES--CDGGLTWSYAKPTQLLNPN 238
           + +  +S +QP+   ++ +  R+ MR+        + E+   D G +W+      L NPN
Sbjct: 229 LMSGRVS-LQPIVLPSSGQHARIYMRNATDAPDSTLFETWTTDAGESWAPVVRNDLPNPN 287

Query: 239 SGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGDSWHDALTLEEN-----LAMEFSY 291
           + I G++L DG LLL  N     R  L +  S DDGDSW      E+         +F+Y
Sbjct: 288 AAITGLRLDDGGLLLVINNQHDKRNNLSLMHSVDDGDSWQFLYQFEQEENTPEFEHQFAY 347

Query: 292 PAVIQASDGSVHITYTYNRTQIKHV 316
           P++ Q+SDG  H+ YT+N+ +IKH+
Sbjct: 348 PSLRQSSDGDFHVVYTWNKERIKHI 372


>gi|87310619|ref|ZP_01092747.1| hypothetical protein DSM3645_26524 [Blastopirellula marina DSM
           3645]
 gi|87286600|gb|EAQ78506.1| hypothetical protein DSM3645_26524 [Blastopirellula marina DSM
           3645]
          Length = 515

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 126/296 (42%), Gaps = 36/296 (12%)

Query: 44  YFGGSCEGAPDVKIWLQTFKDGRWQSP----IIADEEPNVPMWNPVLFKLPSNGLLLFYK 99
           Y GG  E   +  + + +  +G+  S     I  D+E  V  ++P  +  PS  L LF+ 
Sbjct: 52  YGGGVTEDRHNYILLVSSDDEGKTWSENKVVIDPDKEGPVRAFDPCPWVDPSGKLWLFW- 110

Query: 100 IGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSW 159
                Q   G     +   +T        P    P  + P L+ NG+++C  +  +   W
Sbjct: 111 ----CQATRGGGGDPHTFAMTTEN-----PDDAQPKWSAPQLVHNGVMMCKPTALANGDW 161

Query: 160 ----GSW-------MEVTVDAGRSWRKYGPIYIPN-ESLSVIQPVPFHTANRTLRVLMRS 207
                 W       +  + D G++W   G   +P  +  +  +P+     N  L  L+R+
Sbjct: 162 LLPTAIWGRDHSCRVVASTDQGKTWSLRGTANVPAAKDRNCDEPMIVQRKNGELWQLVRT 221

Query: 208 FNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA-----YNTVSRGV 262
             GIG    S S DGG TW+   P  + +  S     +L+ G LLL      +    R  
Sbjct: 222 SYGIGE---SHSTDGGKTWTAVTPWNIRHTASRFFIRRLQSGNLLLVKHGPLHEKTGRSH 278

Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           L   LSKDDG SW   L L+E   +  SYP   ++ DG V++TY Y+R   K++ +
Sbjct: 279 LTAYLSKDDGQSWEGGLLLDERSGV--SYPDGAESQDGVVYLTYDYDRQGQKYIYM 332


>gi|149177101|ref|ZP_01855708.1| hypothetical protein PM8797T_26875 [Planctomyces maris DSM 8797]
 gi|148843993|gb|EDL58349.1| hypothetical protein PM8797T_26875 [Planctomyces maris DSM 8797]
          Length = 542

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/205 (31%), Positives = 101/205 (49%), Gaps = 17/205 (8%)

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIY 180
           W+E   L PG++    NKPI+ +NG  L  +++   +     M V+ D GR+W   G   
Sbjct: 159 WTEPRALFPGVM---INKPIVTKNGEWLMPAAIWRQDDSCRVM-VSKDQGRTWALRGAAN 214

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
           +P +  +  +P+     + ++ +L+R+ + GIG    S S D G +W+  K   L    S
Sbjct: 215 VPKDRRNCDEPMLVERQDGSVWLLVRTSSYGIGD---SVSTDAGRSWTEVK-DHLQQTTS 270

Query: 240 GIDGVKLKDGRLLL-AYNTVS-----RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
                +L  G LLL  +N +      R  L   LS DDG SW   L L+    +  SYP 
Sbjct: 271 RFHIQRLASGNLLLIKHNGIDERSRGRNHLTAYLSADDGKSWKGGLLLDTRDMV--SYPD 328

Query: 294 VIQASDGSVHITYTYNRTQIKHVVL 318
             QA DG++++ Y +NR   KH++L
Sbjct: 329 ATQAPDGTIYVIYDWNRADEKHILL 353


>gi|266620455|ref|ZP_06113390.1| putative BNR/Asp-box repeat-containing domain protein [Clostridium
           hathewayi DSM 13479]
 gi|288867941|gb|EFD00240.1| putative BNR/Asp-box repeat-containing domain protein [Clostridium
           hathewayi DSM 13479]
          Length = 389

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 73/286 (25%), Positives = 121/286 (42%), Gaps = 48/286 (16%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQK-----WSGCMKRSYNKGVTWSERE 125
           ++   +P   +++P L+  P   L +FY   +         W     +     +TWS   
Sbjct: 80  VVEHPDPECRIYDPNLWIAPDKKLWMFYTQARGFNDGRSGVWVTVCDQPDTDPLTWSAPR 139

Query: 126 QLPPGILGPSKNKPILLENG--LLLCGSSVESWNS-----------WGSWMEVTVDAGRS 172
           ++  GI+    NKPI+   G  L  C    ++  S             S +  + D G++
Sbjct: 140 RIANGIM---MNKPIITTKGEWLFPCAIWCDTSGSVPAERHGLEQEQFSNVYASSDKGKT 196

Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT 232
               G   IPN S    + +     + +L +L+R+F+GIG    S S DGG TW+  + +
Sbjct: 197 ISLRGHADIPNRSFD--ENMIVEKKDGSLWMLVRTFDGIGE---SFSTDGGYTWTPGQKS 251

Query: 233 QLLNPNSGIDGVKLKDGRLLLAYNTV--------------------SRGVLKVALSKDDG 272
            +  P S     +LK GRLLL  +                       R  L   LS+DDG
Sbjct: 252 HIDGPCSRFHISRLKSGRLLLINHYQFDQRIDLEDIMHQGNVKKWKGRSHLSALLSEDDG 311

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            +W  +L L+E    E SYP   +A DG +++TY + R   + +++
Sbjct: 312 QTWPYSLLLDER--NEVSYPDAKEADDGFIYVTYDHERVTEREILM 355


>gi|398879007|ref|ZP_10634110.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM67]
 gi|398197553|gb|EJM84530.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM67]
          Length = 432

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/386 (23%), Positives = 144/386 (37%), Gaps = 86/386 (22%)

Query: 11  VAEEFTFPANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-- 62
           VAE+   P  ++ F S       H+S +  +  G  +  +F G+ EGA DV++    F  
Sbjct: 45  VAEQVAKPIYTSRFASSGLVDFVHSSAVTALPDGSLMTVWFAGTREGAADVQVRSARFDA 104

Query: 63  KDGRW---------QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--- 110
           + G W         QS  +   +    + NPV+   P N L LFY +   +  W+G    
Sbjct: 105 QTGEWGAEQVLATRQSTQLGTGKYIRKLGNPVVALAPDNRLWLFY-VSVSMGGWAGSAVN 163

Query: 111 MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWM 163
           +  S + G  WS   QL   I  P        ++ P+   +G +      E    +  ++
Sbjct: 164 VMVSEDYGRQWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFSEYL 220

Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
            ++ D G    K+   +  N     + P+      R + +L  + N   RV  + + D G
Sbjct: 221 YLSAD-GEVIDKFRISHGNNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAG 276

Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTL 281
            TWS   P    NPNS +  +   +  LL+A N +  G  K++L   D   D W     L
Sbjct: 277 QTWSEPYPLDPSNPNSSLAAIATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDL 336

Query: 282 -------------------------------------------------EENLAMEFSYP 292
                                                            E+  + E+ YP
Sbjct: 337 DKSPDPEGDTFSPEAYKEIIGREFRGSSGALRQPLEAQFLNNLDSRVCKEQACSFEYEYP 396

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
             I++ DG  H+ Y++N T IKHV  
Sbjct: 397 YFIRSPDGMYHLVYSWNNTFIKHVTF 422


>gi|424825293|ref|ZP_18250280.1| hypothetical protein CAB1_0654 [Chlamydophila abortus LLG]
 gi|333410392|gb|EGK69379.1| hypothetical protein CAB1_0654 [Chlamydophila abortus LLG]
          Length = 357

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 138/304 (45%), Gaps = 23/304 (7%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP 80
           S+  ++   STIVE   G  LV +     +     K+  +    GRW  P     + +  
Sbjct: 31  SSDNRTSGMSTIVETG-GRLLVVW-----QDCDSNKLIGRRCDQGRWSEPYEIFPQASGL 84

Query: 81  MWNPVLFKLPSNGLLLFYK---IGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKN 137
             +PVL ++ S  + LFY+   +G+ + K    +  S+N GV W E   LPPGI G ++N
Sbjct: 85  FSHPVLVRISSQEIWLFYRKQQVGESISK--PYVAFSFNSGVHWLEHRLLPPGIEGTTRN 142

Query: 138 KPIL--LENGLLLCGSSV-ESWNS---WGSWMEVTVDAGRSWR-KYGPIYIPNESLSVIQ 190
            P +  + N + +   SV ES +      SW+E+     RSW  + GPI   + +   I+
Sbjct: 143 PPYVDRIRNRVYIPSFSVYESPDQELIGDSWVEIYNFFSRSWEGRMGPILGGSRNQVSIE 202

Query: 191 PVPFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           P      +  L +  R +F     +C+S S + G++WS  K     + ++ I     ++G
Sbjct: 203 PAIVKLTSNKLALFFRNAFVEESYICLSVSTNQGISWSKLKNLPWRSYDNSISVQSGRNG 262

Query: 250 RLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTY 308
           +L L  N+   G  L    S ++G +W+    ++   +++   P+      G +H+ YT 
Sbjct: 263 KLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSID---PSTYLDRQGCMHLVYTS 319

Query: 309 NRTQ 312
              Q
Sbjct: 320 KDAQ 323


>gi|398882671|ref|ZP_10637637.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM60]
 gi|398198338|gb|EJM85296.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM60]
          Length = 432

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/386 (23%), Positives = 144/386 (37%), Gaps = 86/386 (22%)

Query: 11  VAEEFTFPANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF-- 62
           VAE+   P  ++ F S       H+S +  +  G  +  +F G+ EGA DV++    F  
Sbjct: 45  VAEQVAKPIYTSRFASSGLVDFVHSSAVTALPDGSLMTVWFAGTREGAADVQVRSARFDA 104

Query: 63  KDGRW---------QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC--- 110
           + G W         QS  +   +    + NPV+   P N L LFY +   +  W+G    
Sbjct: 105 QTGEWGAEQVLATRQSTQLGTGKYIRKLGNPVVALAPDNRLWLFY-VSVSMGGWAGSAVN 163

Query: 111 MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWM 163
           +  S + G  WS   QL   I  P        ++ P+   +G +      E    +  ++
Sbjct: 164 VMVSEDFGRQWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFSEYL 220

Query: 164 EVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGG 223
            ++ D G    K+   +  N     + P+      R + +L  + N   RV  + + D G
Sbjct: 221 YLSAD-GEVIDKFRISHGNNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAG 276

Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTL 281
            TWS   P    NPNS +  +   +  LL+A N +  G  K++L   D   D W     L
Sbjct: 277 QTWSEPYPLDPSNPNSSLAAIATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDL 336

Query: 282 -------------------------------------------------EENLAMEFSYP 292
                                                            E+  + E+ YP
Sbjct: 337 DKSPDPEGDTFSPEAYKEIIGREFRGSSGALRQPLEAQFLDNLDSRVCKEQACSFEYEYP 396

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
             I++ DG  H+ Y++N T IKHV  
Sbjct: 397 YFIRSPDGMYHLVYSWNNTFIKHVTF 422


>gi|405379113|ref|ZP_11033017.1| putative neuraminidase (sialidase) [Rhizobium sp. CF142]
 gi|397324370|gb|EJJ28731.1| putative neuraminidase (sialidase) [Rhizobium sp. CF142]
          Length = 394

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 142/368 (38%), Gaps = 85/368 (23%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G   W  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLADGTLTCVWFGGTMEGMGDISIYMSRLAPGTDCWSEPEKMSDDPAK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL---- 132
              NP++F  P   + L Y   Q      G + +   S + G +++     PP IL    
Sbjct: 95  SEQNPLIFNAPDGRVWLLY-TSQTSGDQDGAVVKCRISEDGGKSFA-----PPTILCDLP 148

Query: 133 GPSKNKPILLENG-------LLLCGS-SVESWNSWGSWMEVTV--DAGRSWRKYGPIYIP 182
           G    + I++ NG       +  C S   + W+       V +  D G+SW+    +   
Sbjct: 149 GTFVRQQIVV-NGKGDWLLPIFRCISLPGQRWSGDADTAGVLISHDGGKSWQ----MKEI 203

Query: 183 NESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
             SL  +    VP    +R +      ++   R  +S   D G TWS  +PT L N NS 
Sbjct: 204 EGSLGAVHMNIVPLE-GDRMIAFFRDRYSENIRASLS--LDQGETWSAPEPTNLPNNNSS 260

Query: 241 IDGVKLKDGRLLLAYNT--------------------------------------VSRGV 262
           I  V L D R+ + YN                                       V R  
Sbjct: 261 IQAVGLTDRRIAMVYNHSNAASSDARRHSLYDEIESDKAPVAEISSMPARKAIWGVPRAP 320

Query: 263 LKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
           L +A+S D G  +     L+            + L  EFSYP++ Q  DG++H+ YTY R
Sbjct: 321 LSLAVSTDTGGYFERKFDLDTGDGYCLSNNSKDMLNREFSYPSITQGPDGALHVAYTYYR 380

Query: 311 TQIKHVVL 318
             IK+V L
Sbjct: 381 RAIKYVRL 388


>gi|398994208|ref|ZP_10697120.1| hypothetical protein PMI22_01711 [Pseudomonas sp. GM21]
 gi|398132879|gb|EJM22131.1| hypothetical protein PMI22_01711 [Pseudomonas sp. GM21]
          Length = 432

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 100/390 (25%), Positives = 149/390 (38%), Gaps = 87/390 (22%)

Query: 3   KDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF 62
           KD   K +    F   A+S      H+S I  +  G  +  +F GS EGA DV++    F
Sbjct: 46  KDEAAKPIYTSRF---ASSGLVDFVHSSAITALPDGSLMAVWFAGSREGAADVQVRSARF 102

Query: 63  K--DGRW-QSPIIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC- 110
               G W +  ++A  +            + NPV+   P N L LFY +   +  W+G  
Sbjct: 103 NVSTGEWGEEQVMATRDSTQQSTRKYIRKLGNPVIALAPDNRLWLFY-VSVSMGGWAGSA 161

Query: 111 --MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGS 161
             +  S + GV WS   QL   I  P        ++ P+   +G +      E    +  
Sbjct: 162 VNVMVSDDFGVNWSAPRQL---ITSPFLNISTLVRSAPVFHVDGSIGLPVYHEFLGKFAE 218

Query: 162 WMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSES 219
           ++ ++ D G    KY  I   N SL   QP  VP     R + +L  +     RV  S +
Sbjct: 219 YLYLSAD-GEVIDKYR-ISHGNHSL---QPAVVPL-DGQRGIALLRYAGETHHRVLASRT 272

Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW-- 275
            D G TWS   P    NPNS +  V   +  +L+A N +  G  K++L   D   + W  
Sbjct: 273 EDAGQTWSEPYPLDPSNPNSSLAAVATPEHGMLVALNDLQEGRFKLSLYLTDAKMEDWRL 332

Query: 276 -HD-------------ALTLEENLAMEF-------------------------------- 289
            HD              L  +E + +EF                                
Sbjct: 333 VHDLDKSPDPEGDPFSPLAYKEIIGIEFRGSSGKLRQPLEAQFLANLDNRVCKNDRCEFE 392

Query: 290 -SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
             YP  I++ DG  H+ Y++N T IKHV  
Sbjct: 393 YEYPYFIRSPDGMYHLVYSWNNTFIKHVTF 422


>gi|424854452|ref|ZP_18278810.1| glycosyl hydrolase [Rhodococcus opacus PD630]
 gi|356664499|gb|EHI44592.1| glycosyl hydrolase [Rhodococcus opacus PD630]
          Length = 412

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +F G+ EG PD+ +W          W  P+    +P  
Sbjct: 35  APQVQNHAANLTVLPDGSLGCVWFAGTQEGVPDISVWFSGLAPDADAWSEPVQLSGDPTR 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQE-VQKWSGCMKR-SYNKGVTWSEREQLPP--GILGPS 135
              NPVL    +  + L +       Q  +  M+R S + G TW E   L P     G  
Sbjct: 95  SEQNPVLHVTNTETVWLMWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETDAGGVF 154

Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
             +P++ L  G LL          G        + S M ++ D G SWR+   + +P  S
Sbjct: 155 VRQPVVALPTGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDQGNSWRE---VTVPG-S 209

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           +  +       A+ TL  L RS      +  S S D G TW+  + T+L N NS I  V 
Sbjct: 210 VGCVHMNIGRLADGTLVALYRS-RWADSIYRSTSTDDGDTWTEPEATELPNNNSSIQFVV 268

Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
           L D RL L +N  S                                              
Sbjct: 269 LPDDRLALVFNESSAADATARRTSLYDEIDDDGLADGVPEQADTELEPLDDGDSRSAFWG 328

Query: 260 --RGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDGSVHIT 305
             R  + +A+S+D G +W         D   L  N    L  E+SYP++   +DG +H+ 
Sbjct: 329 APRAPMTLAMSEDGGRTWPVRRDIEIGDGYCLSNNSRDGLNREYSYPSITAGNDGRLHVA 388

Query: 306 YTYNRTQIKHVVL 318
           +T  R  I++V L
Sbjct: 389 FTRFRQAIEYVEL 401


>gi|331656696|ref|ZP_08357658.1| hypothetical protein ECKG_00503, partial [Escherichia coli TA206]
 gi|331054944|gb|EGI26953.1| hypothetical protein ECKG_00503 [Escherichia coli TA206]
          Length = 307

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 109/247 (44%), Gaps = 23/247 (9%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMK--RSYNKGVTWSEREQL--PPGILGPSKNKPI- 140
           LF  P N L L +       + +  ++  +S + G TW E   L   PG       +PI 
Sbjct: 93  LFLAPDNVLWLLWTAQISGNQDTAIVRYRKSDDLGQTWGEIATLLDKPGTF---IRQPIT 149

Query: 141 LLENG-----LLLCGSSV-ESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
           +L+NG     +  C +   E W  N   S ++++ D G SWR    + +P +SL  +   
Sbjct: 150 VLDNGNWLLPVFYCRTQPGEKWVGNDDISAVKISADGGHSWRD---VEVP-QSLGCVHMN 205

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                N TL  L RS      + +S S D G +WS  + T+L N NS I    L  G L 
Sbjct: 206 ITMLHNGTLVALFRS-RWADNIYISHSADNGESWSVPQATELPNNNSSIQVTTLASGELA 264

Query: 253 LAYNTVS 259
           L YN +S
Sbjct: 265 LVYNAMS 271


>gi|387824033|ref|YP_005823504.1| BNR/Asp-box repeat-containing protein [Francisella cf. novicida
           3523]
 gi|328675632|gb|AEB28307.1| BNR/Asp-box repeat protein [Francisella cf. novicida 3523]
          Length = 307

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/278 (26%), Positives = 122/278 (43%), Gaps = 34/278 (12%)

Query: 40  FLVAYFGGSCEGAPDVKIWLQTFKD--GRWQ----SPIIADEEPNV-------PMWNPVL 86
             + ++ G  E AP+ KI L T +   G+W       II  ++           + NP++
Sbjct: 8   LFITWYSGDQETAPNTKIMLATAEKVAGKWHFDEIRSIIDRQKFQSILKKHIHHLGNPII 67

Query: 87  FKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGL 146
           +       L+F          S  +  S + G TWS+    P  IL  +     +L    
Sbjct: 68  YSQGKKLWLVFTSSSGGWTTSSLNIMYSNDLGKTWSQ----PKTILSSN-----ILNFST 118

Query: 147 LLCGSSVESWNSWGS---WMEVTVDAGR--SWRKYGPIYIPNESLS---VIQPVPFHTAN 198
           L  GS++E  N+  +   + E     GR   + + G +   +E  +    +QP     + 
Sbjct: 119 LTRGSAIELDNNRFAIPVYKEFNNLNGRWFIFNQNGELIFVSEMTNDGVNLQPTVVPLSK 178

Query: 199 RTLRVLMRSFNG-IGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN- 256
                L R  +  I ++  +E+ D G++WS  KPTQL NP++GI  +K+++G +LLAYN 
Sbjct: 179 THALALYRQMHSPIKKIYTNETADSGISWSKVKPTQLDNPDAGIAAIKIQNG-ILLAYNN 237

Query: 257 -TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPA 293
            T SR  L +A   D+   W +  T       E SYPA
Sbjct: 238 ATNSRADLSLAFKADNSQQWQNIYTFPNKTKAELSYPA 275


>gi|226364119|ref|YP_002781901.1| hypothetical protein ROP_47090 [Rhodococcus opacus B4]
 gi|226242608|dbj|BAH52956.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 409

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQ--TFKDGRWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +F G+ EG PD+ +W    T     W  P+    +   
Sbjct: 32  APQVQNHAANLTVLPDGSLACVWFAGTQEGVPDISVWFSRLTPDADTWSEPVQLSHDSTR 91

Query: 80  PMWNPVLFKLPSNGL-LLFYKIGQEVQKWSGCMKR-SYNKGVTWSEREQLPPGIL--GPS 135
              NPVL    +  + LL+       Q  +  M+R S + G TW E   L P     G  
Sbjct: 92  SEQNPVLHVTNTGTVWLLWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETEAGGVF 151

Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
             +P+  L +G LL          G        + S M ++ D G SWR+   + +P  S
Sbjct: 152 VRQPVAALPSGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDDGDSWRE---VVVPG-S 206

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           +  +       A+ TL  L RS      +  S S D G TW+    T+L N NS +  V 
Sbjct: 207 VGCVHMNIGRLADGTLVALYRS-RWADSIYRSTSTDDGDTWTEPVATELPNNNSSVQFVV 265

Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
           L D RL L +N  S                                              
Sbjct: 266 LPDDRLALVFNESSAADATARRTSLYDEIDDDGLADEVPAAPDGEPEPLDDGDSRSAFWG 325

Query: 260 --RGVLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQASDGSVHIT 305
             R  + +A+S+D G +W     LE            + L  EFSYP++   +DG +H+ 
Sbjct: 326 APRAPMTLAISEDGGRTWPLRRDLETGDGYCLSNNSRDGLNREFSYPSITTGNDGRLHVA 385

Query: 306 YTYNRTQIKHVVL 318
           +T  R  I++V L
Sbjct: 386 FTRFRQAIEYVEL 398


>gi|171913862|ref|ZP_02929332.1| hypothetical protein VspiD_21825 [Verrucomicrobium spinosum DSM
           4136]
          Length = 936

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/260 (26%), Positives = 114/260 (43%), Gaps = 37/260 (14%)

Query: 86  LFKLPSNGLLLFYKIGQ-----EVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
           L+  P+  L LFY             W+   +    +  TWSE  ++  G+   + NKPI
Sbjct: 129 LWTDPTGKLWLFYDQSMGYYDGRAGVWAITCENPDAENPTWSEPRRIWHGL---TLNKPI 185

Query: 141 LLENGLLLCGSSVESWNSWG----------------SWMEVTVDAGRSWRKYGPIYIPNE 184
           +L+NG  L   S+ + +  G                + + V+ D G++W + G + +P  
Sbjct: 186 VLKNGEWLLPISLWTRDRIGPAELRDGFKELDEFRMANIFVSTDQGKTWNRRGGVAVPET 245

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
                  V        L ++ R+  GI     S S D G TW+ A+P+ + N +S     
Sbjct: 246 DFDEHMFVELKDGR--LWLMARTKYGIAE---SFSSDQGRTWTAAQPSTIQNVSSRFFFR 300

Query: 245 KLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQAS 298
           +L  G LLL  N         R  +   LS D+G SW   L ++E   +  SYP   QA 
Sbjct: 301 RLASGNLLLVKNGPLNERLKGRSHMTAYLSDDEGKSWKGGLIIDERGGV--SYPDGFQAP 358

Query: 299 DGSVHITYTYNRTQIKHVVL 318
           DG+++IT+ +NR + + ++ 
Sbjct: 359 DGTIYITHDHNRDEDREILF 378


>gi|449133855|ref|ZP_21769369.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula europaea 6C]
 gi|448887495|gb|EMB17870.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula europaea 6C]
          Length = 733

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 96/206 (46%), Gaps = 27/206 (13%)

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
           G + NKP +L NG  +   S++     G++                V+ D G +W + G 
Sbjct: 499 GVTLNKPTVLSNGDWMLPVSLDERGGLGAFRNCFQNLDPLRGANVFVSKDQGETWHRRGR 558

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              PN      + +     + +L +L R+  GI     S S DGG TWS A  +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRKGI---MESTSVDGGQTWSAAVASKVKHPN 613

Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           +     +L+ GRLLL  +  +      R  L   LS D+G +W   L L+E   +  SYP
Sbjct: 614 ARFHVRRLQSGRLLLIKHGEAIDSHRGRVALSAWLSDDEGQTWFGGLMLDERKGI--SYP 671

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
              Q+S+GS++I+Y  NR     V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697


>gi|71906197|ref|YP_283784.1| glycosyl hydrolase [Dechloromonas aromatica RCB]
 gi|71845818|gb|AAZ45314.1| Glycosyl hydrolase, BNR repeat [Dechloromonas aromatica RCB]
          Length = 375

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 4/124 (3%)

Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN- 256
           +R L VL  +  G  R  +S + +GG TW  A    L  P++ +  ++L  GRLLLA N 
Sbjct: 243 HRGLAVLRDNRAGTSRATLS-TTNGGQTWETASELALPAPDAPVALLRLASGRLLLAGNP 301

Query: 257 TVSRGVLKVALSKDDGDSWHDALTLE--ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
              +  L++ LS DDG +W     +E   +   EF+ PA++Q  DG +H+TYT+ + QI+
Sbjct: 302 QQGKEALQLWLSADDGQTWAMKRIVEAASDGGAEFADPALLQGRDGRIHLTYTWRQQQIR 361

Query: 315 HVVL 318
           +V  
Sbjct: 362 YVAF 365


>gi|412991299|emb|CCO16144.1| BNR/Asp-box repeat family protein [Bathycoccus prasinos]
          Length = 375

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 84/146 (57%), Gaps = 7/146 (4%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           ++ D GR+W+        +E+  +I+      +N ++  L R+   +G +  S S DGG 
Sbjct: 167 ISKDNGRTWKPSESSLTDSETW-LIEGTMAELSNSSVLHLFRT--SVGEIYKSLSHDGGR 223

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLE 282
           TW+ A  T L NPNS I  ++L DGRL LAYN     R  L VA+S+DDG++WH A  +E
Sbjct: 224 TWTKASSTGLPNPNSKIHLLRLADGRLALAYNHDRKYRANLYVAISEDDGENWHLAAIVE 283

Query: 283 E-NLAMEFSYPAVIQASDGSVHITYT 307
           + + +  ++YP +++  +  +++ Y+
Sbjct: 284 KGSESSMYAYPTMVE-DECKLYVAYS 308


>gi|452001230|gb|EMD93690.1| glycoside hydrolase family 33 protein [Cochliobolus heterostrophus
           C5]
          Length = 384

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 97/381 (25%), Positives = 146/381 (38%), Gaps = 103/381 (27%)

Query: 17  FPANSAPFKSC--HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPII 72
           F +N  P  +   HAS ++++  G  L A+FGGS EG  D+ I L   K G   W +P  
Sbjct: 17  FASNYIPASTVQVHASNLLQLPNGDLLCAWFGGSQEGLSDICIHLSRLKKGSRVWSTPQK 76

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL-- 127
             ++P     NPVLF +P    L      Q        + R   S ++G TWS    L  
Sbjct: 77  ISDDPERSEQNPVLFLVPGTAALWVMYTAQPAGNQDKAVVRYRVSSDEGQTWSLAHNLFN 136

Query: 128 --------PPGILGPS---------KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
                   P  +L            +N P     G    GS         S ++ T D G
Sbjct: 137 DEGLFIRQPVTVLKDGTWVLPAWYCRNPP-----GFRWVGSDDI------SVVKYTRDGG 185

Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLR----VLMRSFNGIGRVCMSESCDGGLTW 226
           ++W++ G   +PN   +V      H   + LR    +          + MS S +  + W
Sbjct: 186 KTWQEKG---VPNSIGAV------HMNIKALRDGSYIAFYRSRWADNIYMSTSTN-CIDW 235

Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------------------------- 257
           +  KPT L NPNSGI    L D RL++ YN                              
Sbjct: 236 TEPKPTSLPNPNSGICFEVLSDDRLIMIYNDSQFEKGQAKREGLYDDITPKDDKRVNQRG 295

Query: 258 --------VSRGVLKVALSKDDGDSWHDALTLEENLAM------------EFSYPAV-IQ 296
                   V R ++ V +S D G +W  + TL+E                E SYP++ ++
Sbjct: 296 STKDAVWGVPRKIMTVGVSSDGGRTWK-SKTLQEGDGFCMTNNSRTKENRELSYPSIWVE 354

Query: 297 ASDGS-VHITYTYNRTQIKHV 316
             D   V++ +T+ R +IK +
Sbjct: 355 RGDKDVVNVAFTFWRQRIKFI 375


>gi|426409730|ref|YP_007029829.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. UW4]
 gi|426267947|gb|AFY20024.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. UW4]
          Length = 431

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 85/372 (22%), Positives = 138/372 (37%), Gaps = 80/372 (21%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
           A+S      H+S +  +  G  + A+F GS EGA DV++    F  K G W +  +    
Sbjct: 58  ASSGLVDFVHSSAVTALPDGSLMTAWFAGSREGAADVQVRSARFDAKTGEWGAERVLATR 117

Query: 77  PNV---------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
            +           + NPV+   P   L LFY +   +  W+G    +  S + G  WS  
Sbjct: 118 NSTQQGTQRYIRKLGNPVIALAPDKRLWLFY-VSVSMGGWAGSAVNVMVSDDFGENWSAP 176

Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
            QL   I  P        ++ P+   +G +      E    +  ++ ++ D G    KY 
Sbjct: 177 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GDVIDKYR 232

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
             +  N     + P+      R + +L  + N   RV  + + D G  WS   P +  NP
Sbjct: 233 ISHGKNSLQPTVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAGQNWSEPYPLEPWNP 289

Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN----------- 284
           NS +  V   +  LL+A N +  G  K++L   D   D+W     L+++           
Sbjct: 290 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDEKMDNWRSLRDLDKSPDPEGDPFSPQ 349

Query: 285 --------------------------------------LAMEFSYPAVIQASDGSVHITY 306
                                                    E+ YP  I++ DG  H+ Y
Sbjct: 350 AYKKIIGNEFRGSSGALRQPLEAQFLNNLDTRVCNPRGCEFEYEYPYFIRSPDGMYHLVY 409

Query: 307 TYNRTQIKHVVL 318
           ++N T IKHV  
Sbjct: 410 SWNNTFIKHVTF 421


>gi|432337240|ref|ZP_19588687.1| hypothetical protein Rwratislav_19854 [Rhodococcus wratislaviensis
           IFP 2016]
 gi|430775820|gb|ELB91296.1| hypothetical protein Rwratislav_19854 [Rhodococcus wratislaviensis
           IFP 2016]
          Length = 408

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 94/373 (25%), Positives = 140/373 (37%), Gaps = 82/373 (21%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV 79
           AP    HA+ +  +  G     +F G+ EG PD+ +W          W  P+   ++P  
Sbjct: 31  APQVQNHAANLTVLPDGSLGCVWFAGTQEGVPDISVWFSRLAPDADAWSEPVQLSDDPTR 90

Query: 80  PMWNPVLFKLPSNGL-LLFYKIGQEVQKWSGCMKR-SYNKGVTWSEREQLPP--GILGPS 135
              NPVL    +  + LL+       Q  +  M+R S + G TW E   L P     G  
Sbjct: 91  SEQNPVLHVTNTETVWLLWTSQHAGNQDTARVMRRISADGGRTWGEAHTLLPETDAGGVF 150

Query: 136 KNKPIL-LENGLLLC---------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
             +P++ L  G LL          G        + S M ++ D G SWR+   + +P  S
Sbjct: 151 VRQPVVALPTGRLLLPVFHCVRIEGRKWVGDRDYSSVM-ISDDHGDSWRE---VTVPG-S 205

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
           +  +       A  TL  L RS      +  S S D G TW+  + T+L N NS I  V 
Sbjct: 206 VGCVHMNIGRLAGGTLVALYRS-RWADSIYRSVSTDDGDTWTEPEATELPNNNSSIQFVV 264

Query: 246 LKDGRLLLAYNTVS---------------------------------------------- 259
           L   RL L +N  S                                              
Sbjct: 265 LPGDRLALVFNESSAANATARRTSLYDEIDDDGLADGVPEQADPELEPLDDGDSRSAFWG 324

Query: 260 --RGVLKVALSKDDGDSW--------HDALTLEEN----LAMEFSYPAVIQASDGSVHIT 305
             R  + +A+S+D G +W         D   L  N    L  E+SYP++   +DG +H+ 
Sbjct: 325 APRAPMTLAMSEDGGRTWPVRRDIEIGDGYCLSNNSRDGLNREYSYPSITAGNDGRLHVA 384

Query: 306 YTYNRTQIKHVVL 318
           +T  R  I++V L
Sbjct: 385 FTRFRQAIEYVEL 397


>gi|407362319|ref|ZP_11108851.1| hypothetical protein PmanJ_00935 [Pseudomonas mandelii JR-1]
          Length = 432

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 140/372 (37%), Gaps = 80/372 (21%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADE 75
           A+S      H+S +  +  G  +  +F GS EGA DV++    F  K G W +  ++A  
Sbjct: 59  ASSGLVDFVHSSAVTALPDGSLMTVWFAGSREGAADVQVRSARFDSKTGEWGAEQVLATR 118

Query: 76  EPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
           +            + NPV+   P N + LFY +   +  W+G    +  S + G  WS  
Sbjct: 119 DSTQQGTRKYIRKLGNPVVALAPDNRVWLFY-VSVSMGGWAGSAVNVMVSDDFGEHWSAP 177

Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
            QL   I  P        ++ P+   +G +      E    +  ++ ++ D G    K+ 
Sbjct: 178 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GDVIDKFR 233

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
             +  N     + P+      R + +L  + N   RV  + + D G TWS   P    NP
Sbjct: 234 ISHGMNSLQPAVVPL---DGQRGIALLRYAGNTHHRVLATRTEDAGQTWSEPYPLDPSNP 290

Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE------------- 282
           NS +  V   +  LL+A N +  G  K++L   D   D W     L+             
Sbjct: 291 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDLDKSPDPLGAPFSPE 350

Query: 283 --------------------------ENL----------AMEFSYPAVIQASDGSVHITY 306
                                     +NL            E+ YP  I++ DG  H+ Y
Sbjct: 351 AYKEIIGREFRGSSGARRHSLEAEFLDNLDNRVCKTQGCEFEYEYPYFIRSPDGMYHLVY 410

Query: 307 TYNRTQIKHVVL 318
           ++N T IKHV  
Sbjct: 411 SWNNTFIKHVTF 422


>gi|417305276|ref|ZP_12092248.1| hypothetical protein RBWH47_02271 [Rhodopirellula baltica WH47]
 gi|327538370|gb|EGF25042.1| hypothetical protein RBWH47_02271 [Rhodopirellula baltica WH47]
          Length = 114

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 44/111 (39%), Positives = 58/111 (52%), Gaps = 26/111 (23%)

Query: 236 NPNSGIDGVKLKDGRLLLAYNTVS-------------RGVLKVALSKDDGDSWHDALTLE 282
           NPNSG D V L+DGR +L YN                R +L +A+S D G +W   LTLE
Sbjct: 3   NPNSGTDAVTLQDGRQVLIYNHSEGLVKRKDAPDLKPRRILNLAISSD-GKTWKPVLTLE 61

Query: 283 ENLAM------------EFSYPAVIQASDGSVHITYTYNRTQIKHVVLLPN 321
                            E+SYPA+IQ SDG +++ YTYNR  +KH V+ P+
Sbjct: 62  HETGPHPKDPERRRHFGEYSYPAIIQTSDGMLNMVYTYNREGVKHAVVDPS 112


>gi|443643049|ref|ZP_21126899.1| BBNR/Asp-box repeat-containing glycosyl hydrolase [Pseudomonas
           syringae pv. syringae B64]
 gi|443283066|gb|ELS42071.1| BBNR/Asp-box repeat-containing glycosyl hydrolase [Pseudomonas
           syringae pv. syringae B64]
          Length = 432

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 135/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + GV+WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGVSWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|398945574|ref|ZP_10671821.1| hypothetical protein PMI27_05672 [Pseudomonas sp. GM41(2012)]
 gi|398156586|gb|EJM45002.1| hypothetical protein PMI27_05672 [Pseudomonas sp. GM41(2012)]
          Length = 433

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 141/370 (38%), Gaps = 80/370 (21%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADE 75
           A+S      H+S +  +  G  +  +F GS EGA DV++    F  K G W +  ++A  
Sbjct: 60  ASSGLVDFVHSSAVTALPDGSLMTVWFAGSREGAADVQVRSARFDPKTGEWGAEKVLATR 119

Query: 76  EPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSER 124
           +            + NPV+   P N + LFY +   +  W+G    +  S + G +WS  
Sbjct: 120 DSTQQGTRKYIRKLGNPVVALAPDNRVWLFY-VSVSMGGWAGSAVNVMVSDDFGESWSAP 178

Query: 125 EQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYG 177
            QL   I  P        ++ P+   +G +      E    +  ++ ++ D G    K+ 
Sbjct: 179 RQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GNVIDKFR 234

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
             +  N     + P+      R + +L  + +   RV  + + D G TWS   P    NP
Sbjct: 235 ISHGNNSLQPTVVPL---DGQRGIALLRYAGDTHHRVLATRTEDAGQTWSEPYPLDPSNP 291

Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE------------- 282
           NS +  V   +  LL+A N +  G  K++L   D   D W     L+             
Sbjct: 292 NSSLAAVATPEHGLLVALNDLQEGRFKLSLYGTDAKMDDWRSLRDLDKSPDPAGAPFSPE 351

Query: 283 --------------------------ENL----------AMEFSYPAVIQASDGSVHITY 306
                                     +NL            E+ YP  I++ DG  H+ Y
Sbjct: 352 AYKEIIGREFRGSSGALRQPLEAQFLDNLDNRVCKSQGCEFEYEYPYFIRSPDGMYHLVY 411

Query: 307 TYNRTQIKHV 316
           ++N T IKHV
Sbjct: 412 SWNNTFIKHV 421


>gi|391231583|ref|ZP_10267789.1| hypothetical protein OpiT1DRAFT_04191 [Opitutaceae bacterium TAV1]
 gi|391221244|gb|EIP99664.1| hypothetical protein OpiT1DRAFT_04191 [Opitutaceae bacterium TAV1]
          Length = 581

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 94/188 (50%), Gaps = 11/188 (5%)

Query: 137 NKPILLENGLLLCGSS-VESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
           NKP +L +G  +  SS ++S          + + G+S+   G + +P ++ +  +     
Sbjct: 199 NKPTVLSDGTWVWPSSPLKSPRGLPVRPLYSNNNGKSFFYKGAVPVPPQTRNADEYQITE 258

Query: 196 TANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
             +++L +L R   GIG    S S DGG TW+   P+Q+ +  S     +L+ G+LLL  
Sbjct: 259 RKDKSLWLLNRMKYGIGE---SFSTDGGTTWTEMAPSQIAHITSRFYITRLQSGKLLLVK 315

Query: 256 N-----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
           +      V R  L   LS DDG++W   L ++E    + SYP   QA DG ++I Y Y R
Sbjct: 316 HGAIKSNVGRSQLMAFLSDDDGNNWTGGLVIDER--NDLSYPDGTQAKDGRIYIIYDYQR 373

Query: 311 TQIKHVVL 318
            + K +++
Sbjct: 374 HREKEILM 381


>gi|399518684|ref|ZP_10759638.1| BNR/Asp-box repeat protein [Pseudomonas pseudoalcaligenes CECT
           5344]
 gi|399113178|emb|CCH36196.1| BNR/Asp-box repeat protein [Pseudomonas pseudoalcaligenes CECT
           5344]
          Length = 425

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/377 (23%), Positives = 142/377 (37%), Gaps = 84/377 (22%)

Query: 18  PANSAPFKS------CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQS 69
           PA +A F S       H+++I  +  G  + A+F G+ EGA DV+I    F    G+W +
Sbjct: 45  PAMAAHFASSDLEDFVHSASITGLPGGDLMAAWFAGTREGAADVQIRAARFDASSGQWSA 104

Query: 70  P-IIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNK 117
             ++A  E            + NPV+   P N L LFY +   +  W+G       S + 
Sbjct: 105 ERVLASRESTQHAVGKHIRKLGNPVISLAPDNRLWLFY-VSVSLGGWAGSAVNAMVSDDL 163

Query: 118 GVTWSEREQLPPGILGPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           G TWS   QL   +  P  N        P+   +G +      E    +  ++ ++ +  
Sbjct: 164 GDTWSAPRQL---VTSPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFPEYLHLSANG- 219

Query: 171 RSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
                 G   I     S+   V      R + +L  S     RV  S S D G  WS  +
Sbjct: 220 ---EVLGKHRIGKGRYSLQPTVVALDDKRAVALLRYSGEEHHRVLASYSEDTGRNWSQPE 276

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEEN---- 284
           P +  NPNS +  V   DG LL+A N +  G  ++ L   D +  +W +   L+E+    
Sbjct: 277 PVEPSNPNSSLAAVGRPDGSLLVAMNDLEDGRFRLTLYATDANLSNWRNLGELDESPNPW 336

Query: 285 -------------------------------------------LAMEFSYPAVIQASDGS 301
                                                       + ++ YP  ++  +G+
Sbjct: 337 GEPIARDEFPAVVSEKYLASGGKPEQQATFLERVSARMCSDDGCSFDYEYPYFVRDREGA 396

Query: 302 VHITYTYNRTQIKHVVL 318
            H+ Y++N   IKH+  
Sbjct: 397 YHLVYSWNDMFIKHLTF 413


>gi|330501037|ref|YP_004377906.1| BNR/Asp-box repeat-containing protein [Pseudomonas mendocina NK-01]
 gi|328915323|gb|AEB56154.1| BNR/Asp-box repeat-containing protein [Pseudomonas mendocina NK-01]
          Length = 426

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/362 (24%), Positives = 136/362 (37%), Gaps = 78/362 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSP-IIADEEPNV---- 79
            H+++I  +  G  + A+F GS EGA DV+I    F    G+W +  ++A  E       
Sbjct: 61  VHSASITGLPGGDLMAAWFAGSREGAADVQIRAARFDAASGQWSAERVLATRESTQRAVG 120

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L LFY +   +  W+G       S + G TWS   QL   + 
Sbjct: 121 KHIRKLGNPVISLAPDNRLWLFY-VSVSLGGWAGSAVNAMVSDDLGETWSAPRQL---VT 176

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ +        G   I    
Sbjct: 177 SPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFPEYLHLSTNG----EVLGKYRIGKGR 232

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            S+   V     NR + +L  +     RV  S S D G +WS  +P +  NPNS +  V 
Sbjct: 233 YSLQPTVVPLDENRAVALLRYAGEEHHRVLASYSEDAGRSWSRPQPVEPSNPNSSLAAVG 292

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG---------------DSW-----HD-------- 277
             DG LL+A N +  G  ++ L   D                + W     HD        
Sbjct: 293 RPDGSLLVAMNDLEDGRFRLTLYATDAKLGNWRTLGELDETPNPWGEPISHDEFPTVVSE 352

Query: 278 ------------ALTLE---------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                       A  LE         E    ++ YP   +  +G+ H+ Y++N   IKH+
Sbjct: 353 KYRASGGRPELQARFLEQVTARMCSAEGCTFDYEYPYFTRDREGTYHLVYSWNDMFIKHL 412

Query: 317 VL 318
             
Sbjct: 413 AF 414


>gi|229592500|ref|YP_002874619.1| hypothetical protein PFLU5114 [Pseudomonas fluorescens SBW25]
 gi|229364366|emb|CAY52136.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
          Length = 422

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 83/362 (22%), Positives = 135/362 (37%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV++    F  + G W +  ++A  E       
Sbjct: 58  VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDARSGEWGAEQVLATRESTRDGTQ 117

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P   L +FY +   V  W+     +  S + G +WS   QL   + 
Sbjct: 118 RYIRKLGNPVIALGPDKRLWMFY-VSVSVGGWATSAINLMVSDDLGASWSAPRQL---VT 173

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 174 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKNSL 232

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P+      R + +L  + +   RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 233 QPTIVPL---DGQRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVA 289

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEEN------------------- 284
            +D  LL+A N +  G  K++L   D     W   + L+E+                   
Sbjct: 290 TQDDGLLVALNDLQDGRFKLSLYGTDAQLSQWRTVVQLDESPDPLGQPFSPEAYKEIIGE 349

Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                                            E+ YP   ++ DG  H+ Y++N T IK
Sbjct: 350 GFRASSGARRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 409

Query: 315 HV 316
           HV
Sbjct: 410 HV 411


>gi|149177497|ref|ZP_01856100.1| hypothetical protein PM8797T_15456 [Planctomyces maris DSM 8797]
 gi|148843647|gb|EDL58007.1| hypothetical protein PM8797T_15456 [Planctomyces maris DSM 8797]
          Length = 938

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 95/207 (45%), Gaps = 28/207 (13%)

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSW--------------MEVTVDAGRSWRKYGP 178
           G + NKP +L  G  +   S++  + +G +              + V+ D G +W++ G 
Sbjct: 700 GVTLNKPTVLSKGEWMLPISLDQRSGFGPFKGCFQELDSQRGANVFVSTDQGATWQRRGA 759

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSY-AKPTQLLNP 237
              PN      + +     + +L +L R+  GI +     S DGG TW+   +P Q+  P
Sbjct: 760 AVFPNPDWH--EHMIVERKDGSLWMLARTRKGIMQTI---STDGGRTWANPTQPPQIRQP 814

Query: 238 NSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
           N+     +L  GRLLL  +         R  L   LS DDG +W   L ++E   +  SY
Sbjct: 815 NARFHIRRLASGRLLLIKHGDKIDSHQGRVQLSAWLSDDDGLTWQGGLVIDERKGI--SY 872

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
           P   QA DGS++I+Y  NR+    ++L
Sbjct: 873 PDGFQAPDGSIYISYDRNRSTDGEILL 899


>gi|422653452|ref|ZP_16716218.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           actinidiae str. M302091]
 gi|330966501|gb|EGH66761.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           actinidiae str. M302091]
          Length = 432

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/363 (25%), Positives = 134/363 (36%), Gaps = 82/363 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS----PIIADEEPNVP-- 80
            H+S +  +  G  +  +F GS EGA DV+I    F D R +      ++A  E      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRF-DARTEEWGGEQVLATRESTQTGT 125

Query: 81  ------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGI 131
                 + NPV+   P N L LFY +   V  W+G       S + G +WS   QL   +
Sbjct: 126 GKYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---V 181

Query: 132 LGPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
             P  N        P+   +G +      E    +  ++ ++ D G    K+      N 
Sbjct: 182 TSPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNS 240

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
               + P+    A   LR    +F+   RV  S + DGG TWS   P    NPNS +  V
Sbjct: 241 LQPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAV 297

Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL-------------- 279
            +    LL+A N +  G  K++L   D            D   D L              
Sbjct: 298 GVPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIG 357

Query: 280 --------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQI 313
                    L + +  EF                   YP  I++ DG  H+ Y++N T I
Sbjct: 358 QGFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFI 417

Query: 314 KHV 316
           KHV
Sbjct: 418 KHV 420


>gi|218659020|ref|ZP_03514950.1| putative glycosyl hydrolase protein [Rhizobium etli IE4771]
          Length = 324

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 69/158 (43%), Gaps = 51/158 (32%)

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------------- 257
           +  S S DGG +WS  +PT+L N NS I    L DG + + YN                 
Sbjct: 163 ILSSRSADGGESWSPPEPTELPNNNSSIQATVLDDGGIAMVYNHSNATMSDARRQSLYDE 222

Query: 258 -----------------------VSRGVLKVALSKDDGDSWHDALTLE------------ 282
                                  V R  L +A+S+D G ++   + L+            
Sbjct: 223 IEGDVAGESAAIVADAGRKAVWGVPRAPLSLAISRDGGRTFPQRMDLDTGDGFCLSNNSK 282

Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           ++L  EFSYP+VIQ SDG++H+ YTY R  IK+V L P
Sbjct: 283 DSLNREFSYPSVIQGSDGTLHVPYTYYRRAIKYVRLAP 320


>gi|66046737|ref|YP_236578.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           syringae B728a]
 gi|63257444|gb|AAY38540.1| Glycosyl hydrolase, BNR repeat protein [Pseudomonas syringae pv.
           syringae B728a]
          Length = 432

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/359 (24%), Positives = 131/359 (36%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQAGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRLSRGNNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE----------- 282
             LL+A N +  G  K++L   D                D      TLE           
Sbjct: 302 KGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGFR 361

Query: 283 -------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                    +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 362 SSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|167519867|ref|XP_001744273.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777359|gb|EDQ90976.1| predicted protein [Monosiga brevicollis MX1]
          Length = 332

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 116/262 (44%), Gaps = 34/262 (12%)

Query: 84  PVLFKLP-SNGLLLFY------------KIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
           P LF  P SN L LFY            K G+    + G   R   S + G TWS    +
Sbjct: 74  PSLFLQPDSNKLWLFYSESTQYDYRNDSKCGEYGHSYPGGEVRYRTSLDGGHTWSSATTI 133

Query: 128 ----PPGILGPSKNKPILLENGLLLCGSSVESWNSWG-----SWMEVTVDAGRSWRKYGP 178
                 G +       +L+ NG  L     E    +      S + ++ D G++W  YG 
Sbjct: 134 LTFQDRGHIAKVTANTMLVRNGRWLLPFWQEQHTVYDIGPGCSGVLISDDQGQNWTPYGC 193

Query: 179 IYIPNESLS-VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
           +   N S + +I+       N ++  L R+   IG +  S S D G TWS A+PT L NP
Sbjct: 194 V---NRSDTWLIENTLGTINNGSILQLFRT--QIGYLYASVSNDNGETWSNAQPTPLQNP 248

Query: 238 NSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
           ++ +    L+DGRL+LAYN     R  L +A ++ DG SW    TLE N++  + YP  +
Sbjct: 249 DAKVCMTNLRDGRLMLAYNPSKTQRLPLDLAYTR-DGVSWTHYATLESNMSTTYQYPTPV 307

Query: 296 QASDGSVHITYTYNRTQIKHVV 317
           Q  +  +       RT IK  +
Sbjct: 308 QLGNEVLTSYSADTRTGIKLAI 329


>gi|225420243|ref|ZP_03762546.1| hypothetical protein CLOSTASPAR_06586 [Clostridium asparagiforme
           DSM 15981]
 gi|225041060|gb|EEG51306.1| hypothetical protein CLOSTASPAR_06586 [Clostridium asparagiforme
           DSM 15981]
          Length = 395

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 79/331 (23%), Positives = 139/331 (41%), Gaps = 54/331 (16%)

Query: 31  TIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQS-----PIIADEEPNVPMWNPV 85
           +I   +KG   V +F G         + L T  D R ++      +I   +P   +++P 
Sbjct: 42  SIEVTEKGRIFVNFFSGQGAEVGGNFLMLCT-SDNRGETFRSCVTVIEHPDPLCRIYDPC 100

Query: 86  LFKLPSNGLLLFYKIGQEVQK-----WSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
           L+  P+  L + Y   +         W+   K      + W+   ++  GI+    NKPI
Sbjct: 101 LWISPAGELWMIYNQTKGFNDCRSGVWATVCKAPDAPRLEWTPPRRIANGIM---INKPI 157

Query: 141 LLENGLLL---------CGSSVESWNSWG----SWMEVTVDAGRSWRKYGPIYIPNESLS 187
           +  +G  L         CG+     +       S +  + D G ++   G   +P  S  
Sbjct: 158 VDASGDWLFPCAIWRDECGAPPTERHGLEQEQFSNVYASTDGGETFSLRGSADVPGRSFD 217

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
             + +     + TL +L+R+F GIG    S S DGG TW+  + + +  P S     +LK
Sbjct: 218 --EHMIVEKKDGTLWMLVRTFEGIGE---SFSHDGGRTWTPGRKSHIDGPCSRFHIRRLK 272

Query: 248 DGRLLLA-----------YNTVSRGVLK---------VALSKDDGDSWHDALTLEENLAM 287
            GRLL+             N + +G +K           LS+DDG +W   L L+E    
Sbjct: 273 SGRLLMINHFHFGERIDLNNILQQGNVKSWKGRSHLTAMLSEDDGQTWPYTLLLDER--N 330

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           + +YP   + +DG +++TY Y R + + +++
Sbjct: 331 DAAYPDAKEGADGYIYVTYDYERVKQREILM 361


>gi|422633915|ref|ZP_16699027.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           pisi str. 1704B]
 gi|330944496|gb|EGH46484.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           pisi str. 1704B]
          Length = 378

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 13  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 72

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 73  KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 131

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 132 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKF-RISRGNNSLQP 189

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 190 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 246

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 247 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 306

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 307 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 366


>gi|440724117|ref|ZP_20904455.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP34876]
 gi|440725576|ref|ZP_20905841.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP34881]
 gi|440358322|gb|ELP95694.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP34876]
 gi|440368513|gb|ELQ05546.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP34881]
          Length = 432

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|422297881|ref|ZP_16385507.1| BNR/Asp-box repeat-containing protein [Pseudomonas avellanae BPIC
           631]
 gi|407990587|gb|EKG32645.1| BNR/Asp-box repeat-containing protein [Pseudomonas avellanae BPIC
           631]
          Length = 432

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F  +   W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMAVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL   + 
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVSAMVSSDMGASWSAPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P+    A   LR    +F+   RV  S + DGG TWS   P    NPNS +  V 
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLSAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
           +    LL+A N +  G  K++L   D            D   D L               
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQ 358

Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
                   L + +  EF                   YP  I++ DG  H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|424068662|ref|ZP_17806111.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe013]
 gi|407996831|gb|EKG37287.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe013]
          Length = 432

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDEQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|422665529|ref|ZP_16725400.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           aptata str. DSM 50252]
 gi|330975946|gb|EGH76012.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           aptata str. DSM 50252]
          Length = 432

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGESWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|302188253|ref|ZP_07264926.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           syringae 642]
          Length = 432

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 133/360 (36%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G  WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGANWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GKGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|432893409|ref|ZP_20105421.1| hypothetical protein A31K_02554 [Escherichia coli KTE165]
 gi|431424389|gb|ELH06485.1| hypothetical protein A31K_02554 [Escherichia coli KTE165]
          Length = 240

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 86/207 (41%), Gaps = 54/207 (26%)

Query: 163 MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
           ++++ D G SWR    + +P +SL  +        + TL  L RS      + +S S D 
Sbjct: 31  VKISADGGHSWRD---VEVP-QSLGCVHMNITMLHDGTLVALFRS-RWADNIYISHSVDN 85

Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----------------------- 259
           G +WS  + T+L N NS I    L  G L L YN +S                       
Sbjct: 86  GESWSVPQATELPNNNSSIQVTTLASGELALVYNAMSAAGAVERRASLYDEIDDGDDSRK 145

Query: 260 --------------RGVLKVALSKDDGDSWHDALTLEE------------NLAMEFSYPA 293
                         R  + VA+S D G SW     L+E             L  EFSYP+
Sbjct: 146 EPTAVGRSAFWGAPRAPMTVAISADGGKSWPWRRNLDEGDGYCMTNNSLEKLNREFSYPS 205

Query: 294 VIQASDGSVHITYTYNRTQIKHVVLLP 320
           + Q+ DG++HI YT+ R  IK+V + P
Sbjct: 206 IKQSPDGTLHIAYTWWRQAIKYVRISP 232


>gi|421609054|ref|ZP_16050257.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula baltica SH28]
 gi|408500157|gb|EKK04613.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula baltica SH28]
          Length = 733

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 103/219 (47%), Gaps = 30/219 (13%)

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------V 165
           TWS   ++  G+   + NKP +L +G  +   S++     G + +              V
Sbjct: 489 TWSAPRRIWHGV---TLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFV 545

Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
           + D G +W + G    PN      + +     + +L +L R+ NGI     S S D G T
Sbjct: 546 SKDQGETWHRRGRATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKT 600

Query: 226 WSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDAL 279
           WS A  +++ +PN+     +L+ GRLLL  +  +      R  L   LS+D+G +W   L
Sbjct: 601 WSAAVASKIKHPNARFHVRRLQSGRLLLIKHGDAIDSHHGRVGLSAWLSEDEGQTWLGGL 660

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            L+E   +  SYP   Q+S+GS++I+Y  NR     V++
Sbjct: 661 MLDERKGI--SYPDGFQSSNGSIYISYDRNRATDGEVLM 697


>gi|424073080|ref|ZP_17810499.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe037]
 gi|407996670|gb|EKG37131.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe037]
          Length = 432

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHVDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GRGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|62185252|ref|YP_220037.1| hypothetical protein CAB639 [Chlamydophila abortus S26/3]
 gi|62148319|emb|CAH64086.1| hypothetical protein CAB639 [Chlamydophila abortus S26/3]
          Length = 293

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 67/262 (25%), Positives = 121/262 (46%), Gaps = 17/262 (6%)

Query: 63  KDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYK---IGQEVQKWSGCMKRSYNKGV 119
           ++GRW  P     + +    +PVL ++ S  + LFY+   +G+ + K    +  S+N GV
Sbjct: 3   REGRWSEPYEIFPQASGLFSHPVLVRISSQEIWLFYRKQQVGESISK--PYVAFSFNSGV 60

Query: 120 TWSEREQLPPGILGPSKNKPIL--LENGLLLCGSSV-ESWNSW---GSWMEVTVDAGRSW 173
            W E   LPPGI G ++N P +  + N + +   SV ES +      SW+E+     RSW
Sbjct: 61  HWLEHRLLPPGIEGTTRNPPYVDRIRNRVYIPSFSVYESPDQELIGDSWVEIYNFFSRSW 120

Query: 174 R-KYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKP 231
             + GPI   + +   I+P      +  L +  R+ +     +C+S S + G++WS  K 
Sbjct: 121 EGRMGPILGGSRNQVPIEPAIVKLTSNKLALFFRNASVEESYICLSVSTNQGISWSKLKN 180

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFS 290
               + ++ I     ++G+L L  N+   G  L    S ++G +W+    ++   +++  
Sbjct: 181 LPWRSYDNSISVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSID-- 238

Query: 291 YPAVIQASDGSVHITYTYNRTQ 312
            P+        +H  YT    Q
Sbjct: 239 -PSTYLDRQECMHHVYTSKDAQ 259


>gi|384251712|gb|EIE25189.1| neuraminidase [Coccomyxa subellipsoidea C-169]
          Length = 568

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 77/327 (23%), Positives = 130/327 (39%), Gaps = 59/327 (18%)

Query: 28  HASTIVEVDKGHFLVAYFGGS--CEGAPDVKIWLQTFKDG--RW-QSPIIADEEPNVPMW 82
           H +T+  +  G  +  +   +   EG  D  I+     D    W  + ++   + N+P W
Sbjct: 71  HMATLAHMPNGSIIAQWQAATKWFEGVADQAIYYAISNDHGLTWGPTQLMPGTQDNLPAW 130

Query: 83  NPVLFKLPSNGLLLFYKIGQEVQKW------------SGCMKRSYNKGVTWSEREQLPPG 130
            PVL+       L F +  Q  Q W            S   + S + G TW+    L   
Sbjct: 131 GPVLYSANGTMHLWFSRSHQPCQ-WLSESGVCWAPGGSLMYQTSADNGTTWTNATAL--- 186

Query: 131 ILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------------------------V 165
                   P   E G+     +  S+N+ G W+                          +
Sbjct: 187 -------DPWSAEGGIAKLACNGLSFNAAGEWLLPYWRELGGGQQCQKVPKLHGVAGVLI 239

Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTAN-RTLRVLMRSFNG-IGRVCMSESCDGG 223
           T D G++W+K   +   N +  +   +  +TA    +R+ ++ F    G    S S DGG
Sbjct: 240 TGDQGKTWKK-AEVPKDNSTWMIENAIAAYTAGVNGMRMQLQIFRTRTGSSMQSWSSDGG 298

Query: 224 LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV--SRGVLKVALSKDDGDSWHDALTL 281
             W++   + L NPNS +D + L D  L+ A+N     R  L++A+S+D G +W  +  +
Sbjct: 299 YNWTFPSNSTLPNPNSRMDVIALPDNVLVTAFNDSPNRRSPLRLAISRDGGRNWTRSALI 358

Query: 282 EENLAMEFSYPAVI-QASDGSVHITYT 307
           E+     F YPA++  A    V + YT
Sbjct: 359 EDEPDGSFHYPALLYDAVQDQVMVIYT 385


>gi|418055776|ref|ZP_12693830.1| hypothetical protein HypdeDRAFT_2330 [Hyphomicrobium denitrificans
           1NES1]
 gi|353210054|gb|EHB75456.1| hypothetical protein HypdeDRAFT_2330 [Hyphomicrobium denitrificans
           1NES1]
          Length = 394

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/316 (23%), Positives = 128/316 (40%), Gaps = 32/316 (10%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP------ 80
            H  +++E   G  L  ++    EGA + ++    F    W    +      V       
Sbjct: 67  VHGPSLIETPTG-LLAFWYRAVYEGAANAELVSARFDGDHWSPTAVVTNSNTVTRDIGLT 125

Query: 81  ---MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG---- 130
              + NPV F+   N + LF+     +  W+ C   + RS + G TWS  ++L       
Sbjct: 126 VKSLANPVPFRRSPNEIWLFFA-ASRLSGWATCEIVLIRSVDNGRTWSPAQRLYASPFLN 184

Query: 131 ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQ 190
           +   +K+ PI L    +    ++ +++       V +   R+ R    + + N      Q
Sbjct: 185 MSHLTKSLPIRLSGDRI----ALPAYHEMNRKYPVMLVLDRNGRVIDKLRMGNGGTVGYQ 240

Query: 191 PVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           P    T   T    +R  N     R+ MS + D G TW+   P  L NP   I  ++  D
Sbjct: 241 PAIVATGPTTAIAFVRRLNSFRPQRILMSHTVDAGRTWTPPTPIDLPNPGGPIAAIRYDD 300

Query: 249 GRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTL-EENLAME---FSYPAVIQASDGSV 302
            R+LLA+N        +K+A S  DG S     T+ ++N  +E    +YP +I++  G  
Sbjct: 301 TRILLAFNDDPDHESNVKLAFSNLDGTSLRRIGTVAQKNDRLEGDAVAYPFLIESEPGQF 360

Query: 303 HITYTYNRTQ--IKHV 316
            + ++    Q  I HV
Sbjct: 361 DVVFSRPPPQHAINHV 376


>gi|213968573|ref|ZP_03396715.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato T1]
 gi|213926506|gb|EEB60059.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato T1]
          Length = 432

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F  +   W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL   + 
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P+    A   LR    +F+   RV  S + DGG TWS   P    NPNS +  V 
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
           +    LL+A N +  G  K++L   D            D   D L               
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQ 358

Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
                   L + +  EF                   YP  I++ DG  H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|28869101|ref|NP_791720.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           tomato str. DC3000]
 gi|28852341|gb|AAO55415.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato str.
           DC3000]
          Length = 432

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F  +   W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL   + 
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P+    A   LR    +F+   RV  S + DGG TWS   P    NPNS +  V 
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
           +    LL+A N +  G  K++L   D            D   D L               
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEVYKEVIGQ 358

Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
                   L + +  EF                   YP  I++ DG  H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|422658889|ref|ZP_16721320.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. lachrymans
           str. M302278]
 gi|331017513|gb|EGH97569.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. lachrymans
           str. M302278]
          Length = 432

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 133/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F  +   W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAQAEEWGGEQVLATRESTQTGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL   + 
Sbjct: 127 KYIRKLGNPVIALAPDNPLWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 183 SPFLNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P+    A   LR    +F+   RV  S + DGG TWS   P    NPNS +  V 
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG-----------DSWHDAL--------------- 279
           +    LL+A N +  G  K++L   D            D   D L               
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEVYKEVIGQ 358

Query: 280 -------TLEENLAMEF------------------SYPAVIQASDGSVHITYTYNRTQIK 314
                   L + +  EF                   YP  I++ DG  H+ Y++N T IK
Sbjct: 359 GFRASSGALRQPMEAEFLSNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|417303900|ref|ZP_12090941.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain [Rhodopirellula baltica WH47]
 gi|327539850|gb|EGF26453.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain [Rhodopirellula baltica WH47]
          Length = 733

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 97/206 (47%), Gaps = 27/206 (13%)

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
           G + NKP +L +G  +   S++     G + +              V+ D G +W + G 
Sbjct: 499 GVTLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFVSKDQGETWHRRGR 558

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              PN      + +     + +L +L R+ NGI     S S D G TWS A  +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKTWSAAVTSKIKHPN 613

Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           +     +L+ GRLLL  +  +      R  L   LS+D+G +W   L L+E   +  SYP
Sbjct: 614 ARFHIRRLQSGRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGLMLDERKGI--SYP 671

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
              Q+S+GS++I+Y  NR     V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697


>gi|32476765|ref|NP_869759.1| hypothetical protein RB11055 [Rhodopirellula baltica SH 1]
 gi|32447311|emb|CAD77137.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 733

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 97/206 (47%), Gaps = 27/206 (13%)

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------VTVDAGRSWRKYGP 178
           G + NKP +L +G  +   S++     G + +              V+ D G +W + G 
Sbjct: 499 GVTLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFVSKDQGETWHRRGR 558

Query: 179 IYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              PN      + +     + +L +L R+ NGI     S S D G TWS A  +++ +PN
Sbjct: 559 ATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDSGKTWSAAVTSKIKHPN 613

Query: 239 SGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           +     +L+ GRLLL  +  +      R  L   LS+D+G +W   L L+E   +  SYP
Sbjct: 614 ARFHVRRLQSGRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGLMLDERKGI--SYP 671

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
              Q+S+GS++I+Y  NR     V++
Sbjct: 672 DGFQSSNGSIYISYDRNRATDGEVLM 697


>gi|373850784|ref|ZP_09593585.1| hypothetical protein Opit5DRAFT_1639 [Opitutaceae bacterium TAV5]
 gi|372476949|gb|EHP36958.1| hypothetical protein Opit5DRAFT_1639 [Opitutaceae bacterium TAV5]
          Length = 441

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 59/187 (31%), Positives = 91/187 (48%), Gaps = 25/187 (13%)

Query: 137 NKPILLENG-----LLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIPNESLSV 188
           NKP +L +G     ++L        +SW     ++ D G++W    KY P   P  + + 
Sbjct: 198 NKPTVLSDGTWFRPVVLGPEQSSRLHSW-----ISRDRGKTWEFLSKYEPSAPPEVARTW 252

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW---SYAKPTQLLNPNSGIDGVK 245
            +P+     + +L +L RS +GI ++   ES D G TW   SY    + +N    I   +
Sbjct: 253 CEPMVIERRDGSLWMLFRSKDGIMQI---ESRDKGRTWENESYFTKERGINTRFFI--TR 307

Query: 246 LKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L+ GRLL   N    +R  L   LS+DDG +W   L L+E   +  SYP  +QA DGS++
Sbjct: 308 LQSGRLLFVVNDHPKARANLTAMLSEDDGKTWPHKLVLDERNPV--SYPDGMQAPDGSIY 365

Query: 304 ITYTYNR 310
           + Y   R
Sbjct: 366 VIYDRGR 372


>gi|398868058|ref|ZP_10623483.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM78]
 gi|398234577|gb|EJN20442.1| putative neuraminidase (sialidase) [Pseudomonas sp. GM78]
          Length = 432

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 144/387 (37%), Gaps = 83/387 (21%)

Query: 4   DCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK 63
           D  +K +    F   A+S      H++ +  +  G  +  +F G+ EGA DV++    F 
Sbjct: 47  DKEVKPIYTSRF---ASSGLVDFVHSAAVTALPDGSLMAVWFAGTREGAADVQVRSARFN 103

Query: 64  --DGRW-QSPIIADEEPNV--------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC-- 110
              G W +  ++A  +            + NPV+   P N L LFY +   +  W+G   
Sbjct: 104 LSTGEWGEEQVMATRDSTQQSTRKYIRKLGNPVIALAPDNRLWLFY-VSVSMGGWAGSAV 162

Query: 111 -MKRSYNKGVTWSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSW 162
            +  S + G  WS   QL   I  P        ++ P+   +G +      E    +  +
Sbjct: 163 NVMVSDDFGGHWSAPRQL---ITSPFLNISTLVRSAPVFHADGSIGLPVYHEFLGKFAEY 219

Query: 163 MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
           + ++ D G    K+   +  N     I P+      R + +L  + +   RV  S + D 
Sbjct: 220 LYLSAD-GVVIDKFRISHGNNSLQPAIVPL---DGQRGIALLRYAGDTHHRVLASRTEDA 275

Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW---HD 277
           G  WS   P    NPNS +  V   D  LL+A N +  G  K++L   D   D W   HD
Sbjct: 276 GQNWSEPYPLVPSNPNSSLAAVATPDRGLLVALNDLQEGRFKLSLYVTDAKMDDWRLLHD 335

Query: 278 ----------------------------------ALTLE------------ENLAMEFSY 291
                                             AL  +            +    E+ Y
Sbjct: 336 LDKSPDPEGDPYTPLVYKEIIGKQFRGSSGKLRQALEAQFLANLDKRVCKSQECEFEYEY 395

Query: 292 PAVIQASDGSVHITYTYNRTQIKHVVL 318
           P  I++ DG  H+ Y++N T IKHV  
Sbjct: 396 PYFIRSPDGMYHLVYSWNNTFIKHVTF 422


>gi|440745293|ref|ZP_20924589.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP39023]
 gi|440372969|gb|ELQ09747.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae
           BRIP39023]
          Length = 432

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 134/360 (37%), Gaps = 76/360 (21%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQ-SPIIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  +       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRDSTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGSSWSPPWQLVTTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 186 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFR-ISRGNNSLQP 243

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 244 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 300

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG---------------DSWHDALTLE---------- 282
              LL+A N +  G  K++L   D                D      TLE          
Sbjct: 301 GKGLLVALNDLREGRFKLSLYGTDEEMNVWRPLMDFDKSPDPLGTPFTLEAYKEVIGEGF 360

Query: 283 --------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                                     +    E+ YP  I++ DG  H+ Y++N T IKHV
Sbjct: 361 RSSSGALRQPMEEAFLSNLDQRVCSPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|325107979|ref|YP_004269047.1| hypothetical protein Plabr_1413 [Planctomyces brasiliensis DSM
           5305]
 gi|324968247|gb|ADY59025.1| hypothetical protein Plabr_1413 [Planctomyces brasiliensis DSM
           5305]
          Length = 429

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 14/166 (8%)

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
           G+ + V+ D G++W + G + +PN +   ++       + +L +L+R+  G     ++ S
Sbjct: 230 GASILVSTDEGKTWTRRGGVKMPNPNW--MEHRAIEREDGSLWMLVRTTKG---AMLTIS 284

Query: 220 CDGGLTWSYAK-PTQLLNPNSGIDGVKLKDGRLLLAY------NTVSRGVLKVALSKDDG 272
            DGG +WS  + P  + +P +     KLK GR LL        N   R  L   LS D+G
Sbjct: 285 QDGGKSWSTPEFPEGIKHPAARFFLEKLKSGRWLLVKHGKTIDNHEGRSHLSAWLSADEG 344

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            +W   L L+E   +  SYP   QA DGS++I+Y + R    HV L
Sbjct: 345 KTWQGGLMLDERRGV--SYPDGFQADDGSIYISYDWQRGSKGHVNL 388


>gi|407694959|ref|YP_006819747.1| hypothetical protein B5T_01121 [Alcanivorax dieselolei B5]
 gi|407252297|gb|AFT69404.1| hypothetical protein B5T_01121 [Alcanivorax dieselolei B5]
          Length = 376

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 88/369 (23%), Positives = 144/369 (39%), Gaps = 76/369 (20%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRW--QSPIIAD 74
           A+ A     HA+++VE+  G     +F GS EGA DV I    F    G W  ++ ++  
Sbjct: 4   ASDAETALVHAASMVELPDGRLRAFWFAGSREGAQDVGIHSSVFDPAAGAWSDETTVVTR 63

Query: 75  EE------PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWS-ER 124
           ++       +V      +  L  NG +  + +      W+     +  S+++G +W  +R
Sbjct: 64  DQIREGWGRHVRKLGNAVPVLDDNGRMRLFVVAVSFGGWAASRLVVLESWDQGESWQFDR 123

Query: 125 EQLPPGILGPS---KNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
                  L  S   K  P+   +G +      E    +G  + V  D     R      I
Sbjct: 124 ALTTSPFLNISTLVKTPPVRFADGSVGLPVYHEMIGKFGEMLRVGEDN----RILDNARI 179

Query: 182 PNESLSVIQPVPF----HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
                  IQP+      H A   LR    S +  G +  S++ DGG  W+    + L NP
Sbjct: 180 -GHGRKAIQPLVLVDSPHRATAFLR--NESDDHPGMLYRSDTYDGGSEWTPLVSSGLPNP 236

Query: 238 NSGIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHDAL-----------TLEEN 284
           ++ + GV L +G  LL    N+V R  L +  ++D G +W D +            LEEN
Sbjct: 237 SAAVGGVALSEGHWLLVANCNSVERDDLCMRETRDSGRTWTDRIYFHNRVSWRGGDLEEN 296

Query: 285 LAM-----------------------------------EFSYPAVIQASDGSVHITYTYN 309
             +                                   ++ YP VIQ S+G +H+ YT+N
Sbjct: 297 RFLNMIGEELDGTFGVIRDQPYLQRVEHNKCFARGCEFQYDYPYVIQTSNGDLHVLYTWN 356

Query: 310 RTQIKHVVL 318
           ++ I+H  L
Sbjct: 357 KSAIRHAWL 365


>gi|171913941|ref|ZP_02929411.1| Glycosyl hydrolase, BNR repeat [Verrucomicrobium spinosum DSM 4136]
          Length = 631

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/319 (24%), Positives = 143/319 (44%), Gaps = 32/319 (10%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEV-DKGHFLVAYFGGSCEGAPDVKIWL---QTFKDGR 66
            A + TF    +P  +   S +V + D+   +  Y   S   +    +W+   +T   G+
Sbjct: 294 AARDITF---DSPHDTVLGSGLVLLKDRTTMMAVYSTPSSYYSKPGTVWIAARRTLDGGK 350

Query: 67  -WQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC-----------MKRS 114
            W+   +    P+    +P +  +  +G +  + +G +   W G              RS
Sbjct: 351 SWEMEQVVARHPDCQPSHPSVL-VSKDGTIHVFYLGFKKWAWHGVNPASDTASDIWTTRS 409

Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSS--VESWNSWGSWMEVTVDAGRS 172
            ++G TWS  +++  G  G + N  +   +G L+   S  V +     S + ++ D GRS
Sbjct: 410 KDQGSTWSVPQRIFTGYSG-ATNGAVETADGKLVVPFSHYVNNPGRLVSRVAISADGGRS 468

Query: 173 WRKYGPIYI--PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
           W    P+ +    +    ++P      +  + +L+R+   +     + S DGGL W+   
Sbjct: 469 WDLGAPLDMGGAGDHEGALEPAVLPLNDGRIWMLIRTTRKV--FWEAYSSDGGLNWTVPG 526

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLAME 288
           PT +   ++     +L+DGR+ L +N  S+G   L++ALS DDG SW   + L    ++ 
Sbjct: 527 PTTIDASSAPASLTRLRDGRVALVWNQRSKGRRELQLALSSDDGKSWSRPVPLVRGKSV- 585

Query: 289 FSYPAVIQA-SDGSVHITY 306
            +YP+VI+  SDG + I Y
Sbjct: 586 -TYPSVIEGPSDGDLWIGY 603


>gi|283782130|ref|YP_003372885.1| hypothetical protein Psta_4378 [Pirellula staleyi DSM 6068]
 gi|283440583|gb|ADB19025.1| hypothetical protein Psta_4378 [Pirellula staleyi DSM 6068]
          Length = 813

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 94/208 (45%), Gaps = 29/208 (13%)

Query: 133 GPSKNKPILLENGLLLCGSSVESWNSWG----------------SWMEVTVDAGRSWRKY 176
           G + NKP +L+NG  L   S+ + +  G                + + V+ D G +W + 
Sbjct: 202 GCTLNKPTVLKNGEWLLPVSLWTRDRIGPAILKEAHKNLDVIRMANVFVSTDEGATWTRR 261

Query: 177 GPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN 236
           G +  P       + +     +  L +L R+  GI     S S DGG +WS  +P+++ N
Sbjct: 262 GGVEFPGTDFD--EHMIVEKNDGRLWMLARTKQGISE---SFSSDGGKSWSKPEPSKIAN 316

Query: 237 PNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           P++     +L  G LLL  N         R  +   LS DDG +W   L +++    E S
Sbjct: 317 PSARFFIRRLTSGNLLLVKNGPLDVRLPRRSSMTAYLSTDDGATWPHQLLIDDR--AEVS 374

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YP  +++ DG +H+ Y +NR     ++L
Sbjct: 375 YPDAVESPDGVIHLLYDWNRHTDAEILL 402



 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 70/161 (43%), Gaps = 10/161 (6%)

Query: 165 VTVDAGRSWRKYGPIYI-PNESLSV-IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
           ++ D G++W ++G I + P+E      +      AN  + +++R+    G +  +ES DG
Sbjct: 641 ISSDGGKTWTEHGNIRLTPDERYHGWAENNIVELANGRIAMIIRADRLGGMLFYAESKDG 700

Query: 223 GLTW-SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDAL 279
           G TW  YA  T++ NP S      L    + + +N  S  R  L + +S D   +W    
Sbjct: 701 GKTWPEYASVTRVPNPGSKATLYPLGGNNVAMLHNPNSKHRSPLSLWISFDGMKTWPYQR 760

Query: 280 TLE----ENLAMEFSYPAVIQASDGS-VHITYTYNRTQIKH 315
            L     +      +YP    + D   +H  +  NR +  H
Sbjct: 761 VLHPESVDGPQGRINYPDGFVSQDKQFLHFAFDDNRHRAVH 801


>gi|196232246|ref|ZP_03131100.1| hypothetical protein CfE428DRAFT_4266 [Chthoniobacter flavus
           Ellin428]
 gi|196223614|gb|EDY18130.1| hypothetical protein CfE428DRAFT_4266 [Chthoniobacter flavus
           Ellin428]
          Length = 782

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 118/286 (41%), Gaps = 45/286 (15%)

Query: 67  WQSP---IIADEEPNVPMWNPVLFKL---PSNGLLLF------YKIGQEVQKWSGCMKRS 114
           W  P   I   + PN P    ++  L   P+  L LF      Y  G++   +  C    
Sbjct: 97  WSKPRVVIDPTDPPNAPQRRALVGNLWTDPTGKLWLFFDQSLGYFDGRDGDWFITCDNPD 156

Query: 115 YNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSW------------ 162
            ++ V WS+  +   G    + NKP +L NG  L   ++ + +  GS             
Sbjct: 157 ADEPV-WSKPTRFADGC---TLNKPTVLSNGDWLLPVALWTRDHIGSASLKEAHHELDPI 212

Query: 163 ----MEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE 218
               +  + D G++W + G +  P         V        L +L R+  GI     S 
Sbjct: 213 RMANVFASTDQGKTWARRGGVAFPGSEFDEHMIVELRDGR--LWMLARTKKGISE---SF 267

Query: 219 SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDG 272
           S D G+TW    P+ + NP++     +L  G LLL  N         R  L   LS DDG
Sbjct: 268 STDHGVTWGEPHPSAIQNPSARFFIRRLASGNLLLVKNGPIETRLPRRSSLTAFLSTDDG 327

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            +W   L L++   +  SYP  +QA DG+++++Y +NR     ++L
Sbjct: 328 KTWGKGLLLDDRAVV--SYPDGVQAPDGTIYLSYDWNRHTDAEILL 371


>gi|218516178|ref|ZP_03513018.1| putative glycosyl hydrolase protein [Rhizobium etli 8C-3]
          Length = 300

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 68/158 (43%), Gaps = 51/158 (32%)

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------------- 257
           +  S S DGG  WS  +PT+L N NS I    L +G + + YN                 
Sbjct: 139 ILSSRSADGGENWSPPEPTELPNNNSSIQATVLDNGGIAMVYNHSNADMSDARRQSLYDE 198

Query: 258 -----------------------VSRGVLKVALSKDDGDSWHDALTLE------------ 282
                                  V R  L +A+S+D G ++   + L+            
Sbjct: 199 IEGDEAGENAAVVADAGRKAVWGVPRAPLSLAISRDGGRTFPHHMDLDTGDGFCLSNNSK 258

Query: 283 ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           ++L  EFSYP+VIQ SDG++H+ YTY R  IK+V L P
Sbjct: 259 DSLNREFSYPSVIQGSDGTLHVAYTYYRRAIKYVRLAP 296


>gi|412990692|emb|CCO18064.1| putative exo-alpha-sialidase [Bathycoccus prasinos]
          Length = 656

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 139/335 (41%), Gaps = 62/335 (18%)

Query: 25  KSCHASTI----VEVDKGHFLVAYFGGS-CEGAPDVKIWLQTFKD--GRWQSPIIADEEP 77
           K  H +TI    V   K  F VA+   S  EG     I +   +D    W  P       
Sbjct: 244 KYSHMATIGRWNVNATKKGFAVAWQSSSGIEGIRGQSILISYAEDVDQPWSKPKRVSSMQ 303

Query: 78  NV--PMWNPVLFKLPS--NGLLLFYKIGQEV-----QKWSGC-------------MKRSY 115
           NV   +W+PVLF  P+  NGL LFY   Q       +  SG              + +S 
Sbjct: 304 NVKTAVWSPVLFNPPNEENGLWLFYTESQNCLRPARENKSGSKIPPRFSPGGDVKVTKSR 363

Query: 116 NKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW--NSWGSWME--------- 164
           + G TWSE +++       S   P ++ N + +  +   +W    W    E         
Sbjct: 364 DGGKTWSEPQEVHAQSKEDSPGMPKVIANRITVH-TQTGNWVLPYWKEKAESNACRQPKS 422

Query: 165 -------VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
                  V+ D G+SW+  G I +    L  I+       + +L++  R+  G  +V  +
Sbjct: 423 PASHGVLVSKDQGKSWQPKGKIRVGGTWL--IEGAVVERRDYSLQLFFRTSKG--KVYTA 478

Query: 218 ESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGR-LLLAYNTVS--------RGVLKVALS 268
            S DGG TW+ A PT + NP+S +D     D R +++A+N  +        R  L+V++S
Sbjct: 479 FSWDGGDTWTGANPTTVPNPDSKMDATSFPDTREIVVAFNDATDKNKINKGRCRLRVSIS 538

Query: 269 KDDGDSWHDALTLEE-NLAMEFSYPAVIQASDGSV 302
            D G  +     LEE +L M   YP VI   D ++
Sbjct: 539 CDGGVGFKTITELEEGSLKMYIHYPTVITDGDRAI 573


>gi|440717316|ref|ZP_20897806.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula baltica SWK14]
 gi|436437502|gb|ELP31128.1| secreted protein containing Inosine/uridine-preferring nucleoside
           hydrolase domain protein [Rhodopirellula baltica SWK14]
          Length = 733

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 102/219 (46%), Gaps = 30/219 (13%)

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME--------------V 165
           TWS   ++  G+   + NKP +L +G  +   S++     G + +              V
Sbjct: 489 TWSAPRRIWHGV---TLNKPTVLSSGDWMLPVSLDERGGLGVFRDCFQNLDPLRGANVFV 545

Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
           + D G +W + G    PN      + +     + +L +L R+ NGI     S S DGG  
Sbjct: 546 SKDRGETWHRRGRATFPNPDWH--EHMIVQRNDDSLWMLARTRNGI---MESTSVDGGKN 600

Query: 226 WSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHDAL 279
           WS A  +++ +PN+     +L+  RLLL  +  +      R  L   LS+D+G +W   L
Sbjct: 601 WSAAVASKIKHPNARFHIRRLQSSRLLLIKHGDAIDAHHGRVGLSAWLSEDEGQTWLGGL 660

Query: 280 TLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            L+E   +  SYP   Q+S+GS++I+Y  NR     V++
Sbjct: 661 MLDERKGI--SYPDGFQSSNGSIYISYDRNRATDGEVLM 697


>gi|416015491|ref|ZP_11563065.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. B076]
 gi|416028065|ref|ZP_11571190.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. race 4]
 gi|320325049|gb|EFW81118.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. B076]
 gi|320327846|gb|EFW83852.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. race 4]
          Length = 432

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|298157627|gb|EFH98707.1| BNR/Asp-box repeat protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
          Length = 432

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|422598475|ref|ZP_16672736.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           lachrymans str. M301315]
 gi|330988753|gb|EGH86856.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 432

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|289648789|ref|ZP_06480132.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 2250]
          Length = 432

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|307107037|gb|EFN55281.1| hypothetical protein CHLNCDRAFT_134209 [Chlorella variabilis]
          Length = 410

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 73/145 (50%), Gaps = 7/145 (4%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           ++ D+G +W+  G I   +    ++ P     +   L ++ R+    GR  MS S D G+
Sbjct: 250 ISRDSGATWKASGDIE--DARTWLVNPTLEEGSKGQLIMMFRT--STGRTYMSSSADKGV 305

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS--RGVLKVALSKDDGDSWHDALTLE 282
           TWS      L NPNS    V + DG++L  +N     R  L +ALS +D  SW     +E
Sbjct: 306 TWSRPTYNSLPNPNSPFSTVTI-DGQVLCVFNNSQTIRAPLALALSVNDCKSWEPLAMVE 364

Query: 283 ENLAMEFSYPAVIQASDGSVHITYT 307
           E+    FS P++++ +D +V + YT
Sbjct: 365 EDATGNFSCPSIVEWADDTVKVAYT 389


>gi|395500535|ref|ZP_10432114.1| hypothetical protein PPAM2_30810 [Pseudomonas sp. PAMC 25886]
          Length = 417

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV++    F  K   W    ++A  E       
Sbjct: 53  VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEQVLATRESTRDGTH 112

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L +FY +   V  W+     +  S + G  WS   QL   I 
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTHWSAPRQL---IT 168

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVVDKFRISRGKNSL 227

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P       R + +L  +     RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
            +   LL+A N +  G  K++L   D   + W   + L+++                   
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDANLNQWRTVVELDQSPDPLGQPFSPEAYKEIIGE 344

Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                                            E+ YP   ++ DG  H+ Y++N T IK
Sbjct: 345 GFRESSGARRLPLEQRFLSSLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404

Query: 315 HV 316
           HV
Sbjct: 405 HV 406


>gi|410089991|ref|ZP_11286592.1| BNR/Asp-box repeat-containing protein [Pseudomonas viridiflava
           UASWS0038]
 gi|409762627|gb|EKN47637.1| BNR/Asp-box repeat-containing protein [Pseudomonas viridiflava
           UASWS0038]
          Length = 432

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 86/362 (23%), Positives = 131/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSP-IIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F      W S  ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAATEEWGSEHVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGIL 132
                + NPV+   P   L LFY +   V  W+G       S + G +WS   QL   + 
Sbjct: 127 KYIRKLGNPVIALAPDKRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 183 SPFLNISTLVRAAPVFHADGSIGLPVYHEFLGKFAEYLYLSPD-GDVIDKFRISRGKNSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              + P+    A   LR    + +   RV  S + DGG TWS   P    NPNS +  V 
Sbjct: 242 QPTVVPLDGQRAVALLRYAGETHH---RVLASRTEDGGQTWSEPHPLSPSNPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLE--------------------- 282
           +    LL+A N +  G  K++L   D   D W   + L+                     
Sbjct: 299 VPGKGLLVALNDLREGRFKLSLYGTDEQMDIWRPVIDLDKSPDPLGTPFSLEAYKEVIGE 358

Query: 283 ----------------------------ENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                                       +    E+ YP  I++ +G  H+ Y++N T IK
Sbjct: 359 GFRSSSGARRQPMEAEFLSNLDQRVCSPQGCDFEYEYPYFIRSPEGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|395799123|ref|ZP_10478405.1| hypothetical protein A462_27693 [Pseudomonas sp. Ag1]
 gi|395336810|gb|EJF68669.1| hypothetical protein A462_27693 [Pseudomonas sp. Ag1]
          Length = 417

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV++    F  K   W    ++A  E       
Sbjct: 53  VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEQVLATRESTRDGTH 112

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L +FY +   V  W+     +  S + G  WS   QL   I 
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTNWSAPRQL---IT 168

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSPD-GAVVDKFRISRGKNSL 227

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P       R + +L  +     RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
            +   LL+A N +  G  K++L   D   + W   + L+++                   
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDAKLNQWRTVVELDQSPDPLGQPFSPEAYKEIIGE 344

Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                                            E+ YP   ++ DG  H+ Y++N T IK
Sbjct: 345 GFRASSGALRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404

Query: 315 HV 316
           HV
Sbjct: 405 HV 406


>gi|421139204|ref|ZP_15599246.1| BNR/Asp-box repeat protein [Pseudomonas fluorescens BBc6R8]
 gi|404509579|gb|EKA23507.1| BNR/Asp-box repeat protein [Pseudomonas fluorescens BBc6R8]
          Length = 417

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 83/362 (22%), Positives = 131/362 (36%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRW-QSPIIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV++    F  K   W    ++A  E       
Sbjct: 53  VHSSSVTALPGGDLMAVWFAGSREGAADVQVRTARFDAKSAEWGPEHVLATRESTRDGTH 112

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P N L +FY +   V  W+     +  S + G  WS   QL   I 
Sbjct: 113 RYIRKLGNPVIALAPDNRLWMFY-VSVSVGGWATSAINVMVSDDLGTNWSAPRQL---IT 168

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      N  
Sbjct: 169 SPFFNISTLVRAAPVFHADGSIGVPVYHEFMGKFAEYLYLSPD-GAVVDKFRISRGKNSL 227

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P       R + +L  +     RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 228 QPTIVP---QDGQRAIAMLRYAGETHHRVLASRTEDAGQTWSEPYPLEPSNPNSSLAAVS 284

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN------------------- 284
            +   LL+A N +  G  K++L   D   + W   + L+++                   
Sbjct: 285 TEGDGLLVALNDLQDGRFKLSLYGTDAKLNQWRTVVELDQSPDPLGQPFSPAAYKEIIGE 344

Query: 285 ------------------------------LAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                                            E+ YP   ++ DG  H+ Y++N T IK
Sbjct: 345 GFRASSGALRLPLEQRFLSNLDYRVCKPRGCDFEYEYPYFSRSPDGMYHLVYSWNNTFIK 404

Query: 315 HV 316
           HV
Sbjct: 405 HV 406


>gi|422584317|ref|ZP_16659428.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 0893_23]
 gi|330869135|gb|EGH03844.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           aesculi str. 0893_23]
          Length = 432

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 87/359 (24%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GELIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+    A   LR    +F+   RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|431927356|ref|YP_007240390.1| hypothetical protein Psest_2228 [Pseudomonas stutzeri RCH2]
 gi|431825643|gb|AGA86760.1| hypothetical protein Psest_2228 [Pseudomonas stutzeri RCH2]
          Length = 431

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 86/366 (23%), Positives = 138/366 (37%), Gaps = 88/366 (24%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKI-----------WLQTFKDGRWQSPIIADE 75
            HAS++  +  G  +  +F GS EGA DV+I           W +       +S   A  
Sbjct: 67  VHASSVTSLPGGKLMAVWFAGSREGAADVQIRGARYDALTAEWGEELVLATRESTQQATR 126

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPP--- 129
           +    + N V+   P N L LFY +   +  W+G       S + G +WS  +QL     
Sbjct: 127 KYIRKLGNSVIALAPDNRLWLFY-VSVSIGGWAGSAINTMYSDDFGQSWSTPKQLITTPF 185

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVD---AGRSWRKYGPIYIPNES 185
             I    +N P+   +G +      E    +  ++ ++      G+S      I +  +S
Sbjct: 186 LNISTLVRNAPVFHRDGTIGLPVYHEFLGKFAEYLYISPTGDVVGKS-----RISMGTDS 240

Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIG--RVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
           L   QP  VP    N    + M  + G+   RV  S + D G TWS   P    NPNS +
Sbjct: 241 L---QPTVVPMDEQNA---IAMLRYAGVTHHRVLASRTEDAGRTWSKPFPIDPANPNSAL 294

Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEEN--------------- 284
             V   +  LL+A N +  G  ++ L + +   + W   + L+E+               
Sbjct: 295 AAVATPNHGLLVALNDLEDGRFRLRLMETNSSLELWKPVIDLDESPDPEGNPFSPQAYRE 354

Query: 285 ----------------------------------LAMEFSYPAVIQASDGSVHITYTYNR 310
                                                E+ YP  I+++DG  H+ Y++N 
Sbjct: 355 IIGDKFRLSSGPRRLSLVDEFLTNLDQRVCKTHGCDFEYEYPYFIRSADGLYHLVYSWNN 414

Query: 311 TQIKHV 316
           T IKHV
Sbjct: 415 TFIKHV 420


>gi|398960857|ref|ZP_10678356.1| hypothetical protein PMI25_00011 [Pseudomonas sp. GM30]
 gi|398153510|gb|EJM42009.1| hypothetical protein PMI25_00011 [Pseudomonas sp. GM30]
          Length = 431

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 135/362 (37%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV+I    F      W +  ++A  E       
Sbjct: 67  VHSSSVTALPGGDLMAVWFAGSREGAADVQIRTARFDANTAEWGAEQVLATRESTRDGTR 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P   L +FY +   V  W+     +  S + G  WS   QL   I 
Sbjct: 127 RYIRKLGNPVIALAPDQRLWMFY-VSVSVGGWATSAINLMVSDDFGRNWSVPRQL---IT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      +  
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKHSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P+      R + +L  + +   RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 242 QPTIVPL---DGKRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDGD--SWHDALTLEE-------------------- 283
             D  LL+A N +  G  K++L   D +  +W + + L++                    
Sbjct: 299 TGDNGLLVALNDLQDGRFKLSLYGTDANLSNWRNVIELDQSPDPLGQPFAPEAYRAIIGE 358

Query: 284 -------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                              NL            E+ YP   + SDG  H+ Y++N T IK
Sbjct: 359 GFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRDSDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|344997734|ref|YP_004800588.1| hypothetical protein SACTE_0092 [Streptomyces sp. SirexAA-E]
 gi|344313360|gb|AEN08048.1| conserved hypothetical protein [Streptomyces sp. SirexAA-E]
          Length = 396

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 78/326 (23%), Positives = 132/326 (40%), Gaps = 50/326 (15%)

Query: 33  VEVDKGHFLVA--YFGGSCEGAPDVKIWLQTFKDGR-WQSP--IIADEEPNVPMWNPVLF 87
           VE  +G  L A  Y G   E   +  +   +  DG  W  P  ++  ++P V +++P L+
Sbjct: 34  VERTRGGRLYANWYTGMETETGGNFVVITSSDDDGTTWTGPRFVVEHDDPEVRVYDPCLW 93

Query: 88  KLPSNGLLLFYKIGQE-----VQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILL 142
           + P+  L   +   ++     +  W        +    W+   ++  GI+    NKP +L
Sbjct: 94  RDPAGRLWATWNQSRDFFDGRIGVWVATSGNPDDDEPVWTAPRRIANGIM---MNKPTVL 150

Query: 143 ENGLLL-------CGSSVESWNSWG---SWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
            +G  L       C +  E         S + V+ D G ++   G   +PN S    + +
Sbjct: 151 ADGTWLFPAAIWACHTPTEEHALEAERFSNVYVSTDEGETFAYLGGADVPNRSFD--EHM 208

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
                +  L +L+R F+G+G    S S DGG TWS  + + +  P S     +L  GRLL
Sbjct: 209 VVEKRDGELWMLVRCFDGVGE---SFSKDGGRTWSPGRRSHIDGPCSRFHIRRLPSGRLL 265

Query: 253 LAYNT--------------------VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYP 292
           +  +                       R  L   LS DDG +W   L L++    + SYP
Sbjct: 266 MINHADFGDRRSREEIEEQGNVKEWKGRTNLTAFLSDDDGATWPHRLLLDDR--DDVSYP 323

Query: 293 AVIQASDGSVHITYTYNRTQIKHVVL 318
                +DG + + Y ++R   + V L
Sbjct: 324 DAAVGADGRIFVVYDHDRFGDRAVYL 349


>gi|171910491|ref|ZP_02925961.1| hypothetical protein VspiD_04945 [Verrucomicrobium spinosum DSM
           4136]
          Length = 650

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 117/283 (41%), Gaps = 39/283 (13%)

Query: 67  WQSPIIADEEPN-VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
           W  P+   + P    + +P+ +  P   L +FYK  Q   K +G  +  +    T++ R 
Sbjct: 91  WSKPVAVIQGPEGTAIADPLPWVDPQGRLWIFYK--QVTPKTAGQAESGFAG--TFAIRA 146

Query: 126 QLP---------PGILGPSKN---KPILLENGLLLCGSSVESWNSWGSWME-------VT 166
             P         P ++G       KP++L +G  L    + + ++W   M        VT
Sbjct: 147 DDPEKAEPQWTVPQLIGEGGTLFGKPLVLPDGRWLAPFFLSARSAWREQMAGKETGVIVT 206

Query: 167 VDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
            DAG +W   G   I  E  +  +       + ++ + +R+  G+     S S D G TW
Sbjct: 207 SDAGATWAWQGGTSIAPELRNFSEATLAPRRDGSILMAIRTQKGL---YESTSPDAGKTW 263

Query: 227 SYAKPTQLLNPNSGIDG--VKLKDGRLLLAYN----TVS----RGVLKVALSKDDGDSWH 276
           S A P    +P S      ++LK G  LL Y+    T S    R  L   LS D+G +W 
Sbjct: 264 SEAVPMAGFSPGSATRACLLRLKSGAFLLVYHEPETTASGKYPRAKLTAWLSDDEGRTWS 323

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
             L L+E      SYP   QA DG + ITY   R +     +L
Sbjct: 324 HKLLLDER--ARVSYPDATQAPDGRIFITYDLGRYEPADKAIL 364


>gi|424923654|ref|ZP_18347015.1| neuraminidase (sialidase) [Pseudomonas fluorescens R124]
 gi|404304814|gb|EJZ58776.1| neuraminidase (sialidase) [Pseudomonas fluorescens R124]
          Length = 431

 Score = 67.8 bits (164), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 135/362 (37%), Gaps = 80/362 (22%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV+I    +      W +  ++A  E       
Sbjct: 67  VHSSSVTALPGGDLMAVWFAGSREGAADVQIRTARYDANSAEWGAEQVLATRESTRDGTQ 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P   L +FY +   V  W+     +  S + G  WS   QL   I 
Sbjct: 127 RYIRKLGNPVIALAPDQRLWMFY-VSVSVGGWATSAINVMVSDDVGRNWSAPRQL---IT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P  N        P+   +G +      E    +  ++ ++ D G    K+      +  
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSAD-GAVIDKFRISRGKHSL 241

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
              I P+      R + +L  + +   RV  S + D G TWS   P +  NPNS +  V 
Sbjct: 242 QPTIVPL---DGRRAVAMLRYAGDTHHRVLASRTEDAGQTWSEPYPLEPANPNSSLAAVG 298

Query: 246 LKDGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE-------------------- 283
             D  LL+A N +  G  K++L   D   ++W + + L++                    
Sbjct: 299 TDDNGLLVALNDLQDGRFKLSLYGTDASLNNWRNVIELDQSPDPLGQPFAPDAYKAIIGE 358

Query: 284 -------------------NL----------AMEFSYPAVIQASDGSVHITYTYNRTQIK 314
                              NL            E+ YP   + SDG  H+ Y++N T IK
Sbjct: 359 GFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRGSDGLYHLVYSWNNTFIK 418

Query: 315 HV 316
           HV
Sbjct: 419 HV 420


>gi|110834776|ref|YP_693635.1| hypothetical protein ABO_1915 [Alcanivorax borkumensis SK2]
 gi|110647887|emb|CAL17363.1| conserved hypothetical protein [Alcanivorax borkumensis SK2]
          Length = 436

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 84/368 (22%), Positives = 135/368 (36%), Gaps = 73/368 (19%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
           A+ A  +  HA+++VE+  G     +F G+ EGA DV I    F  + G W   I++   
Sbjct: 52  ASDADTRMVHAASMVELPDGRLRAFWFAGTREGAADVSINSAIFDPQLGTWSDEIVSVTR 111

Query: 77  PNVPM-WNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSERE 125
             +   W   + KL         +G +  + +      W+     +  S + G +W    
Sbjct: 112 EQISQGWGRHIRKLGNALPVLDEDGRMRLFVVAVSFGGWAASRLVVLESSDMGASWEFDT 171

Query: 126 QLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
            L       I    K  PI   +G +      E    +G  + +  D     R  G   I
Sbjct: 172 SLVATPFLNISTLVKTPPIHYSDGTIGLPVYHEMIGKFGEILRLDRDN----RILGKARI 227

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFN--GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
                  IQP+     N      +R+ N    G +  S S DGG TW+  + + L NP+S
Sbjct: 228 -GHGRKAIQPLVLVDNNNHAVSFLRNENEPNNGYLYQSSSNDGGWTWTPLEYSPLDNPSS 286

Query: 240 GIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHD-------------------- 277
            + GV +  G  L+A   N   R  L +  ++D G  W D                    
Sbjct: 287 ALGGVAMGPGHWLVATNCNREERDDLCIRETRDSGQHWEDRWFFHNREQWRGNALEPGTF 346

Query: 278 -----------------ALTLE----------ENLAMEFSYPAVIQASDGSVHITYTYNR 310
                             L LE             A ++ YP +++  +G +H+ YT+N+
Sbjct: 347 VDLVDDEFAGETIPEDRRLLLERVKANKCHGDHGCAFQYDYPYMVRTRNGDLHMLYTWNK 406

Query: 311 TQIKHVVL 318
           T I+H  L
Sbjct: 407 TAIRHAWL 414


>gi|71734501|ref|YP_275606.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           phaseolicola 1448A]
 gi|71555054|gb|AAZ34265.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. phaseolicola
           1448A]
          Length = 432

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 85/359 (23%), Positives = 133/359 (37%), Gaps = 74/359 (20%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           + P+      R + +L  +   + RV  S + DGG TWS  +P    NPNS +  V +  
Sbjct: 245 VVPL---DGQRAVALLRYAGETLHRVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGVPG 301

Query: 249 GRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME----------- 288
             LL+A N +  G  K++L   D     W     L+++        A+E           
Sbjct: 302 KGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEGFR 361

Query: 289 -----------------------------FSY--PAVIQASDGSVHITYTYNRTQIKHV 316
                                        F Y  P  I++ DG  H+ Y++N T IKHV
Sbjct: 362 ASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 420


>gi|402703439|ref|ZP_10851418.1| hypothetical protein RhelC_04125 [Rickettsia helvetica C9P9]
          Length = 111

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 19/103 (18%)

Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTV-----SRGVLKVALSKDDGDS 274
            + G+ WS  +P  LLNP+S ID + L D  LLL YN V     SR +L VA S D+G +
Sbjct: 2   INNGIYWSNLEPVNLLNPDSAIDAINLGDDTLLLTYNRVINNHKSRNILSVATSYDEGLN 61

Query: 275 WHDALTLEEN------------LAMEFSYPAVIQASDGS-VHI 304
           WH  +T+  +            L+ E+SYPA+I + D + +H+
Sbjct: 62  WH-PITIRNSIYPEGDIEYSNILSEEYSYPAIIMSPDNNEIHV 103


>gi|373853013|ref|ZP_09595813.1| hypothetical protein Opit5DRAFT_3868 [Opitutaceae bacterium TAV5]
 gi|372475242|gb|EHP35252.1| hypothetical protein Opit5DRAFT_3868 [Opitutaceae bacterium TAV5]
          Length = 454

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 89/199 (44%), Gaps = 22/199 (11%)

Query: 128 PPGIL--GPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIP 182
           PP  L  G   NKP +L +G       +   +       V+ + G++W    K+ P   P
Sbjct: 196 PPQFLAYGVCINKPTVLSDGTWFRTVVLGPEDPSRLHFYVSRNKGKTWEFLSKHTPSAPP 255

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
             + +  +P+     + +L  L RS   I ++   ES DGG TW   K         GID
Sbjct: 256 EIAKAWCEPMVIERRDGSLWTLFRSERSIKQI---ESSDGGKTW---KNESDFTTERGID 309

Query: 243 G----VKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
                 +L+ GRLL   N    SR  L   LS DDG +W   L L+E  A+  SYP  +Q
Sbjct: 310 TRFFITRLQSGRLLFVVNDHPKSRANLTAMLSDDDGKTWSHKLVLDERNAV--SYPDGLQ 367

Query: 297 ASDGSVHITY---TYNRTQ 312
             DG+++I Y    YN+ Q
Sbjct: 368 TPDGNIYIIYDRGRYNKDQ 386


>gi|225163403|ref|ZP_03725721.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
 gi|224801991|gb|EEG20269.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
          Length = 434

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 86/183 (46%), Gaps = 17/183 (9%)

Query: 137 NKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR---KYGPIYIPNESLSVIQPVP 193
           NKP +L NG  L        +     + ++ D G++W+   K+ P   P  + +  + + 
Sbjct: 192 NKPTVLANGDWLRPVVASREDPARIQIYISRDKGKTWQFLSKHTPGQAPEIAKTYCESML 251

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG----VKLKDG 249
               + +L +L RS N I ++   ES D G +W         +   GID      +L+ G
Sbjct: 252 IERRDGSLWMLFRSKNSIKQI---ESFDSGKSWQNESD---FSTERGIDTRFFITRLQSG 305

Query: 250 RLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307
           +LLL  N    +R  L   LS DDG +W   L L+E   +  SYP  +QA DGS+++ Y 
Sbjct: 306 KLLLVINDHPKARANLTAMLSDDDGKTWPHKLVLDERNPV--SYPDGVQAPDGSIYVIYD 363

Query: 308 YNR 310
           + R
Sbjct: 364 HGR 366


>gi|70731208|ref|YP_260949.1| BNR/Asp-box repeat-containing protein [Pseudomonas protegens Pf-5]
 gi|68345507|gb|AAY93113.1| BNR/Asp-box repeat protein [Pseudomonas protegens Pf-5]
          Length = 431

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 134/370 (36%), Gaps = 96/370 (25%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNV---- 79
            H+S++  +  G  +  +F GS EGA DV++    F  + G W +  ++A  E       
Sbjct: 67  VHSSSVTALPGGDLMAVWFAGSREGAADVQVRSARFDAQSGEWGAEQVLATRESTRVGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPGIL 132
                + NPV+   P   L +FY +   V  W+     +  S + G +WS   QL   + 
Sbjct: 127 RYIRKLGNPVIALGPDQRLWMFY-VSVSVGGWATSAINLMVSDDLGRSWSAPRQL---VT 182

Query: 133 GPSKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAG--------RSWRKYG 177
            P  N        P+   +G +      E    +  ++ ++ D          R      
Sbjct: 183 SPFFNISTLVRAAPVFHADGSIGLPVYHEFMGKFAEYLYLSADGAVIDKFRISRGKHSLQ 242

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
           P  +P +              R + +L  + +   +V  S + D G TWS   P Q  NP
Sbjct: 243 PTIVPLDE------------RRAVAMLRYAGDTHHKVLASRTEDAGQTWSEPYPLQPSNP 290

Query: 238 NSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDG--DSW------------------HD 277
           NS +  V   D  LL+A N +  G  K++L   D     W                  H+
Sbjct: 291 NSSLAAVGTDDQGLLVALNDLQDGRFKLSLYGTDAGLSQWRPLVELDQSPDPLGQPFSHE 350

Query: 278 A-----------------LTLEENLA--------------MEFSYPAVIQASDGSVHITY 306
           A                 L LE+                  E+ YP   + +DG  H+ Y
Sbjct: 351 AYKDIIGEGFRASSGARRLPLEQRFLSNLDYRVCKPQGCDFEYEYPYFSRGTDGLYHLVY 410

Query: 307 TYNRTQIKHV 316
           ++N T IKHV
Sbjct: 411 SWNNTFIKHV 420


>gi|149916410|ref|ZP_01904929.1| BNR/Asp-box repeat protein [Roseobacter sp. AzwK-3b]
 gi|149809680|gb|EDM69534.1| BNR/Asp-box repeat protein [Roseobacter sp. AzwK-3b]
          Length = 373

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 136/313 (43%), Gaps = 34/313 (10%)

Query: 20  NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW---LQTFKDGRWQ-------- 68
           +  P +    S  + ++     + +F GS E  PDV+I    L+ ++ G WQ        
Sbjct: 52  DHTPGEGMAHSPAITINGDEVGILWFQGSSEAQPDVEIHRAVLRRYETG-WQAGPPEPYL 110

Query: 69  -SPIIADE-EPN---VPMWNPVLFKLPSNGLLLFYKIGQEVQKWS--GCMKRSYNKG-VT 120
            S  + D  EP    V + N V  +   + L   Y     V  W+         +KG   
Sbjct: 111 TSAALGDAFEPRQLVVTLGNTVQNEAAQDAL---YATVVSVGGWAMASVADVRMDKGNPV 167

Query: 121 WSEREQLPPGILGPSKNKPILLENGLLLCGS---SVESWNSWGSWMEVTVDAGRSWRKYG 177
           W+ +  L P +     N+ +L+++ ++       ++ ++   GS   V V      R   
Sbjct: 168 WARKLNLSPFL-----NRSMLVKSPMVAYSDGSMALPAYFEMGSTYGVLVRTDAQGRARD 222

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
              +  + +  IQP+            +R F+  G+V +S + DGG TWS A+ T + +P
Sbjct: 223 LRRMTGKRIKPIQPMIVVLDGARALAFLRDFDNSGQVYVSRTQDGGRTWSEAEATGIAHP 282

Query: 238 NSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
           ++ +  + +  G +L+A N    S  +L +ALS+D+G SW    TL+     +  YP + 
Sbjct: 283 SAPVAALPVSGGHILMAMNGGADSADLLHLALSEDEGGSWRVIHTLDRQPG-DARYPMLR 341

Query: 296 QASDGSVHITYTY 308
           +  DG + + Y++
Sbjct: 342 RLPDGQILLAYSH 354


>gi|365129132|ref|ZP_09340823.1| hypothetical protein HMPREF1032_02587 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363621763|gb|EHL72956.1| hypothetical protein HMPREF1032_02587 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 386

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 130/329 (39%), Gaps = 69/329 (20%)

Query: 40  FLVAYFGGSCEGAPDVKIWLQTFKDG---RWQSPIIADEEPNVPMWNPVLFKLPSNGLLL 96
           F+  Y G   E   ++ I   +   G   R  + ++   +P+  +++P L+  P   L +
Sbjct: 43  FVNFYSGQDAEVGGNIMILCISDDQGKTFRSCAAVVEHPDPSCRIYDPNLWIDPLGRLWM 102

Query: 97  FYKIGQEVQKWSGCMKRSYNKGVT--WSEREQLPPGILGPSKNKPILLENGLLLCGSSVE 154
            Y              R +N G +  W+   + P   + P  + P  + NG+++    V 
Sbjct: 103 TY-----------AQARGFNDGRSGVWAAICEQPDADV-PQWSAPRRIANGIMMNKPLVT 150

Query: 155 SWNSW----GSWME---------------------VTVDAGRSWRKYGPIYIPNESLSVI 189
           S N W      W +                      ++D G +    G   +P  S    
Sbjct: 151 SRNEWLFPCAIWRDDSGSVPTERHGLENEQFSNVYASIDEGHTIALRGHADVPARSFD-- 208

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           + +     + TL +L+R+F+GIG    S S DGG TWS  + + +  P S     +L  G
Sbjct: 209 EHMLVEKRDGTLWMLVRTFDGIGE---SFSSDGGRTWSPGQKSHIDGPCSRFFIRRLSSG 265

Query: 250 RLLLAYNT--------------------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
           RLL+  +                       R  L   LS+DDG +W   L L+E    E 
Sbjct: 266 RLLMINHYKFSHRIDLDDIMRQGNVKRWQGRSHLTAMLSEDDGATWPYTLLLDER--DEV 323

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SYP   +A DG +++TY + R   + +++
Sbjct: 324 SYPDAKEADDGYIYVTYDHQRVTKREILM 352


>gi|407801650|ref|ZP_11148494.1| hypothetical protein S7S_00727 [Alcanivorax sp. W11-5]
 gi|407025087|gb|EKE36830.1| hypothetical protein S7S_00727 [Alcanivorax sp. W11-5]
          Length = 434

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 88/377 (23%), Positives = 146/377 (38%), Gaps = 83/377 (22%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRW--QS 69
           +  F ++ A  ++ HA+++VE+  G     +F G+ EG+ DV I    F    G W  + 
Sbjct: 42  QLNFASDGATMQT-HAASMVELPDGRIRAFWFAGTREGSADVNIHSSVFDPATGDWTGEQ 100

Query: 70  PIIADEEPN------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVT 120
            ++  E+ +      V     V+  + S+G L  Y +      W+     +  S + G T
Sbjct: 101 VMLTREQVSEGLGRYVRKLGNVVPAVDSDGRLRLYMVVVSFGGWAASRLAVAESADDGST 160

Query: 121 WSEREQLPPGILGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSW 173
           W   + L   +  P        K  PI  E+G L      E    +G  + +  D     
Sbjct: 161 WRISDGL---VTSPFFNLSTLVKAPPIRYEDGSLGLPVYHEFAAKFGEVLRL--DEDNRI 215

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKP 231
            +   I    ESL   QP+       +    +R+ N    G +  S++ D G +WS    
Sbjct: 216 LERSRIGKLRESL---QPLVLVNGPDSAVAFLRNANNSRPGILWRSDTRDTGHSWSSLDD 272

Query: 232 TQLLNPNSGIDGVKL-KDGRLLLA-YNTVSRGVLKVALSKDDGDSWHDALTLEENLAM-- 287
            QL NP S + GV+L KD  LL+A  N V R  L +  ++D G +W     + ++ A+  
Sbjct: 273 GQLPNPGSAVGGVRLGKDHWLLVANNNAVERDDLYILETRDHGQTWQTLHAMHDDAALRN 332

Query: 288 ------------------------------------------------EFSYPAVIQASD 299
                                                           +F YP VI+AS+
Sbjct: 333 APVSVDGFRDMVRTALSSIDEGPTTKDQRDRVMQAAVRNNCRSDFCVSQFDYPYVIRASN 392

Query: 300 GSVHITYTYNRTQIKHV 316
           G +HI Y++ ++ I H 
Sbjct: 393 GDIHILYSWKKSLIAHA 409


>gi|422675853|ref|ZP_16735192.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
           syringae pv. aceris str. M302273]
 gi|330973566|gb|EGH73632.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
           syringae pv. aceris str. M302273]
          Length = 321

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 73/264 (27%), Positives = 109/264 (41%), Gaps = 25/264 (9%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFK--DGRWQSP-IIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 13  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQAGTG 72

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP--- 129
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 73  KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTTPF 131

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS- 187
             I    +  P+   +G +      E    +  ++ ++ D G    K+  I   N SL  
Sbjct: 132 LNISTLVRGAPVFHADGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKF-RISRGNNSLQP 189

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            + P+    A   LR    +F+   RV  S + DGG +WS   P    NPNS +  V + 
Sbjct: 190 TVVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQSWSEPYPLTPSNPNSSLAAVGVP 246

Query: 248 DGRLLLAYNTVSRGVLKVALSKDD 271
              LL+A N +  G  K++L   D
Sbjct: 247 GKGLLVALNDLREGRFKLSLYGTD 270


>gi|78211616|ref|YP_380395.1| hypothetical protein Syncc9605_0061 [Synechococcus sp. CC9605]
 gi|78196075|gb|ABB33840.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 415

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 144/361 (39%), Gaps = 77/361 (21%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
           HA  +  +        +  GS EG   + ++L     + GRW  P     +      NP+
Sbjct: 44  HAPQLCWIRPDLLGCVWMAGSGEGNAGMSVFLSLLGAEGGRWSEPQRISRDQERSEQNPL 103

Query: 86  LFKLPSNGLLLFYKIGQEVQ----------------KWSGCMK--RSYNKGV------TW 121
           LF   S+G L      Q V+                +W+  ++  R    G+      TW
Sbjct: 104 LFV--SDGCLHLIHSAQLVRDPEDRAAVDASSSFSMQWTAILRHQRLALDGLDPSDLETW 161

Query: 122 S-EREQLPPGILGP---SKNKPILLENG--LLLCGSSVESWNSWG---SWMEVTVDAGRS 172
           S E   +P  +       +N P  LENG  LL    S+E+  ++G   S M     +G+ 
Sbjct: 162 SAEAWSMPVDLFDDPAFCRNPPYPLENGHWLLPIYRSLEAGGAFGHDHSEMVRLDPSGQC 221

Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNG--IGRVCMSESCDGGLTWSYAK 230
             +  P  IP+ +  V   V    A+R    L++ F      ++  S S DGG TWS  +
Sbjct: 222 LDR--PFGIPDSTGRVHGSV---VASRGGEQLLQFFRSRLADQIYRSVSSDGGHTWSAPE 276

Query: 231 PTQLLNPNSGIDGVKLKDGRLLLAYNTV----------------------SRGVLKVALS 268
           PTQL N NS I    L  GRL + +N                        +R  L +A+S
Sbjct: 277 PTQLPNNNSSIQACWLASGRLAMIFNRFGFAPDPGASEEPLKWGEARWPRTRWPLSIAIS 336

Query: 269 KDDGDSW---HDALT-------LEENLAMEFSYPAVIQASDGSVHITYTY-NRTQIKHVV 317
            DDG  W    D  T       +  +L  + +YP +I+   G +H+ Y++  R  I++V 
Sbjct: 337 DDDGLQWPWIRDIDTGFGFCGPMNWDLNGQLAYPTLIEGQPGELHVAYSWAGRQAIRYVT 396

Query: 318 L 318
           L
Sbjct: 397 L 397


>gi|291276607|ref|YP_003516379.1| hypothetical protein HMU03750 [Helicobacter mustelae 12198]
 gi|290963801|emb|CBG39637.1| Putative inner membrane protein [Helicobacter mustelae 12198]
          Length = 382

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 38/321 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW---LQTFKDGRWQSPI-IAD------- 74
           S H+STI  +     LVA+F GS EGA DV I+   L++    +WQ P  I D       
Sbjct: 59  SAHSSTITPIRDNSLLVAFFAGSREGAADVAIYGSILESAYSKKWQQPFKILDRFMLMQD 118

Query: 75  -EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSY---NKGV----TWSEREQ 126
             E    + NPVL+ L     L  + +G  +  W+      Y   NKG      + +   
Sbjct: 119 SREYIAKLGNPVLYTL--GDTLHLFVVGVSIGGWATSKIYHYSAENKGAIPTFVFQKALH 176

Query: 127 LPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIP 182
           L P   I    + +P+ +       G  +  ++   +   + +  DA  +         P
Sbjct: 177 LSPFLNISNLVRTRPVEITLSTQEKGFILPIYHELATKYPINLVFDARGNLLMTSQ---P 233

Query: 183 NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
           N    ++QP     +  +  +  R+      +  +++CD  L +    PT L N ++ ++
Sbjct: 234 NHLNGLLQPSLTSISQTSCMLAYRANKKAKNILYTQTCDNNLKYGPLTPTNLKNYDNSLN 293

Query: 243 GVKLKDGRLLLAYNT----VSRGVLKVALSKDDGDSWHDALTLEENLAM---EFSYPAVI 295
              L + ++ L +NT     +RG  K+ LSK       + L   ++      E SYP   
Sbjct: 294 LFAL-NKQVYLLHNTPLHNSNRG--KLTLSKMQSPEVFEKLFDIDSTPTQKGEVSYPTAW 350

Query: 296 QASDGSVHITYTYNRTQIKHV 316
               G +H+TYT++R  I++V
Sbjct: 351 MDPHGILHVTYTFDRKFIRYV 371


>gi|418295150|ref|ZP_12907020.1| hypothetical protein PstZobell_17629 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
 gi|379066503|gb|EHY79246.1| hypothetical protein PstZobell_17629 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
          Length = 351

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 82/350 (23%), Positives = 133/350 (38%), Gaps = 84/350 (24%)

Query: 41  LVAYFGGSCEGAPDVKIWLQTFK--DGRWQSPII---------ADEEPNVPMWNPVLFKL 89
           +  +F GS EGA DV++    +    G W S ++         A ++    + NPV+   
Sbjct: 1   MAVWFAGSREGAADVQVRGARYDALTGEWGSELVLATRESTQEATQKHIRKLGNPVIALA 60

Query: 90  PSNGLLLFYKIGQEVQKWSGCM---KRSYNKGVTWSEREQLPP----GILGPSKNKPILL 142
           P N L LFY +   +  W+G       S + G TWS  +QL       I    +N P+  
Sbjct: 61  PDNKLWLFY-VSVSIGGWAGSAINAMYSDDLGETWSTPKQLITTPFLNISTLVRNAPVFH 119

Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDA---GRSWRKYGPIYIPNESLSVIQP--VPFHTA 197
            +G +      E    +  ++ ++      G+S    G        +  +QP  VP    
Sbjct: 120 SDGTIGLPVYHEFLGKFAEYLYISPSGDVIGKSRISKG--------MDSLQPTVVPMDE- 170

Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
            R + +L  +     +V  S + D G TWS   P    NPNS +  V   +  LL+A N 
Sbjct: 171 QRAIAMLRYAGAAHHKVLASRTEDAGRTWSEPYPIDPANPNSSLAAVATPNHGLLVALND 230

Query: 258 VSRGVLKVALSKDDG--DSWHDALTLEEN------------------------------- 284
           +  G  ++ L + +   + W     L+E+                               
Sbjct: 231 LEDGRFRLRLMETNSGLELWKPVFDLDESPDPEGNPFSPQAYREIIGDKFRLSSGARRLP 290

Query: 285 -----LA-------------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                LA              E+ YP  I+++DG  H+ Y++N T IKHV
Sbjct: 291 LVDEFLANLDRRVCSAHGCDFEYEYPYFIRSADGLYHLVYSWNNTFIKHV 340


>gi|451849312|gb|EMD62616.1| glycoside hydrolase family 33 protein [Cochliobolus sativus ND90Pr]
          Length = 369

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 146/360 (40%), Gaps = 76/360 (21%)

Query: 17  FPANSAPFKSC--HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPII 72
           F +N  P  +   HAS ++++  G  L A+FGGS EG  D+ I L   K G   W +P  
Sbjct: 17  FASNYIPASTVQVHASNLLQLPNGDLLCAWFGGSQEGLSDICIHLSRLKKGSRVWSTPQK 76

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPP 129
             ++      NPVLF +P    L      Q        + R   S + G TWS  + L  
Sbjct: 77  ISDDSERSEQNPVLFLVPGTTTLWVMYTAQPAVNQDKAIVRYRVSSDGGQTWSLAQNLFD 136

Query: 130 GILGPSKNKPILLENGLLLCGSSVESW---NSWG---------SWMEVTVDAGRSWRK-- 175
                 +     LE+G  +    + +W   N  G         S ++ T D G++W++  
Sbjct: 137 DEGLFIRQPVTFLEDGTWV----LPAWYCRNPPGFCWVGSDDISVVKYTRDGGKTWQEKE 192

Query: 176 ----YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
                G +++  ++L     + F+ +     + M +         S +C   + W+  KP
Sbjct: 193 VPNSIGAVHMNIKALLDGSYIAFYRSRWADNIYMST---------STNC---IDWTEPKP 240

Query: 232 TQLLNPNSGI---------------DGVKLKDGRLLLAYNT-------VSRGVLKVALSK 269
           T L NPNSGI               D +  +D + +    +       V R ++ V +S 
Sbjct: 241 TPLPNPNSGICFEFEEGQAKREGLYDDITPEDDKRINQRGSTRDAVWGVPRKMMTVGVSS 300

Query: 270 DDGDSWHD-ALTLEENLAM----------EFSYPA--VIQASDGSVHITYTYNRTQIKHV 316
           D G +W    L + +   M          E SYP+  V + ++  V++ +T+ R +IK +
Sbjct: 301 DGGRTWKSKTLQVGDGFCMTNNSRTKENRELSYPSIWVERNANDVVNVAFTFWRQRIKFI 360


>gi|420416897|ref|ZP_14916004.1| hypothetical protein HPNQ4044_0822 [Helicobacter pylori NQ4044]
 gi|393035410|gb|EJB36455.1| hypothetical protein HPNQ4044_0822 [Helicobacter pylori NQ4044]
          Length = 372

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 136/316 (43%), Gaps = 42/316 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  IG  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-IGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNES 185
              +    KNKP+   +G    G  +  ++   +   + +  D   + R+   +  PN  
Sbjct: 180 FLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNAL 232

Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +QP   PF    +   ++    +      M E+C     W     T L N N  ++ 
Sbjct: 233 NHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNL 288

Query: 244 VKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           + L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +  
Sbjct: 289 INL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSH 344

Query: 301 SVHITYTYNRTQIKHV 316
            + I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360


>gi|355576659|ref|ZP_09045914.1| hypothetical protein HMPREF1008_01891 [Olsenella sp. oral taxon 809
           str. F0356]
 gi|354816896|gb|EHF01411.1| hypothetical protein HMPREF1008_01891 [Olsenella sp. oral taxon 809
           str. F0356]
          Length = 187

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 77/163 (47%), Gaps = 27/163 (16%)

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +PN S   + P      +  L  LMRS      +  SES D G +W+   PT L N NSG
Sbjct: 2   VPN-SAGRVHPTVVELEDGHLVCLMRSREA-DWIYRSESTDFGDSWTEPVPTPLPNNNSG 59

Query: 241 IDGVKLKDGRLLLAYNTVS--------------RGVLKVALSKDDGDSWHDALTLE---- 282
           I  VKL  GR+ +AYN  S              R  + VALS+D   ++     +E    
Sbjct: 60  ICAVKLASGRVAVAYNHSSAPQAYGQKGAWPGLRCPVSVALSEDGCKTFPLIRHIERGQG 119

Query: 283 ----ENLA--MEFSYPAVIQASDGSVHITYTY-NRTQIKHVVL 318
               EN +  M++ YP ++QA+DG +H+ Y Y  R  +K V L
Sbjct: 120 YVGDENRSNNMQYEYPCIMQAADGMLHLAYAYETRKGVKWVTL 162


>gi|188527573|ref|YP_001910260.1| hypothetical protein HPSH_03975 [Helicobacter pylori Shi470]
 gi|188143813|gb|ACD48230.1| hypothetical protein HPSH_03975 [Helicobacter pylori Shi470]
          Length = 372

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+      +   +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDCLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|373849439|ref|ZP_09592240.1| hypothetical protein Opit5DRAFT_0294 [Opitutaceae bacterium TAV5]
 gi|372475604|gb|EHP35613.1| hypothetical protein Opit5DRAFT_0294 [Opitutaceae bacterium TAV5]
          Length = 378

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 97/204 (47%), Gaps = 29/204 (14%)

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSW---GSWMEVTVDA 169
           RS + G TWS+R   P      S + P+ L +G LL     E W      G W+  + D 
Sbjct: 166 RSGDGGKTWSDRYAAPCN----SPHGPVALRDGRLLYAGK-ELWTEEKKVGVWL--SADD 218

Query: 170 GRSWRKYGPIY-IPNESLSVIQPV-PFHTANRTLRVLMRSFNGIGRVCMS-ESCDGGLTW 226
           G +W++   +   P E+++    +     A+ TL V +R+ NG  R  +  ES DGG TW
Sbjct: 219 GLTWKRAAWLEPRPGETVTDYHELSAIEAADGTLIVQIRNHNGGRRETLQCESHDGGATW 278

Query: 227 SYAKPTQLLNPNSGIDG-----VKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHDAL 279
           +  +         G+DG     V+LKDGRL++ Y      RG+ +  +S D G +W +  
Sbjct: 279 TLPRAI-------GVDGFPSHLVRLKDGRLVMTYGWREAPRGI-RGRVSADHGRTWSEEF 330

Query: 280 TLEENL-AMEFSYPAVIQASDGSV 302
            L ++    +  YP  ++ +DG +
Sbjct: 331 VLTDDAPGWDVGYPGTVELADGQL 354


>gi|190894659|ref|YP_001984952.1| putative glycosyl hydrolase [Rhizobium etli CIAT 652]
 gi|190700320|gb|ACE94402.1| putative glycosyl hydrolase protein [Rhizobium etli CIAT 652]
          Length = 369

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 81/348 (23%), Positives = 131/348 (37%), Gaps = 86/348 (24%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLPPGSERWSEPEKMSDDPEK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++FK P     L Y   Q      G + +   S + G T+   + L   PG    
Sbjct: 95  SEQNPLIFKAPDGKTWLLY-TSQTSGNQDGAIVKCRISDDGGKTFGPVQILCDSPGTF-- 151

Query: 135 SKNKPILLENG-----LLLC-GSSVESWNSWGSWMEVTV--DAGRSWR------KYGPIY 180
            + + ++ + G     +  C G   + W+       V +  D G SW+        G ++
Sbjct: 152 VRQQIVVNDRGDWLLPIFRCVGLEGQRWSGDADTAAVLISRDGGASWQMRDIPDSIGAVH 211

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           +    L     V F+  NR    ++           S S DGG TWS  +PT+L N NS 
Sbjct: 212 MNILPLGGGDMVAFYR-NRFAETILS----------SRSADGGETWSPPEPTELPNNNSS 260

Query: 241 IDGVKLKDGRLLLAYN---------------------------------------TVSRG 261
           I    L +G + + YN                                        V R 
Sbjct: 261 IQATVLDNGGIAMVYNHSNATTSDARRQSLYDEIEGDEAGENAAVVADAGRKAVWGVPRA 320

Query: 262 VLKVALSKDDGDSWHDALTLE------------ENLAMEFSYPAVIQA 297
            L +A+S+D G ++   + L+            ++L  EFSYP+VI+ 
Sbjct: 321 PLSLAISRDGGTTFPHRIDLDTGDGFCLSNNSKDSLNREFSYPSVIEG 368


>gi|420433839|ref|ZP_14932844.1| hypothetical protein HPHPH24_0875 [Helicobacter pylori Hp H-24]
 gi|420507605|ref|ZP_15006114.1| hypothetical protein HPHPH24B_0780 [Helicobacter pylori Hp H-24b]
 gi|420532939|ref|ZP_15031302.1| hypothetical protein HPHPM1_0779 [Helicobacter pylori Hp M1]
 gi|420534600|ref|ZP_15032948.1| hypothetical protein HPHPM2_0749 [Helicobacter pylori Hp M2]
 gi|420536308|ref|ZP_15034650.1| hypothetical protein HPHPM3_0779 [Helicobacter pylori Hp M3]
 gi|420541499|ref|ZP_15039807.1| hypothetical protein HPHPM6_0901 [Helicobacter pylori Hp M6]
 gi|420543018|ref|ZP_15041313.1| hypothetical protein HPHPM9_0702 [Helicobacter pylori Hp M9]
 gi|393050254|gb|EJB51214.1| hypothetical protein HPHPH24_0875 [Helicobacter pylori Hp H-24]
 gi|393117458|gb|EJC17959.1| hypothetical protein HPHPH24B_0780 [Helicobacter pylori Hp H-24b]
 gi|393140570|gb|EJC40943.1| hypothetical protein HPHPM1_0779 [Helicobacter pylori Hp M1]
 gi|393141240|gb|EJC41605.1| hypothetical protein HPHPM2_0749 [Helicobacter pylori Hp M2]
 gi|393143956|gb|EJC44300.1| hypothetical protein HPHPM3_0779 [Helicobacter pylori Hp M3]
 gi|393148119|gb|EJC48443.1| hypothetical protein HPHPM6_0901 [Helicobacter pylori Hp M6]
 gi|393160081|gb|EJC60330.1| hypothetical protein HPHPM9_0702 [Helicobacter pylori Hp M9]
          Length = 372

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|386750984|ref|YP_006224204.1| hypothetical protein HPSH417_02785 [Helicobacter pylori Shi417]
 gi|384557242|gb|AFH97710.1| hypothetical protein HPSH417_02785 [Helicobacter pylori Shi417]
          Length = 372

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+      +   +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|300022380|ref|YP_003754991.1| hypothetical protein Hden_0853 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299524201|gb|ADJ22670.1| hypothetical protein Hden_0853 [Hyphomicrobium denitrificans ATCC
           51888]
          Length = 394

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 75/330 (22%), Positives = 131/330 (39%), Gaps = 44/330 (13%)

Query: 25  KSCHASTIVEVDKGHF-------------LVAYFGGSCEGAPDVKIWLQTFKDGRWQSPI 71
           +  HA+TI   D+  F             L  ++    EGA + +I    F    W    
Sbjct: 51  QQIHAATITPPDEKDFVHGPSLIQTPAGLLAFWYRAVYEGAANAEIVSSRFDGTHWSPTA 110

Query: 72  IADEEPNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGV 119
           +  +   V          + NPV F+  +N + LF+     +  W+ C   + RS + G 
Sbjct: 111 VVKDSSAVTRDIGLTVKSLANPVAFRRSANEIWLFFA-ASRLSGWATCEIILVRSTDNGQ 169

Query: 120 TWSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
           TW   ++L       +   +K+ PI L +G  +   +    N     M V    GR   K
Sbjct: 170 TWERAQRLYASPFLNMSHLTKSLPIRL-SGDRIALPAYHEMNRKYPIMLVLDGNGRVVDK 228

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGI--GRVCMSESCDGGLTWSYAKPTQ 233
              + + N      QP    T+  T    +R  N      + ++ + D G TW+   P  
Sbjct: 229 ---VRMGNGGTVGYQPAIVPTSPTTAIAFVRRLNSFRPQSILITRTTDAGRTWTPVAPID 285

Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSR--GVLKVALSKDDGDSWHDALTL----EENLAM 287
           L NP   I  ++  + R+LLA+N   +    +K+A S  DG S+    TL    ++    
Sbjct: 286 LPNPGGPIAAIRYDETRILLAFNDDPKDESNVKLAFSNLDGTSFRRIGTLVQVNDQVKGD 345

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIKHVV 317
             +YP ++++  G   + ++  R   +HV+
Sbjct: 346 ATAYPFLLESEPGQFDVVFS--RPPPQHVI 373


>gi|420451898|ref|ZP_14950748.1| hypothetical protein HPHPA6_0645 [Helicobacter pylori Hp A-6]
 gi|393069739|gb|EJB70534.1| hypothetical protein HPHPA6_0645 [Helicobacter pylori Hp A-6]
          Length = 372

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLIGLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   ++ PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LFRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|87306692|ref|ZP_01088839.1| hypothetical protein DSM3645_10172 [Blastopirellula marina DSM
           3645]
 gi|87290871|gb|EAQ82758.1| hypothetical protein DSM3645_10172 [Blastopirellula marina DSM
           3645]
          Length = 421

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 76/161 (47%), Gaps = 14/161 (8%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           V+ D G+SW + G    PN      + +     + +L +L R+  G+     S S DGG 
Sbjct: 232 VSSDQGKSWERRGAATFPNPDWH--EHMIVEREDGSLWMLARTTKGL---MQSISTDGGE 286

Query: 225 TWSY-AKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------RGVLKVALSKDDGDSWHD 277
           TW+  ++P  +  PN+     +L  G++LL  +         R  L   LS D+G +W  
Sbjct: 287 TWAKPSEPAGIQQPNARFHVRRLASGKILLIKHGDKIDSHHGRVQLSAWLSDDEGKTWTG 346

Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            L L+E   +  SYP   QA DG ++I+Y  NR     ++L
Sbjct: 347 GLVLDERKGV--SYPDGFQAPDGRIYISYDRNRATDGEILL 385


>gi|420408708|ref|ZP_14907864.1| hypothetical protein HPNQ4216_0918 [Helicobacter pylori NQ4216]
 gi|393024124|gb|EJB25237.1| hypothetical protein HPNQ4216_0918 [Helicobacter pylori NQ4216]
          Length = 372

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  IG  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-IGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420421925|ref|ZP_14921003.1| hypothetical protein HPNQ4110_0734 [Helicobacter pylori NQ4110]
 gi|393038443|gb|EJB39477.1| hypothetical protein HPNQ4110_0734 [Helicobacter pylori NQ4110]
          Length = 372

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   + ++    
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEDLSHHS 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKVLDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|407458234|ref|YP_006736539.1| hypothetical protein B601_0714 [Chlamydia psittaci WS/RT/E30]
 gi|405785505|gb|AFS24251.1| hypothetical protein B601_0714 [Chlamydia psittaci WS/RT/E30]
          Length = 357

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 134/310 (43%), Gaps = 35/310 (11%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL K+ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVKISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNE 184
            G ++N P +  + N + +   +V           SW+E+     +SW  + GPI   + 
Sbjct: 137 EGTTRNPPYVDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSR 196

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
           +   I+P      +  L +  R+ +     +CM+ S + G +WS  K     + ++ I  
Sbjct: 197 NQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAV 256

Query: 244 VKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
              ++G+L L  N+   G  L    S ++G +W+    ++   +++   P+      G +
Sbjct: 257 QSGRNGKLYLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCM 313

Query: 303 HITYTYNRTQ 312
           H+ YT    Q
Sbjct: 314 HLVYTSKDDQ 323


>gi|420423565|ref|ZP_14922636.1| hypothetical protein HPHPA4_0790 [Helicobacter pylori Hp A-4]
 gi|393041287|gb|EJB42303.1| hypothetical protein HPHPA4_0790 [Helicobacter pylori Hp A-4]
          Length = 372

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I         + 
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSQSS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420489046|ref|ZP_14987643.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11]
 gi|420522725|ref|ZP_15021149.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11b]
 gi|393107796|gb|EJC08336.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11]
 gi|393129682|gb|EJC30116.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-11b]
          Length = 372

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|408372942|ref|ZP_11170641.1| hypothetical protein A11A3_02647 [Alcanivorax hongdengensis A-11-3]
 gi|407767294|gb|EKF75732.1| hypothetical protein A11A3_02647 [Alcanivorax hongdengensis A-11-3]
          Length = 413

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 137/375 (36%), Gaps = 79/375 (21%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
             TF A+ A     HA+++VE+  G     +F GS EGA DV I    F  ++G W    
Sbjct: 41  RMTF-ASDADTTMVHAASMVELPDGRLRAFWFAGSREGASDVAIHSALFDPRNGSWADEH 99

Query: 72  IA-DEEPNVPMWNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVT 120
           +A   E     W   + KL         NG +  + +      W+     +  S + G +
Sbjct: 100 VAVTREQIADQWGRYVRKLGNAVPVLDDNGHMRLFVVAVSFGGWAASRLVVLGSDDLGTS 159

Query: 121 WSEREQLPPG----ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKY 176
           W   + L       I    K  PI   +G +      E    +G  + +           
Sbjct: 160 WQFEDALTTSPFMNISTLVKAPPIRYSDGTVGLPVYHEMLGKFGEILRLDPQN----HIL 215

Query: 177 GPIYIPNESLSVIQPV-----PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           G   I       IQP+     P H A   LR   R  NG   +  S S +GG  W     
Sbjct: 216 GKARI-GHGRKAIQPLVLVDSPTHAAA-FLRNEQRPNNG--HLYQSHSVNGGWDWQPLTY 271

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSW-------------H 276
           ++L NP+S + GV +  G  LLA   NT  R  L +  ++D G SW              
Sbjct: 272 SRLTNPSSALGGVVIAPGHWLLATNCNTEERDDLCIRETRDGGASWTPRWFFHNRQAWRD 331

Query: 277 DALTLEENLAM---------------------------------EFSYPAVIQASDGSVH 303
           D +     L M                                 ++ YP +++AS+G +H
Sbjct: 332 DQMQPRPFLNMIDNELDKTVQVDNPARYLDRVRENKCRRGECTFQYDYPYMLRASNGDLH 391

Query: 304 ITYTYNRTQIKHVVL 318
           + YT+N+  I+H  L
Sbjct: 392 VLYTWNKAAIRHAWL 406


>gi|420440343|ref|ZP_14939299.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-30]
 gi|420499684|ref|ZP_14998238.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-26]
 gi|393056570|gb|EJB57481.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-30]
 gi|393149664|gb|EJC49973.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-26]
          Length = 372

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|386752570|ref|YP_006225789.1| hypothetical protein HPSH169_02985 [Helicobacter pylori Shi169]
 gi|384558828|gb|AFH99295.1| hypothetical protein HPSH169_02985 [Helicobacter pylori Shi169]
          Length = 372

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+      +   +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESTLEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420479141|ref|ZP_14977790.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-34]
 gi|393095383|gb|EJB95987.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-34]
          Length = 372

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420456911|ref|ZP_14955730.1| hypothetical protein HPHPA16_0760 [Helicobacter pylori Hp A-16]
 gi|393074309|gb|EJB75069.1| hypothetical protein HPHPA16_0760 [Helicobacter pylori Hp A-16]
          Length = 372

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|406593611|ref|YP_006740790.1| hypothetical protein B712_0716 [Chlamydia psittaci NJ1]
 gi|405789483|gb|AFS28225.1| hypothetical protein B712_0716 [Chlamydia psittaci NJ1]
          Length = 357

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 136/315 (43%), Gaps = 45/315 (14%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFRWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
            G ++N P +  + N +      + S+N +          SW+E+     +SW  + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191

Query: 180 YIPNESLSVIQP-VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              + +   I+P +   T+NR       +      +CM+ S + G +WS  K     + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNRLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251

Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           + I     ++G+L L  N+   G  L    S ++G +W+    ++   +++   P+    
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308

Query: 298 SDGSVHITYTYNRTQ 312
             G +H+ YT    Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323


>gi|420428574|ref|ZP_14927609.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-17]
 gi|393046233|gb|EJB47213.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-17]
          Length = 372

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +    LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|87311054|ref|ZP_01093178.1| probable sialidase [Blastopirellula marina DSM 3645]
 gi|87286145|gb|EAQ78055.1| probable sialidase [Blastopirellula marina DSM 3645]
          Length = 371

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/249 (25%), Positives = 113/249 (45%), Gaps = 19/249 (7%)

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG---ILGPSKNKPILL 142
           L +L    + LFY +   ++     M+ S ++  TWSE     PG       + ++ I  
Sbjct: 106 LLRLADGRIALFYLVKNSIKDCKMRMRTSSDEAETWSEPIDCMPGEENYFVVNNDRVIQT 165

Query: 143 ENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI--PNESLSVIQPVP 193
           ++G L+  +++   +  G W         ++ D G++WR+     I   ++    +   P
Sbjct: 166 KSGRLIAPAAIHVRD--GKWQGAADIVCFLSDDNGKTWRRSQQTVIGKRDDGSRFVTQEP 223

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDGRLL 252
                +  R+LM   + +GR  +  S DGG T+S  +P    +P S     ++   G LL
Sbjct: 224 GVIELKDGRILMWIRSNLGRQAVCYSSDGGETFSDPQPWNFKSPVSPASIKRIPSTGDLL 283

Query: 253 LAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
           L +N     R  L VA+S D+G +W  A ++E +    F Y A I   D  V ++Y +  
Sbjct: 284 LVWNDNVKRRTPLNVAISADEGATWTHAKSIETDPNGWFCYTA-IHCVDDRVLLSY-WLT 341

Query: 311 TQIKHVVLL 319
            Q+KH +L+
Sbjct: 342 EQLKHPLLI 350


>gi|384450754|ref|YP_005663354.1| alpha-rhamnosidase-like protein, putative [Chlamydophila psittaci
           6BC]
 gi|328914848|gb|AEB55681.1| alpha-rhamnosidase-like protein, putative [Chlamydophila psittaci
           6BC]
          Length = 319

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 130/301 (43%), Gaps = 35/301 (11%)

Query: 30  STIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPM 81
           STIVE   G  LV +         G  C    D+ IW + ++    +S + +        
Sbjct: 2   STIVET-SGRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLFS-------- 48

Query: 82  WNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGILGPSKNKPI 140
            +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI G ++N P 
Sbjct: 49  -HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGIEGTTRNPPY 107

Query: 141 L--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNESLSVIQPVP 193
           +  + N + +   +V           SW+E+     +SW  + GPI   + +   I+P  
Sbjct: 108 VDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSRNQVPIEPAI 167

Query: 194 FHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLL 252
               +  L +  R+ +     +CM+ S + G +WS  K     + ++ I     ++G+L 
Sbjct: 168 VKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAVQSGRNGKLY 227

Query: 253 LAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRT 311
           L  N+   G  L    S ++G +W+    ++   +++   P+      G +H+ YT    
Sbjct: 228 LFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCMHLVYTSKDD 284

Query: 312 Q 312
           Q
Sbjct: 285 Q 285


>gi|449071311|ref|YP_007438391.1| hypothetical protein AO9_03425 [Chlamydophila psittaci Mat116]
 gi|449039819|gb|AGE75243.1| hypothetical protein AO9_03425 [Chlamydophila psittaci Mat116]
          Length = 357

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
            G ++N P +  + N +      + S+N +          SW+E+     +SW  + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              + +   I+P      +  L +  R+ +     +CM+ S + G +WS  K     + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251

Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           + I     ++G+L L  N+   G  L    S ++G +W+    ++   +++   P+    
Sbjct: 252 NSIAVQSGRNGKLYLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308

Query: 298 SDGSVHITYTYNRTQ 312
             G +H+ YT    Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323


>gi|407460857|ref|YP_006738632.1| hypothetical protein B603_0723 [Chlamydia psittaci WC]
 gi|405787140|gb|AFS25884.1| hypothetical protein B603_0723 [Chlamydia psittaci WC]
          Length = 357

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
            G ++N P +  + N +      + S+N +          SW+E+     +SW  + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              + +   I+P      +  L +  R+ +     +CM+ S + G +WS  K     + +
Sbjct: 192 LGGSRNQVPIEPAIVKLTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251

Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           + I     ++G+L L  N+   G  L    S ++G +W+    ++   +++   P+    
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308

Query: 298 SDGSVHITYTYNRTQ 312
             G +H+ YT    Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323


>gi|329943014|ref|ZP_08291788.1| hypothetical protein G5Q_0689 [Chlamydophila psittaci Cal10]
 gi|332287596|ref|YP_004422497.1| hypothetical protein CPSIT_0711 [Chlamydophila psittaci 6BC]
 gi|384451748|ref|YP_005664346.1| hypothetical protein CPS0A_0729 [Chlamydophila psittaci 01DC11]
 gi|384452722|ref|YP_005665319.1| hypothetical protein CPS0D_0725 [Chlamydophila psittaci 08DC60]
 gi|384453701|ref|YP_005666297.1| hypothetical protein CPS0C_0726 [Chlamydophila psittaci C19/98]
 gi|384454680|ref|YP_005667275.1| putative alpha-rhamnosidase-like protein [Chlamydophila psittaci
           02DC15]
 gi|392376827|ref|YP_004064605.1| hypothetical protein Cpsi_6531 [Chlamydophila psittaci RD1]
 gi|407454220|ref|YP_006733328.1| hypothetical protein B595_0770 [Chlamydia psittaci 84/55]
 gi|407455499|ref|YP_006734390.1| hypothetical protein B598_0713 [Chlamydia psittaci GR9]
 gi|407456909|ref|YP_006735482.1| hypothetical protein B600_0767 [Chlamydia psittaci VS225]
 gi|313848170|emb|CBY17171.1| hypothetical protein Cpsi_6531 [Chlamydophila psittaci RD1]
 gi|325507153|gb|ADZ18791.1| hypothetical protein CPSIT_0711 [Chlamydophila psittaci 6BC]
 gi|328814561|gb|EGF84551.1| hypothetical protein G5Q_0689 [Chlamydophila psittaci Cal10]
 gi|334692482|gb|AEG85701.1| hypothetical protein CPS0C_0726 [Chlamydophila psittaci C19/98]
 gi|334693458|gb|AEG86676.1| hypothetical protein CPS0A_0729 [Chlamydophila psittaci 01DC11]
 gi|334694437|gb|AEG87654.1| putative alpha-rhamnosidase-like protein [Chlamydophila psittaci
           02DC15]
 gi|334695411|gb|AEG88627.1| hypothetical protein CPS0D_0725 [Chlamydophila psittaci 08DC60]
 gi|405780979|gb|AFS19729.1| hypothetical protein B595_0770 [Chlamydia psittaci 84/55]
 gi|405782042|gb|AFS20791.1| hypothetical protein B598_0713 [Chlamydia psittaci GR9]
 gi|405784170|gb|AFS22917.1| hypothetical protein B600_0767 [Chlamydia psittaci VS225]
          Length = 357

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 19/302 (6%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVP 80
           S+  ++   STIVE   G  LV +     +     K+  +    G W  P     + +  
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVW-----QDCDSNKLIGRRCDLGIWSEPYEIFPQESGL 84

Query: 81  MWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGILGPSKNKP 139
             +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI G ++N P
Sbjct: 85  FSHPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGIEGTTRNPP 144

Query: 140 IL--LENGLLLCGSSVESWNSWG----SWMEVTVDAGRSWR-KYGPIYIPNESLSVIQPV 192
            +  + N + +   +V           SW+E+     +SW  + GPI   + +   I+P 
Sbjct: 145 YVDRIRNRVYIPSYNVYEIPGQAVIGDSWVEIYNFFSKSWEGRMGPILGGSRNQVPIEPA 204

Query: 193 PFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRL 251
                +  L +  R+ +     +CM+ S + G +WS  K     + ++ I     ++G+L
Sbjct: 205 IVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAVQSGRNGKL 264

Query: 252 LLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
            L  N+   G  L    S ++G +W+    ++   +++   P+      G +H+ YT   
Sbjct: 265 YLFANSARAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLDRQGCMHLVYTSKD 321

Query: 311 TQ 312
            Q
Sbjct: 322 DQ 323


>gi|420448593|ref|ZP_14947473.1| hypothetical protein HPHPH44_0668 [Helicobacter pylori Hp H-44]
 gi|393065947|gb|EJB66775.1| hypothetical protein HPHPH44_0668 [Helicobacter pylori Hp H-44]
          Length = 372

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF GS EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGSKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK +  +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420395231|ref|ZP_14894459.1| hypothetical protein HPCPY1124_0732 [Helicobacter pylori CPY1124]
 gi|393014498|gb|EJB15670.1| hypothetical protein HPCPY1124_0732 [Helicobacter pylori CPY1124]
          Length = 337

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 27  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEKLSYYS 86

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 87  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|406592550|ref|YP_006739730.1| hypothetical protein B711_0772 [Chlamydia psittaci CP3]
 gi|406594360|ref|YP_006741832.1| hypothetical protein B599_0718 [Chlamydia psittaci MN]
 gi|410858610|ref|YP_006974550.1| hypothetical protein BN356_6571 [Chlamydia psittaci 01DC12]
 gi|405782792|gb|AFS21540.1| hypothetical protein B599_0718 [Chlamydia psittaci MN]
 gi|405788422|gb|AFS27165.1| hypothetical protein B711_0772 [Chlamydia psittaci CP3]
 gi|410811505|emb|CCO02158.1| hypothetical protein BN356_6571 [Chlamydia psittaci 01DC12]
          Length = 357

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/315 (23%), Positives = 136/315 (43%), Gaps = 45/315 (14%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVETS-GRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW---------GSWMEVTVDAGRSWR-KYGPI 179
            G ++N P +  + N +      + S+N +          SW+E+     +SW  + GPI
Sbjct: 137 EGTTRNPPYVDRIRNRVY-----IPSYNVYEIPGQEVIGDSWVEIYNFFSKSWEGRMGPI 191

Query: 180 YIPNESLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPN 238
              + +   I+P      +  L +  R+ +     +CM+ S + G +WS  K     + +
Sbjct: 192 LGGSRNQVPIEPAIVKFTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYD 251

Query: 239 SGIDGVKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           + I     ++G+L L  N+   G  L    S ++G +W+    ++   +++   P+    
Sbjct: 252 NSIAVQSGRNGKLYLFANSAPAGERLTCFSSYNEGITWNHPKLIDSGYSVD---PSAYLD 308

Query: 298 SDGSVHITYTYNRTQ 312
             G +H+ YT    Q
Sbjct: 309 RQGCMHLVYTSKDDQ 323


>gi|254779428|ref|YP_003057533.1| hypothetical protein HELPY_0795 [Helicobacter pylori B38]
 gi|254001339|emb|CAX29324.1| Conserved hypothetical protein; putative signal peptide
           [Helicobacter pylori B38]
          Length = 358

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 48  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSKNA 107

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 165

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 166 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 220

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 276

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 277 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 332

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346


>gi|385217538|ref|YP_005779014.1| hypothetical protein HPF16_0779 [Helicobacter pylori F16]
 gi|317177587|dbj|BAJ55376.1| hypothetical protein HPF16_0779 [Helicobacter pylori F16]
          Length = 372

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKINRWSEAFIILTKEELSFYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|208434500|ref|YP_002266166.1| hypothetical protein HPG27_539 [Helicobacter pylori G27]
 gi|208432429|gb|ACI27300.1| hypothetical protein HPG27_539 [Helicobacter pylori G27]
          Length = 358

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 133/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 48  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFTILTKEDLSHHS 107

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKVYQFESALEPIRFKFARKLSLSP 165

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 166 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 220

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 276

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 277 L-NKELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNSHFID 333

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 334 IVYTYNRSHIKHI 346


>gi|420504219|ref|ZP_15002746.1| hypothetical protein HPHPP62_0678 [Helicobacter pylori Hp P-62]
 gi|393154631|gb|EJC54912.1| hypothetical protein HPHPP62_0678 [Helicobacter pylori Hp P-62]
          Length = 358

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 48  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSRHS 107

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 108 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 165

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 166 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 220

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 221 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 276

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 277 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 332

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346


>gi|420430364|ref|ZP_14929393.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-20]
 gi|393047776|gb|EJB48748.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-20]
          Length = 372

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 129/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +    LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420490736|ref|ZP_14989319.1| hypothetical protein HPHPP13_0839 [Helicobacter pylori Hp P-13]
 gi|420524586|ref|ZP_15022993.1| hypothetical protein HPHPP13B_0844 [Helicobacter pylori Hp P-13b]
 gi|393107649|gb|EJC08190.1| hypothetical protein HPHPP13_0839 [Helicobacter pylori Hp P-13]
 gi|393130857|gb|EJC31281.1| hypothetical protein HPHPP13B_0844 [Helicobacter pylori Hp P-13b]
          Length = 372

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-----WSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFRFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK +  +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420401548|ref|ZP_14900742.1| hypothetical protein HPCPY6081_0398 [Helicobacter pylori CPY6081]
 gi|393019137|gb|EJB20281.1| hypothetical protein HPCPY6081_0398 [Helicobacter pylori CPY6081]
          Length = 372

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|338740003|ref|YP_004676965.1| hypothetical protein HYPMC_3184 [Hyphomicrobium sp. MC1]
 gi|337760566|emb|CCB66399.1| conserved protein of unknown function [Hyphomicrobium sp. MC1]
          Length = 394

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 68/340 (20%), Positives = 134/340 (39%), Gaps = 51/340 (15%)

Query: 11  VAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRWQSP 70
           ++ E   P   + F   H ++ V +  G     ++    EGA D ++    F    W   
Sbjct: 54  LSSEIVTPKGESEF--VHGASAVAMPNGGLQAFWYRAPYEGANDAELVSARFDGKEWSPT 111

Query: 71  IIADEEPNVP---------MWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKG 118
            +      V          + NPV F+     + LF+   + +  W+ C   + RS++ G
Sbjct: 112 HVVTTSARVSHDLAITIKSLANPVPFRRSDKEIWLFFATSR-LSGWATCEIGLIRSFDNG 170

Query: 119 VTWSEREQLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWR 174
            TW   E L       +   +K+ P L  +G +      E    +   + +  D     R
Sbjct: 171 ATWGPAESLYATPFFNMSHLTKSTPFLFSDGRIGLPVYQEMNQKFPVLLVLDADGHVIDR 230

Query: 175 K---------YGPIYI---PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG 222
           +         Y P+ +   P+ +++ ++P+     +R  R           + ++ + DG
Sbjct: 231 RRMGKAGKVGYQPMIVATGPSTAIAFVRPL----KSRRPR----------EILITRTTDG 276

Query: 223 GLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV--LKVALSKDDGDSWHDALT 280
           G TW+  +PT L NP   I  ++     +L+A+N  ++    + +AL+  DG ++  +  
Sbjct: 277 GQTWTPVEPTNLPNPGGPISAIRYDATHILMAFNDDAKMERNITLALADLDGKTFRRSGV 336

Query: 281 LE----ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           +     +      +YP +IQ+  G   + Y+     I HV
Sbjct: 337 IARMDTDRQDHIVAYPFLIQSGPGQFDVMYSRPIKTINHV 376


>gi|420506413|ref|ZP_15004928.1| hypothetical protein HPHPP74_1239 [Helicobacter pylori Hp P-74]
 gi|393115918|gb|EJC16428.1| hypothetical protein HPHPP74_1239 [Helicobacter pylori Hp P-74]
          Length = 372

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKTNRWSEAFTILTKEELSKNA 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420436991|ref|ZP_14935975.1| hypothetical protein HPHPH28_0700 [Helicobacter pylori Hp H-28]
 gi|393053305|gb|EJB54249.1| hypothetical protein HPHPH28_0700 [Helicobacter pylori Hp H-28]
          Length = 372

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAST++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASTLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPFYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPYFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|217032365|ref|ZP_03437860.1| hypothetical protein HPB128_25g13 [Helicobacter pylori B128]
 gi|298736271|ref|YP_003728797.1| hypothetical protein HPB8_776 [Helicobacter pylori B8]
 gi|216945932|gb|EEC24548.1| hypothetical protein HPB128_25g13 [Helicobacter pylori B128]
 gi|298355461|emb|CBI66333.1| conserved hypothetical protein [Helicobacter pylori B8]
          Length = 337

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 27  SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 86

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 87  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|420463871|ref|ZP_14962647.1| hypothetical protein HPHPH4_1004 [Helicobacter pylori Hp H-4]
 gi|393079353|gb|EJB80086.1| hypothetical protein HPHPH4_1004 [Helicobacter pylori Hp H-4]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420443569|ref|ZP_14942497.1| hypothetical protein HPHPH41_0647 [Helicobacter pylori Hp H-41]
 gi|393061076|gb|EJB61945.1| hypothetical protein HPHPH41_0647 [Helicobacter pylori Hp H-41]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + RK   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRK---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK +  +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420470428|ref|ZP_14969137.1| hypothetical protein HPHPH11_0782 [Helicobacter pylori Hp H-11]
 gi|393085861|gb|EJB86540.1| hypothetical protein HPHPH11_0782 [Helicobacter pylori Hp H-11]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 142/328 (43%), Gaps = 43/328 (13%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
           +FT P  +    S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    
Sbjct: 51  QFTIPKPNGAL-SVHASSLISLPNDNLLSAYFSGTKEGASDVKISANLFDSKTNRWSEAF 109

Query: 71  -IIADEEPN-------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
            I+  EE +         + NP+LF L  + +LLF  +G  +  W+          +   
Sbjct: 110 TILTKEELSHYSHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSW 173
              ++ +  L P   +    KNKP+   +G    G  +  ++   +   + +  D   + 
Sbjct: 168 HFKFARKLSLSPFLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDKQNNP 223

Query: 174 RKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           R+   +  PN     +QP   PF    +   ++    +      M E+C     W     
Sbjct: 224 RE---LLRPNTLNHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPML 276

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAME 288
           T L N N  ++ + L +  L L +N   +S    ++ LSK ++ +S+     L++  A E
Sbjct: 277 TNLKNLNDALNLINL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANE 333

Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHV 316
            SYP+    +   + I YTYNR+ IKH+
Sbjct: 334 VSYPSY-SLNPHFIDIVYTYNRSHIKHI 360


>gi|420492600|ref|ZP_14991174.1| hypothetical protein HPHPP15_0915 [Helicobacter pylori Hp P-15]
 gi|420526622|ref|ZP_15025023.1| hypothetical protein HPHPP15B_1184 [Helicobacter pylori Hp P-15b]
 gi|393107039|gb|EJC07582.1| hypothetical protein HPHPP15_0915 [Helicobacter pylori Hp P-15]
 gi|393131927|gb|EJC32350.1| hypothetical protein HPHPP15B_1184 [Helicobacter pylori Hp P-15b]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|419419543|ref|ZP_13959778.1| Hypothetical protein HP17_05720 [Helicobacter pylori NCTC 11637 =
           CCUG 17874]
 gi|384372253|gb|EIE27869.1| Hypothetical protein HP17_05720 [Helicobacter pylori NCTC 11637 =
           CCUG 17874]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASALINLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFTILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L ++ +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|387782418|ref|YP_005793131.1| hypothetical protein KHP_0738 [Helicobacter pylori 51]
 gi|261838177|gb|ACX97943.1| hypothetical protein KHP_0738 [Helicobacter pylori 51]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSFYS 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLMRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N+    + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNSSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420435864|ref|ZP_14934863.1| hypothetical protein HPHPH27_1221 [Helicobacter pylori Hp H-27]
 gi|420465270|ref|ZP_14964037.1| hypothetical protein HPHPH6_0683 [Helicobacter pylori Hp H-6]
 gi|420495871|ref|ZP_14994435.1| hypothetical protein HPHPP23_0961 [Helicobacter pylori Hp P-23]
 gi|393051723|gb|EJB52674.1| hypothetical protein HPHPH27_1221 [Helicobacter pylori Hp H-27]
 gi|393082757|gb|EJB83473.1| hypothetical protein HPHPH6_0683 [Helicobacter pylori Hp H-6]
 gi|393112182|gb|EJC12703.1| hypothetical protein HPHPP23_0961 [Helicobacter pylori Hp P-23]
          Length = 372

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASALINLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|425789178|ref|YP_007017098.1| hypothetical protein HPAKL117_02740 [Helicobacter pylori
           Aklavik117]
 gi|425627493|gb|AFX90961.1| hypothetical protein HPAKL117_02740 [Helicobacter pylori
           Aklavik117]
          Length = 372

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/326 (25%), Positives = 137/326 (42%), Gaps = 39/326 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
           +FT P       S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    
Sbjct: 51  QFTIPKPDDAL-SVHASVLINLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAF 109

Query: 71  -IIADEEPNV-------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
            I+  EE +         + NP+LF L  + +LLF  +G  +  W+          +   
Sbjct: 110 VILTKEELSFYSHEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
              +  +  L P   +    +NKP+   +G  +     E    +   ++   D   + R+
Sbjct: 168 HFKFVRKLSLSPFLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE 225

Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
              +  PN     +QP   PF    +   V+    +      M E+C    TW     T 
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTTWQKPMLTN 278

Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFS 290
           L N N  ++ + L +  L L +N     + +  + LSK ++ +S+     L++  A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVS 335

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
           YP+    +   + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNSHFIDIVYTYNRSHIKHI 360


>gi|405971945|gb|EKC36746.1| hypothetical protein CGI_10014218 [Crassostrea gigas]
          Length = 391

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 121/300 (40%), Gaps = 35/300 (11%)

Query: 44  YFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQE 103
           Y G SC    D +         +W    +  +       NPVLF      +L  +   QE
Sbjct: 96  YSGASCGCQIDWEANQLKNNSDQWTKAQVVSQRKGYSNQNPVLFHDNKTDVLYLFHSQQE 155

Query: 104 VQKWSGCMKRSYN--------KGVTWSEREQL-PPGILGPSK-----NKPIL-LENGLLL 148
            +K S  ++   +        K VT S   Q  PP ++   K     N+ ++ L N  L 
Sbjct: 156 AKKASAGIQSEDSAEIWVLSAKNVTNSTAIQFSPPRVMFKHKGSFDRNRVVVSLRNTWLY 215

Query: 149 -----CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESL--SVIQPVPFHTANRTL 201
                 GSS +  ++      V  D   SW  + P    N  +  SV++P      N  L
Sbjct: 216 PMYYAGGSSKDQTSNLKEC--VDHDVFSSWLDH-PFTASNYLVQPSVVRP---KKGNPRL 269

Query: 202 RVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--TVS 259
            V  R       V  +ES D G TW+    T L N NSGI+   L  G L + YN  T  
Sbjct: 270 IVFFRDRRAQN-VYRAESPDDGKTWTRPSKTTLPNNNSGIEASVLSSGNLAIVYNPTTKD 328

Query: 260 RGVLKVALSKDDGDSWHDALTLE---ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           R  L V+LS D G +W     LE    +  +EFSYP + Q S  ++  T  + + + KH+
Sbjct: 329 RNPLSVSLSDDQGITWKYTRNLEYTKSSNGVEFSYPTLFQDSSEAIGSTCVH-KMRGKHI 387


>gi|420483924|ref|ZP_14982551.1| hypothetical protein HPHPP3_0763 [Helicobacter pylori Hp P-3]
 gi|420514196|ref|ZP_15012669.1| hypothetical protein HPHPP3B_0577 [Helicobacter pylori Hp P-3b]
 gi|393101617|gb|EJC02187.1| hypothetical protein HPHPP3_0763 [Helicobacter pylori Hp P-3]
 gi|393158659|gb|EJC58919.1| hypothetical protein HPHPP3B_0577 [Helicobacter pylori Hp P-3b]
          Length = 372

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|385225478|ref|YP_005785403.1| hypothetical protein HMPREF0462_0836 [Helicobacter pylori 83]
 gi|332673624|gb|AEE70441.1| conserved hypothetical protein [Helicobacter pylori 83]
          Length = 358

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 134/315 (42%), Gaps = 40/315 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 48  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSYYS 107

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      + ++  L P
Sbjct: 108 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 165

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 166 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 220

Query: 188 VIQP--VPFHT-ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
            +QP   PF   A  T R      + +    M E+C     W     T L N N  ++ +
Sbjct: 221 QLQPSLTPFKDCALMTFRN-----HSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLI 275

Query: 245 KLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
            L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   
Sbjct: 276 NL-NQELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHF 331

Query: 302 VHITYTYNRTQIKHV 316
           + I YTYNR+ IKH+
Sbjct: 332 IDIVYTYNRSHIKHI 346


>gi|420410111|ref|ZP_14909254.1| hypothetical protein HPNQ4200_0646 [Helicobacter pylori NQ4200]
 gi|393028424|gb|EJB29510.1| hypothetical protein HPNQ4200_0646 [Helicobacter pylori NQ4200]
          Length = 347

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 37  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFTILTKEDLSHHS 96

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      +  +  L P
Sbjct: 97  HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVRKLSLSP 154

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 155 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 209

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 266 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 321

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335


>gi|420480610|ref|ZP_14979253.1| hypothetical protein HPHPP1_0821 [Helicobacter pylori Hp P-1]
 gi|420511109|ref|ZP_15009596.1| hypothetical protein HPHPP1B_0902 [Helicobacter pylori Hp P-1b]
 gi|393096626|gb|EJB97223.1| hypothetical protein HPHPP1_0821 [Helicobacter pylori Hp P-1]
 gi|393119593|gb|EJC20083.1| hypothetical protein HPHPP1B_0902 [Helicobacter pylori Hp P-1b]
          Length = 372

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|421710075|ref|ZP_16149432.1| BNR repeat-like domain protein [Helicobacter pylori R018c]
 gi|421723357|ref|ZP_16162611.1| BNR repeat-like domain protein [Helicobacter pylori R056a]
 gi|407210266|gb|EKE80145.1| BNR repeat-like domain protein [Helicobacter pylori R018c]
 gi|407224380|gb|EKE94156.1| BNR repeat-like domain protein [Helicobacter pylori R056a]
          Length = 372

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKKELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+      +   +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|385220636|ref|YP_005782108.1| hypothetical protein HPIN_03975 [Helicobacter pylori India7]
 gi|317009443|gb|ADU80023.1| hypothetical protein HPIN_03975 [Helicobacter pylori India7]
          Length = 323

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 132/317 (41%), Gaps = 44/317 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 13  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSHYS 72

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGC-----------MKRSYNKGVTWSE 123
                   NP+LF L  + +LLF  +G  +  W+             ++  + + ++ S 
Sbjct: 73  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVRKLSLSP 130

Query: 124 REQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPN 183
              L   I    KNKP+   +G  +     E    +   ++   D   S R+   +  PN
Sbjct: 131 FLNLSHLI----KNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKQNSPRE---LLRPN 181

Query: 184 ESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
                +QP   PF    +   ++    +      M E+C     W     T L N N  +
Sbjct: 182 ALNHQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDAL 237

Query: 242 DGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           + + L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    + 
Sbjct: 238 NLINL-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNS 294

Query: 300 GSVHITYTYNRTQIKHV 316
             + I YTYNR+ IKH+
Sbjct: 295 HFIDIVYTYNRSHIKHI 311


>gi|386746260|ref|YP_006219477.1| hypothetical protein HPB14_03790 [Helicobacter pylori HUP-B14]
 gi|384552509|gb|AFI07457.1| hypothetical protein HPB14_03790 [Helicobacter pylori HUP-B14]
          Length = 372

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQS--PIIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSETFTILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|385228312|ref|YP_005788245.1| hypothetical protein HPPN120_02855 [Helicobacter pylori Puno120]
 gi|344334750|gb|AEN15194.1| hypothetical protein HPPN120_02855 [Helicobacter pylori Puno120]
          Length = 372

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSFYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIQFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C    +W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTSWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|421711574|ref|ZP_16150917.1| BNR repeat-like domain protein [Helicobacter pylori R030b]
 gi|407212723|gb|EKE82585.1| BNR repeat-like domain protein [Helicobacter pylori R030b]
          Length = 372

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK +  +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420519358|ref|ZP_15017802.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-5b]
 gi|393128450|gb|EJC28894.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp H-5b]
          Length = 372

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|218682619|ref|ZP_03530220.1| putative glycosyl hydrolase protein [Rhizobium etli CIAT 894]
          Length = 358

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 101/252 (40%), Gaps = 27/252 (10%)

Query: 22  APFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNV 79
           +P    HA+ +  +  G     +FGG+ EG  D+ I++     G  RW  P    ++P  
Sbjct: 35  SPCIQNHAANLAFLPDGTLSCVWFGGTMEGMGDISIYMSRLAPGAERWSEPEKMSDDPKK 94

Query: 80  PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL--PPGILGP 134
              NP++F  P     L Y   Q      G + +   S + G T+     L   PG    
Sbjct: 95  SEQNPLIFNAPDRKTWLLY-TSQTSGNQDGSIVKCRISDDGGKTFGPVRVLCDSPGTF-- 151

Query: 135 SKNKPILLENG-------LLLC-GSSVESW--NSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
              +  ++ NG       +  C G   + W  N+  + + ++ D G SW+ +    IP+ 
Sbjct: 152 --VRQQIVVNGRGDWLLPIFRCVGLDGQRWSGNADTAAVLISRDRGASWQMHD---IPDS 206

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +V   +        +      F     +  S S DGG TWS  KPT+L N NS I   
Sbjct: 207 IGAVHMNILPLGGGEMIAFYRNRF--AENILFSRSSDGGETWSPPKPTELPNNNSSIQAT 264

Query: 245 KLKDGRLLLAYN 256
            L D  + + YN
Sbjct: 265 MLNDAAIAMVYN 276


>gi|384894376|ref|YP_005768425.1| hypothetical protein HPSAT_03905 [Helicobacter pylori Sat464]
 gi|308063630|gb|ADO05517.1| hypothetical protein HPSAT_03905 [Helicobacter pylori Sat464]
          Length = 337

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 27  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 86

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 87  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|149174320|ref|ZP_01852947.1| probable sialidase [Planctomyces maris DSM 8797]
 gi|148846865|gb|EDL61201.1| probable sialidase [Planctomyces maris DSM 8797]
          Length = 382

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 106/245 (43%), Gaps = 24/245 (9%)

Query: 83  NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLP-PGILGPSKNKPIL 141
           +P L +L    +L  Y    +  + +  ++RS + G TWSE++Q+  PG    + +  + 
Sbjct: 119 HPNLIRLSEQEILFSYVGWDDENQRNVFLRRSTDNGKTWSEQQQISEPGWYCCNADHALR 178

Query: 142 LENGLLL-----------CGSSVESWNSWGSWMEVTVDAGRSW-RKYGPIYIPNESLSVI 189
           L  G +L            G +        S++  + D  R+W R    +  P       
Sbjct: 179 LSTGRILIPAHGPYAKQYVGGTSYKGGDLHSFVFYSDDGFRTWKRSANSMTAPGRGCH-- 236

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KD 248
           +P         L   +R+ N   R+  S S DGG  WS  +PT+L +P S     ++   
Sbjct: 237 EPAIVELKEGRLLCFLRNTNQ--RLYQSFSDDGGAHWSRPEPTKLPSPESPAIVKRIPST 294

Query: 249 GRLLLAYNTVS------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G LLL +N V+      R  L  A+S+DDG +W     ++   A + +YP++   +D + 
Sbjct: 295 GDLLLLWNNVASPTNWPRTPLSAAISQDDGKTWGHFKDIDARPAFDAAYPSITFVNDEAF 354

Query: 303 HITYT 307
              Y+
Sbjct: 355 IAYYS 359


>gi|385229873|ref|YP_005789789.1| hypothetical protein HPPN135_02860 [Helicobacter pylori Puno135]
 gi|344336311|gb|AEN18272.1| hypothetical protein HPPN135_02860 [Helicobacter pylori Puno135]
          Length = 372

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDNKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|412994058|emb|CCO14569.1| exported exo-alpha-sialidase [Bathycoccus prasinos]
          Length = 692

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 133/357 (37%), Gaps = 70/357 (19%)

Query: 2   KKDCGIKG-----LVAEEFTFPANSAPFKSCHASTIVEVDKGH----FLVAYFGGSCEGA 52
           K  CG K      ++ + F F  N   FK  H   I E  K              S EGA
Sbjct: 251 KNGCGSKEAKKNIVLKQSFLF-RNGESFKYGHMVMIGEAPKNSPWKWAATWQASNSFEGA 309

Query: 53  PDVKIWLQTFKD--GRWQSPI---IADEEPNVPMWNPVLFKLPSNGLL-LFYKIGQEVQK 106
              KI +    D  G W  P+   I        +W PV  K P+ G+  LFY      +K
Sbjct: 310 FGQKIVISFADDPTGTWTPPMVIPIPVTPSTAAVWGPVWHKDPNTGVAWLFYAASTSCRK 369

Query: 107 WSGCMKRSYN-----------KGVTWSEREQLPPGIL---GPSKNKPILLENGLLL---- 148
            +G  K  Y             G+ WS     P  IL     S + P L+ N L +    
Sbjct: 370 NAGHGKVKYAPGGDIQVIKSLDGIQWS----APKTILMQAAESNHLPKLVSNQLAVHEPT 425

Query: 149 ----------------CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPV 192
                           C  SV       + + ++ D G+SW   G          +   V
Sbjct: 426 GYWVLPMWRESPHSNTCKPSVSKGTRTSAGVLLSKDQGKSWSVVGSYRARGVRWLIEGTV 485

Query: 193 PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD---- 248
                 + L +L RS  G   +    S DGG +WS A+ T + NPNS ++ ++L+     
Sbjct: 486 AEVGDTKDLLLLHRS--GEKYLYKQISKDGGKSWSNAERTSVPNPNSKVNLIRLESSKGA 543

Query: 249 -----GRLLLAYNTVSRGV---LKVALSKDDGDSWHDALTLEENL-AMEFSYPAVIQ 296
                G L LAYN    GV   L VALS  DG  W     LE++   + F+YP + Q
Sbjct: 544 PTNLGGSLALAYNDAMSGVRRKLSVALS-SDGHKWSKLQLLEDSSPGLHFNYPTIAQ 599


>gi|420453401|ref|ZP_14952240.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-8]
 gi|393071009|gb|EJB71798.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp A-8]
          Length = 372

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420455377|ref|ZP_14954206.1| hypothetical protein HPHPA14_0817 [Helicobacter pylori Hp A-14]
 gi|393072349|gb|EJB73127.1| hypothetical protein HPHPA14_0817 [Helicobacter pylori Hp A-14]
          Length = 372

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKINRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420474079|ref|ZP_14972755.1| hypothetical protein HPHPH19_0917 [Helicobacter pylori Hp H-19]
 gi|393089215|gb|EJB89855.1| hypothetical protein HPHPH19_0917 [Helicobacter pylori Hp H-19]
          Length = 372

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLID 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|384889215|ref|YP_005763517.1| hypothetical protein HPV225_0586 [Helicobacter pylori v225d]
 gi|297379781|gb|ADI34668.1| Hypothetical protein HPV225_0586 [Helicobacter pylori v225d]
          Length = 337

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVK+    F  K  RW    I   +  +  + 
Sbjct: 27  SVHASALISLPNDNLLSAYFSGTKEGARDVKVSANLFDSKINRWSEAFIILTKEELSYYS 86

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-----WSEREQLPP 129
                   NP+LF L ++ +LLF  +G  +  W+          +      + ++  L P
Sbjct: 87  HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQLESALEPIRFRFVQKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 145 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNSHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|108562984|ref|YP_627300.1| hypothetical protein HPAG1_0559 [Helicobacter pylori HPAG1]
 gi|107836757|gb|ABF84626.1| hypothetical protein HPAG1_0559 [Helicobacter pylori HPAG1]
          Length = 372

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYTLLLK--FDKNNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420400167|ref|ZP_14899369.1| hypothetical protein HPCPY3281_0734 [Helicobacter pylori CPY3281]
 gi|393018384|gb|EJB19534.1| hypothetical protein HPCPY3281_0734 [Helicobacter pylori CPY3281]
          Length = 337

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 27  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSYYS 86

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 87  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 145 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYP--LLLKFDKHNNPRE---LLRPNALNH 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N  +  + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NQELYLIHNPSNSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|421719657|ref|ZP_16158941.1| BNR repeat-like domain protein [Helicobacter pylori R046Wa]
 gi|407220980|gb|EKE90785.1| BNR repeat-like domain protein [Helicobacter pylori R046Wa]
          Length = 372

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWKKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELFLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|210134781|ref|YP_002301220.1| neuraminidase [Helicobacter pylori P12]
 gi|210132749|gb|ACJ07740.1| neuraminidase [Helicobacter pylori P12]
          Length = 372

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 137/326 (42%), Gaps = 39/326 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
           +FT P  +    S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    
Sbjct: 51  QFTIPKPNGAL-SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKINRWSEAF 109

Query: 72  IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
           I   +  +  +         NP+LF L  + +LLF  +G  +  W+          +   
Sbjct: 110 IILTKEELSHYSHEYIKKLGNPLLF-LHEDKILLFV-VGVSMGGWATSKIYQLESTLEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
              + ++  L P   +    KNKP+   +G  +     E    +   ++   D   + R+
Sbjct: 168 HFKFVQKLSLSPFLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNSRE 225

Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
              +  PN     +QP   PF    +   ++    +      M E+C     W     T 
Sbjct: 226 ---LLRPNTLNHQLQPSLTPF----KNCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTN 278

Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFS 290
           L N N  ++ + L +  L L +N   +S    ++ LSK ++ +S+     L++  A E S
Sbjct: 279 LKNLNDALNLINLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKILKVLDK--ANEVS 335

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
           YP+    +   + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNPHFIDIVYTYNRSHIKHI 360


>gi|420396383|ref|ZP_14895603.1| hypothetical protein HPCPY1313_0212 [Helicobacter pylori CPY1313]
 gi|393014374|gb|EJB15547.1| hypothetical protein HPCPY1313_0212 [Helicobacter pylori CPY1313]
          Length = 372

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSYYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWKKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|322379165|ref|ZP_08053558.1| hypothetical protein HSUHS1_0796 [Helicobacter suis HS1]
 gi|321148396|gb|EFX42903.1| hypothetical protein HSUHS1_0796 [Helicobacter suis HS1]
          Length = 354

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 47/323 (14%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADEEPNV- 79
           P KS H++T+  +   + L AYFGGS EG  DVKI+  L + K   WQ        P + 
Sbjct: 40  PVKSVHSATLSALSNDYLLAAYFGGSAEGLSDVKIYANLYSIKTQTWQKAFALLSAPELS 99

Query: 80  --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT----WSEREQL 127
                    + NPVLF    +  L  + +G  +  W+          +T    +S  + L
Sbjct: 100 RLANAYVKALGNPVLFT--QHDRLYLFVVGVSLGGWATSRIYMLVSNITPPFHFSLLQIL 157

Query: 128 PPGILGPSKNKPILLENGLLLC---GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
           P   L P  N   L+   +L     G  + S++       + +      R    I  PN 
Sbjct: 158 P---LSPFLNISHLVRTAVLPTTDGGFLLPSYHELAHKFPLLLKFDTQDRLEFLIK-PNT 213

Query: 185 SLSVIQPV--PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
               +QP   P+        + +RS+    +   +++C   L W+   P  L N ++  +
Sbjct: 214 LKQQLQPTLAPYKACA---LMALRSYK--TQQFYTQTCKTPLQWN---PPHLSNLHNYDN 265

Query: 243 GVKLKDGR--LLLAYN------TVSRGVLKVALSKDDGDSWHDALT-LEENLAMEFSYPA 293
            + L +    L L YN         R  L +A  K   D +   LT L+ +L  E SYPA
Sbjct: 266 SIVLFNANHTLYLLYNEPLKNSNNPRSALYLAKFK---DGYFSPLTILDSSLKGEVSYPA 322

Query: 294 VIQASDGSVHITYTYNRTQIKHV 316
           V+   +  +H+ YT  R  I H+
Sbjct: 323 VLNF-NNQIHVLYTKERKAIMHL 344


>gi|420460081|ref|ZP_14958880.1| hypothetical protein HPHPA27_0567 [Helicobacter pylori Hp A-27]
 gi|393077183|gb|EJB77932.1| hypothetical protein HPHPA27_0567 [Helicobacter pylori Hp A-27]
          Length = 372

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 82/326 (25%), Positives = 136/326 (41%), Gaps = 39/326 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
           +FT P       S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    
Sbjct: 51  QFTIPRPDDAL-SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAF 109

Query: 72  IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
           I   +  +  +         NP+LF L  + +LLF  +G  +  W+      +   +   
Sbjct: 110 ILLTKEELSHYSHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
              +  +  L P   +    KNKP+   +G  +     E    +   ++   D   + R+
Sbjct: 168 HFKFVRKLSLSPFLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE 225

Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
              +  PN     +QP   PF    +   ++    +      M E+C     W     T 
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTIWQKPMLTN 278

Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFS 290
           L N N  ++ + L +  L L +N   +S    ++ LSK ++ +S+     L++  A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVS 335

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
           YP+    +   + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNPHFIDIVYTYNRSHIKHI 360


>gi|52843133|ref|YP_096932.1| BNR/Asp box repeat-containing protein [Legionella pneumophila
           subsp. pneumophila str. Philadelphia 1]
 gi|52630244|gb|AAU28985.1| hypothetical BNR/Asp box repeat protein [Legionella pneumophila
           subsp. pneumophila str. Philadelphia 1]
          Length = 390

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 80/340 (23%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
           K  H++T+ +VD  H L  YF GS EG PDVKI+   +   R  W +P I      +   
Sbjct: 71  KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSTPKIVLTREKLRKN 130

Query: 80  ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
                  + NPV++    NG L  + +      W+        S +KG +++ +  L   
Sbjct: 131 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 185

Query: 131 ILGPSKN-------KPI-LLENGLLL-CGSSVES-------WNSWGSWMEVTVDAGRSWR 174
            L P  N        P+ L++ G  L     +ES       ++  GS+ E ++       
Sbjct: 186 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTE-SIQINHIGG 244

Query: 175 KYGPIYIPNESLSVIQPVPFHTANRT---LRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           +  P  IP  SL+ +    F  AN+T   +R+         + C     +GGL+W   + 
Sbjct: 245 QLQPSIIPLSSLNCL--AGFRNANKTDHWMRI---------QYCQ----EGGLSWLDPQK 289

Query: 232 TQLLNPNSGIDGVKLKD---------------GRLLLAYNTVSRGVLKVALSKDDGDSWH 276
           T + N N+ I+                     G+L+L+    S G+     + D G+  +
Sbjct: 290 TNIKNMNNSINLFNFNSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINN 349

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            ++T          YP+ +    G V I Y  N  ++ H+
Sbjct: 350 KSVT----------YPSTL-VDRGYVDIIYDKNANEVIHI 378


>gi|378778819|ref|YP_005187261.1| putative BNR/Asp box repeat protein [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|364509637|gb|AEW53161.1| putative BNR/Asp box repeat protein [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 395

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 80/340 (23%), Positives = 135/340 (39%), Gaps = 80/340 (23%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
           K  H++T+ +VD  H L  YF GS EG PDVKI+   +   R  W +P I      +   
Sbjct: 76  KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSTPKIVLTREKLRKN 135

Query: 80  ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
                  + NPV++    NG L  + +      W+        S +KG +++ +  L   
Sbjct: 136 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 190

Query: 131 ILGPSKN-------KPI-LLENGLLL-CGSSVES-------WNSWGSWMEVTVDAGRSWR 174
            L P  N        P+ L++ G  L     +ES       ++  GS+ E ++       
Sbjct: 191 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTE-SIQINHIGG 249

Query: 175 KYGPIYIPNESLSVIQPVPFHTANRT---LRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
           +  P  IP  SL+ +    F  AN+T   +R+         + C     +GGL+W   + 
Sbjct: 250 QLQPSIIPLSSLNCL--AGFRNANKTDHWMRI---------QYCQ----EGGLSWLDPQK 294

Query: 232 TQLLNPNSGIDGVKLKD---------------GRLLLAYNTVSRGVLKVALSKDDGDSWH 276
           T + N N+ I+                     G+L+L+    S G+     + D G+  +
Sbjct: 295 TNIKNMNNSINLFNFNSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINN 354

Query: 277 DALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            ++T          YP+ +    G V I Y  N  ++ H+
Sbjct: 355 KSVT----------YPSTL-VDRGYVDIIYDKNANEVIHI 383


>gi|421712896|ref|ZP_16152227.1| BNR repeat-like domain protein [Helicobacter pylori R32b]
 gi|407216262|gb|EKE86099.1| BNR repeat-like domain protein [Helicobacter pylori R32b]
          Length = 372

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPFYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELFLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPYFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|322379649|ref|ZP_08053968.1| hypothetical neuraminidase (sialidase) [Helicobacter suis HS5]
 gi|321147911|gb|EFX42492.1| hypothetical neuraminidase (sialidase) [Helicobacter suis HS5]
          Length = 354

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 83/323 (25%), Positives = 131/323 (40%), Gaps = 47/323 (14%)

Query: 23  PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADEEPNV- 79
           P KS H++T+  +   + L AYFGGS EG  DVKI+  L + K   WQ        P + 
Sbjct: 40  PVKSVHSATLSALSNDYLLAAYFGGSAEGLSDVKIYANLYSIKTQTWQKTFALLSAPELS 99

Query: 80  --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT----WSEREQL 127
                    + NPVLF    +  L  + +G  +  W+          +T    +S  + L
Sbjct: 100 RLANAYVKALGNPVLFT--QHDRLYLFVVGVSLGGWATSRIYMLVSNITPPFHFSLLQIL 157

Query: 128 PPGILGPSKNKPILLENGLLLC---GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
           P   L P  N   L+   +L     G  + S++       + +      R    I  PN 
Sbjct: 158 P---LSPFLNISHLVRTAVLPTTDGGFLLPSYHELAHKFPLLLKFDTQDRLEFLIK-PNT 213

Query: 185 SLSVIQPV--PFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGID 242
               +QP   P+        + +RS+    +   +++C   L W+   P  L N ++  +
Sbjct: 214 LKQQLQPTLAPYKACA---LMALRSYK--TQQFYTQTCKTPLQWN---PPHLSNLHNYDN 265

Query: 243 GVKLKDGR--LLLAYN------TVSRGVLKVALSKDDGDSWHDALT-LEENLAMEFSYPA 293
            + L +    L L YN         R  L +A  K   D +   LT L+ +L  E SYPA
Sbjct: 266 SIVLFNANHTLYLLYNEPLKNSNNPRSALYLAKFK---DGYFSPLTILDSSLKGEVSYPA 322

Query: 294 VIQASDGSVHITYTYNRTQIKHV 316
           V+   +  +H+ YT  R  I H+
Sbjct: 323 VLNF-NNQIHVLYTKERKAIMHL 344


>gi|407459486|ref|YP_006737589.1| hypothetical protein B602_0720 [Chlamydia psittaci M56]
 gi|405786183|gb|AFS24928.1| hypothetical protein B602_0720 [Chlamydia psittaci M56]
          Length = 357

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/310 (23%), Positives = 132/310 (42%), Gaps = 35/310 (11%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAY--------FGGSCEGAPDVKIWLQTFKDGRWQSPII 72
           S+  ++   STIVE   G  LV +         G  C    D+ IW + ++    +S + 
Sbjct: 31  SSDNRTSGMSTIVET-SGRLLVVWQDCDSNKLIGRRC----DLGIWSEPYEIFPQESGLF 85

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG-CMKRSYNKGVTWSEREQLPPGI 131
           +         +PVL ++ S  + LFY+  Q  +  S   +  S+N G  W E   LPPGI
Sbjct: 86  S---------HPVLVRISSQEIWLFYRKQQVGESRSQPYVAFSFNSGFHWLEHRLLPPGI 136

Query: 132 LGPSKNKPIL--LENGLLLCGSSVESWNSW----GSWMEVTVDAGRSWR-KYGPIYIPNE 184
            G ++N P +  + N + +   +V           SW+E+      SW  + GPI   + 
Sbjct: 137 EGTTRNPPYVDRIRNRVYIPSYNVYEMPGQEVIGDSWVEIYNFFSTSWEGRMGPILGGSR 196

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
           +   I+P      +  L +  R+ +     +CM+ S + G +WS  K     + ++ I  
Sbjct: 197 NQVPIEPAIVKLTSNKLALFFRNASVEESYICMAVSTNQGSSWSKLKNLPWRSYDNSIAV 256

Query: 244 VKLKDGRLLLAYNTVSRG-VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
              ++G+L L  N+   G  L    S ++G  W+    ++   +++   P+      G +
Sbjct: 257 QSGRNGKLYLFANSAPAGERLTCFSSYNEGIIWNHPKLIDSGYSVD---PSAYLDRQGCM 313

Query: 303 HITYTYNRTQ 312
           H+ YT    Q
Sbjct: 314 HLVYTSKDDQ 323


>gi|418936222|ref|ZP_13489953.1| putative glycosyl hydrolase protein, partial [Rhizobium sp.
           PDO1-076]
 gi|375057060|gb|EHS53253.1| putative glycosyl hydrolase protein, partial [Rhizobium sp.
           PDO1-076]
          Length = 207

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 67/164 (40%), Gaps = 58/164 (35%)

Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--------------- 257
           ++  S S D G  WS  +PT L N NS I  + LK+G + + YN                
Sbjct: 38  QILYSWSDDDGRNWSPPQPTVLPNNNSSIQAIDLKNGSIAIVYNHSNAESSDQRRLSLYD 97

Query: 258 -------------------------------VSRGVLKVALSKDDGDSWHDALTL----- 281
                                          V R  L +A S DDG S+ + + L     
Sbjct: 98  EIEGEGGGEAVDEQAEPATGGTYDGPRAVWGVPRAPLSLAFSYDDGVSFLEHVDLDTGDG 157

Query: 282 -------EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
                  +++L  EFSYP++I+  DG++H+ YTY R  IK+V +
Sbjct: 158 YCLSNNSQQSLNREFSYPSIIEGPDGTLHVAYTYFRRAIKYVRI 201


>gi|420446924|ref|ZP_14945819.1| hypothetical protein HPHPH43_0612 [Helicobacter pylori Hp H-43]
 gi|393064634|gb|EJB65469.1| hypothetical protein HPHPH43_0612 [Helicobacter pylori Hp H-43]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHEDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420427208|ref|ZP_14926253.1| hypothetical protein HPHPA9_1075 [Helicobacter pylori Hp A-9]
 gi|393042141|gb|EJB43152.1| hypothetical protein HPHPA9_1075 [Helicobacter pylori Hp A-9]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIVLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G    G  +  ++   +   + +   +       +  PN    
Sbjct: 180 FLNLSHLIKNKPLSTTDG----GFVLPLYHELATQYPLLLKFDKQ-NNPKELLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|171909756|ref|ZP_02925226.1| hypothetical protein VspiD_01250 [Verrucomicrobium spinosum DSM
           4136]
          Length = 391

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 103/232 (44%), Gaps = 46/232 (19%)

Query: 110 CMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV-- 167
           C+ RS + G+TWS     P   +  S + P  L++G LL          W    ++ +  
Sbjct: 174 CL-RSIDGGITWS----TPIKTIVNSPHGPTQLKDGRLLYAGK----QLWSEDKKIGICE 224

Query: 168 --DAGRSWRKYGPI------YIPN--ESLSVIQPVPFHTANRTLRVLMRSFNG--IGRVC 215
             D G +W+    I       +PN    L  I+      A+ T+ V +R+ N    G   
Sbjct: 225 SKDDGLTWQWLSEIPAREGDAVPNGYHELHAIE-----AADGTIIVQIRNHNKQHAGETL 279

Query: 216 MSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL-KVAL 267
            +ES DGG TWS       +  P+ LL         KL+DGRLL+ Y    +    +V +
Sbjct: 280 QTESSDGGKTWSTPHSIGVWGLPSHLL---------KLRDGRLLMTYGHRRKPFGNQVRV 330

Query: 268 SKDDGDSWHDALTL-EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SKD+G +W +A  L E+ +  +  YP+ ++  DGS    +  +  + K  VL
Sbjct: 331 SKDNGKTWGEASILSEDGIGGDLGYPSTVELGDGSFLSVWYESMKEPKKAVL 382


>gi|421718196|ref|ZP_16157495.1| BNR repeat-like domain protein [Helicobacter pylori R038b]
 gi|407221462|gb|EKE91266.1| BNR repeat-like domain protein [Helicobacter pylori R038b]
          Length = 347

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 135/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 37  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFTILTKEDLSHHS 96

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 97  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPVHFKFARKLSLSP 154

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 155 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 209

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 266 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 321

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335


>gi|420404284|ref|ZP_14903467.1| hypothetical protein HPCPY6261_1462 [Helicobacter pylori CPY6261]
 gi|393018163|gb|EJB19314.1| hypothetical protein HPCPY6261_1462 [Helicobacter pylori CPY6261]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCALMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIS 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420497543|ref|ZP_14996103.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25]
 gi|420527849|ref|ZP_15026243.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25c]
 gi|393113822|gb|EJC14340.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25]
 gi|393134975|gb|EJC35384.1| BNR/Asp-box repeat protein [Helicobacter pylori Hp P-25c]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      + E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|385249100|ref|YP_005777319.1| hypothetical protein HPF57_0605 [Helicobacter pylori F57]
 gi|317181895|dbj|BAJ59679.1| hypothetical protein HPF57_0605 [Helicobacter pylori F57]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L  YF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASALISLPNDNLLSTYFSGTKEGARDVKISANLFDSKINRWSKAFVILTKEELSHYS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|385226812|ref|YP_005786736.1| hypothetical protein HPSNT_02980 [Helicobacter pylori SNT49]
 gi|344331725|gb|AEN16755.1| hypothetical protein HPSNT_02980 [Helicobacter pylori SNT49]
          Length = 372

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I     ++  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTREDLSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KN P+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N+    + +  + LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNSSDSSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|397665554|ref|YP_006507092.1| hypothetical protein LPO_3264 [Legionella pneumophila subsp.
           pneumophila]
 gi|395128965|emb|CCD07186.1| conserved protein of unknown function [Legionella pneumophila
           subsp. pneumophila]
          Length = 383

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/316 (23%), Positives = 127/316 (40%), Gaps = 32/316 (10%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGR--WQSPIIADEEPNV--- 79
           K  H++T+ +VD  H L  YF GS EG PDVKI+   +   R  W +P I      +   
Sbjct: 64  KKAHSATLTKVDDSHLLAMYFAGSREGGPDVKIYSSIYDIHRNKWSAPKIVLTREKLCKN 123

Query: 80  ------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGC---MKRSYNKGVTWSEREQLPPG 130
                  + NPV++    NG L  + +      W+        S +KG +++ +  L   
Sbjct: 124 SHQLIRKLGNPVIYN--QNGKLHLFVVATAFAGWAASKIYHFESLDKGKSFNYKATL--- 178

Query: 131 ILGPSKNKPILLENG---LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
            L P  N   L+      L+  G  + +++   S   + +   +       I I N    
Sbjct: 179 YLSPFFNLSTLIRASPLPLVDGGFFLPAYHELESKYPLMLRFDKKGSYTESIQI-NHIGG 237

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESC-DGGLTWSYAKPTQLLNPNSGIDGVKL 246
            +QP     +N       R+ N I      + C +GGL+W   + T + N N+ I+    
Sbjct: 238 QLQPSIIPLSNLNCLAGFRNANKIDHWMRIQYCQEGGLSWLDPQKTNIKNMNNSINLFNF 297

Query: 247 KDGRLLLAYNTVSRGVLK--VALSKDDGD----SWHDALTLEENLAMEFSYPAVIQASDG 300
            +  + L +N   +  +K  + LS+ +      S+   L   E      +YP+ +    G
Sbjct: 298 -NSEVFLVHNRTHKNNIKGQLVLSRMENSTGIFSYLFTLDTGEINNKSVTYPSTL-VDRG 355

Query: 301 SVHITYTYNRTQIKHV 316
            V I Y  N  ++ H+
Sbjct: 356 YVDIIYDKNANEVIHI 371


>gi|15645205|ref|NP_207375.1| hypothetical protein HP0580 [Helicobacter pylori 26695]
 gi|410023813|ref|YP_006893066.1| hypothetical protein C695_02995 [Helicobacter pylori Rif1]
 gi|410501580|ref|YP_006936107.1| hypothetical protein C730_02995 [Helicobacter pylori Rif2]
 gi|410682100|ref|YP_006934502.1| hypothetical protein C694_02990 [Helicobacter pylori 26695]
 gi|419417467|ref|ZP_13957901.1| hypothetical protein HP79_02214 [Helicobacter pylori P79]
 gi|2313698|gb|AAD07646.1| predicted coding region HP0580 [Helicobacter pylori 26695]
 gi|384372982|gb|EIE28534.1| hypothetical protein HP79_02214 [Helicobacter pylori P79]
 gi|409893741|gb|AFV41799.1| hypothetical protein C694_02990 [Helicobacter pylori 26695]
 gi|409895470|gb|AFV43392.1| hypothetical protein C695_02995 [Helicobacter pylori Rif1]
 gi|409897131|gb|AFV44985.1| hypothetical protein C730_02995 [Helicobacter pylori Rif2]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|425790554|ref|YP_007018471.1| hypothetical protein HPAKL86_01960 [Helicobacter pylori Aklavik86]
 gi|425628869|gb|AFX89409.1| hypothetical protein HPAKL86_01960 [Helicobacter pylori Aklavik86]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSYYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPILTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NQELYLIHNPSDLSMRRKELLLSKLENSNSFRTLKILDK--ANEVSYPSY-SFNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|383749753|ref|YP_005424856.1| hypothetical protein HPELS_03670 [Helicobacter pylori ELS37]
 gi|380874499|gb|AFF20280.1| hypothetical protein HPELS_03670 [Helicobacter pylori ELS37]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 132/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALESIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|384896119|ref|YP_005770108.1| hypothetical protein HMPREF4655_21021 [Helicobacter pylori 35A]
 gi|315586735|gb|ADU41116.1| conserved hypothetical protein [Helicobacter pylori 35A]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    +   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  N +LLF  +G  +  W+          +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNY 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCALMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420501367|ref|ZP_14999911.1| hypothetical protein HPHPP30_1200 [Helicobacter pylori Hp P-30]
 gi|393150173|gb|EJC50481.1| hypothetical protein HPHPP30_1200 [Helicobacter pylori Hp P-30]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 133/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420406765|ref|ZP_14905935.1| hypothetical protein HPCPY6311_0608 [Helicobacter pylori CPY6311]
 gi|393023602|gb|EJB24716.1| hypothetical protein HPCPY6311_0608 [Helicobacter pylori CPY6311]
          Length = 372

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|384899112|ref|YP_005774492.1| hypothetical protein HPF30_0747 [Helicobacter pylori F30]
 gi|317179056|dbj|BAJ56844.1| hypothetical protein HPF30_0747 [Helicobacter pylori F30]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 134/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSYYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIRFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN   +
Sbjct: 180 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNN 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNT--VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSGLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420438606|ref|ZP_14937580.1| hypothetical protein HPHPH29_0669 [Helicobacter pylori Hp H-29]
 gi|393056206|gb|EJB57118.1| hypothetical protein HPHPH29_0669 [Helicobacter pylori Hp H-29]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SAHASSLIGLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +    LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420494076|ref|ZP_14992645.1| hypothetical protein HPHPP16_0724 [Helicobacter pylori Hp P-16]
 gi|393110877|gb|EJC11401.1| hypothetical protein HPHPP16_0724 [Helicobacter pylori Hp P-16]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLSTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420425260|ref|ZP_14924321.1| hypothetical protein HPHPA5_0789 [Helicobacter pylori Hp A-5]
 gi|393041678|gb|EJB42692.1| hypothetical protein HPHPA5_0789 [Helicobacter pylori Hp A-5]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L  YF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLIGLPNDNLLSTYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPMLTNLKNLNDALNLID 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|373849440|ref|ZP_09592241.1| hypothetical protein Opit5DRAFT_0295 [Opitutaceae bacterium TAV5]
 gi|372475605|gb|EHP35614.1| hypothetical protein Opit5DRAFT_0295 [Opitutaceae bacterium TAV5]
          Length = 386

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 98/217 (45%), Gaps = 30/217 (13%)

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAG 170
           MKRS + G+TWS R ++P    G + + P+ L +G +   ++ ++W        V+ D G
Sbjct: 180 MKRSTDGGLTWSPRYKVP----GYNPHGPVALRDGRMFFAAAHDAW--------VSEDDG 227

Query: 171 RSWRKYGPIYI-PNESLSVIQPVPFHTANRTLRVLMRSFNGIG-----RVCMSESCDGGL 224
            +W  +  +   P E  S+        A+  L V +R+          R+  S S DGG 
Sbjct: 228 LTWTLFSKLPTGPGEMHSI------EAADGALIVHVRNKVKTAKGTRQRIIQSRSEDGGK 281

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSWHDALTLEE 283
           TW   +      P+     V+L DG L+  Y        ++  LS+D G SW +   + +
Sbjct: 282 TWDAPRVVAEGYPSH---LVRLADGTLITTYGWRQAPFGIRGKLSRDHGRSWSEEFVVTD 338

Query: 284 NLA-MEFSYPAVIQASDGSVHITYTYNRTQIKHVVLL 319
           + A  +  YP+ +Q  DGS+ +T  Y   +  H+  L
Sbjct: 339 DGANWDLGYPSTVQLRDGSL-LTVWYEAPEGSHLAGL 374


>gi|444374201|ref|ZP_21173508.1| hypothetical protein C528_01385 [Helicobacter pylori A45]
 gi|443621425|gb|ELT81864.1| hypothetical protein C528_01385 [Helicobacter pylori A45]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 80/316 (25%), Positives = 133/316 (42%), Gaps = 42/316 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASALINLPNDNLLNAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L ++ +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNES 185
              +    KNKP+   +G    G  +  ++   +   + +  D   + R+   +  PN  
Sbjct: 180 FLNLSHLIKNKPLNTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNTL 232

Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +QP   PF    +   V+    +      M E+C     W     T L N N  ++ 
Sbjct: 233 NHQLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWKKPTLTNLKNLNDALNL 288

Query: 244 VKLKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           + L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +  
Sbjct: 289 INLNE-ELYLIHNPSDLSLRRKELWLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPH 344

Query: 301 SVHITYTYNRTQIKHV 316
            + I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360


>gi|386755606|ref|YP_006228823.1| hypothetical protein HPPC18_02785 [Helicobacter pylori PeCan18]
 gi|384561864|gb|AFI02330.1| hypothetical protein HPPC18_02785 [Helicobacter pylori PeCan18]
          Length = 372

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|420466880|ref|ZP_14965637.1| hypothetical protein HPHPH9_0565 [Helicobacter pylori Hp H-9]
 gi|393085278|gb|EJB85966.1| hypothetical protein HPHPH9_0565 [Helicobacter pylori Hp H-9]
          Length = 372

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDGKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESTLEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|308184363|ref|YP_003928496.1| hypothetical protein HPSJM_02935 [Helicobacter pylori SJM180]
 gi|308060283|gb|ADO02179.1| hypothetical protein HPSJM_02935 [Helicobacter pylori SJM180]
          Length = 340

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 90/345 (26%), Positives = 146/345 (42%), Gaps = 53/345 (15%)

Query: 5   CGIKGLVAE------EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW 58
           C IK L+ +      + T P  +    S HAS+++ +   + L AYF G+ EGA DVKI 
Sbjct: 4   CIIKPLLLKPPPYFTQLTIPKPNDAL-SVHASSLIGLPNDNLLSAYFSGTKEGARDVKIS 62

Query: 59  LQTF--KDGRWQSP--IIADEE--PNVPMW-----NPVLFKLPSNGLLLFYKIGQEVQKW 107
              F  K  RW     I+  EE   NV  +     NP+LF L  + +LLF  +G  +  W
Sbjct: 63  ANLFDSKTNRWSEAFTILTKEELSKNVHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGW 120

Query: 108 SGC-----------MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESW 156
           +             ++  + + ++ S    L   I    KNKP+   +G  +     E  
Sbjct: 121 ATSKIYQLESALEPIRFKFVRKLSLSPFLNLSHLI----KNKPLSTTDGGFMLPLYHELA 176

Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRV 214
             +   ++   D   + R+   +  PN     +QP   PF    +   V+    +     
Sbjct: 177 TQYPLLLK--FDQQNNPRE---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDS 227

Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA--LSK-DD 271
            M E+C     W     T L N N  ++ + L +  L L +N     + +    LSK ++
Sbjct: 228 LMLETCKTPTAWQKPMLTNLKNLNDALNLINL-NKELYLIHNPSDLSLRRKELWLSKLEN 286

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
            +S+     L++  A E SYP+    +   + I YTYNR+ IKH+
Sbjct: 287 PNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIVYTYNRSHIKHI 328


>gi|421716657|ref|ZP_16155966.1| BNR repeat-like domain protein [Helicobacter pylori R037c]
 gi|407219878|gb|EKE89689.1| BNR repeat-like domain protein [Helicobacter pylori R037c]
          Length = 372

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +    LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLETSNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|387908103|ref|YP_006338437.1| hypothetical protein MWE_0937 [Helicobacter pylori XZ274]
 gi|387573038|gb|AFJ81746.1| hypothetical protein MWE_0937 [Helicobacter pylori XZ274]
          Length = 358

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HA+ ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 48  SVHANALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSYYS 107

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 108 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLNP 165

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 166 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 220

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 221 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 276

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 277 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFI 332

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 333 DIVYTYNRSHIKHI 346


>gi|412985115|emb|CCO20140.1| BNR/Asp-box repeat protein [Bathycoccus prasinos]
          Length = 863

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/91 (38%), Positives = 51/91 (56%), Gaps = 9/91 (9%)

Query: 202 RVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR 260
           R L++ F    G++  SES DGG++W+ A P +L NPNS I+ + L DG+  +AYN+  R
Sbjct: 665 RELVQFFRTATGKIYRSESDDGGISWTSAGPIELPNPNSKINAIVLSDGQTAIAYNSHDR 724

Query: 261 GV--------LKVALSKDDGDSWHDALTLEE 283
                     L VA+S D G +W     +EE
Sbjct: 725 QYGLKQMRVNLTVAVSPDRGKNWAVLAQIEE 755


>gi|343085113|ref|YP_004774408.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
 gi|342353647|gb|AEL26177.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
           marinum DSM 745]
          Length = 372

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 66/271 (24%), Positives = 123/271 (45%), Gaps = 34/271 (12%)

Query: 60  QTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV 119
           QT+ D   Q  +I   +  + + +  L +L +  + LFY     +      M+ S ++G 
Sbjct: 85  QTWTD---QDKVIVPNDGKMNVMSVSLLRLQNGDIALFYARKNSLSDCIPLMRISKDEGE 141

Query: 120 TWSE-------REQLPPGILGPSKNKPILLENGLLLC------GSSVESWNSWGS-WMEV 165
           TWSE       RE    G    + N+ + L+NG L+        S  E +N+ G  +   
Sbjct: 142 TWSEPSPCITDRE----GYFVLNNNRVLQLDNGRLIMPVALHRTSEGEDFNNNGRLFTYF 197

Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
           + D G +W+    +  P + +   +P      N  L +++R+ +G+     S+  D G  
Sbjct: 198 SDDNGATWKSGREVKNPKKVI-FQEPGLIALKNGKLMMIIRTDDGVQYAAFSK--DLGEK 254

Query: 226 WSYAKPTQLLNPNSGIDGVKL-KDGRLLLAYNT--VSRGV-------LKVALSKDDGDSW 275
           W+ AKPT++ +P S     ++   G L++ +N   V  G+       L VA+SKD+G +W
Sbjct: 255 WTPAKPTKIASPLSPATIARIPSTGDLIMVWNNNGVKEGLYKGKRTPLNVAISKDEGKTW 314

Query: 276 HDALTLEENLAMEFSYPAVIQASDGSVHITY 306
                LE++    + Y A+   ++  + ++Y
Sbjct: 315 EKEKVLEDDPDGWYCYTAIHFVNEDQMLLSY 345


>gi|421714835|ref|ZP_16154153.1| BNR repeat-like domain protein [Helicobacter pylori R036d]
 gi|407215689|gb|EKE85527.1| BNR repeat-like domain protein [Helicobacter pylori R036d]
          Length = 372

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 135/325 (41%), Gaps = 37/325 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP- 70
           +FT P       S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    
Sbjct: 51  QFTIPKPDDAL-SVHASALISLPNDNLLSAYFSGTKEGARDVKISANFFNNKTNRWSEAF 109

Query: 71  -IIADEEPN-------VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
            I+  E+ +         + NP+LF L  + +LLF  +G  +  W+      +   +   
Sbjct: 110 TILTKEDLSKNAHEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKVYQFESALEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
              ++ +  L P   +    KNKP+   +G  +     E    +   ++   D   + R+
Sbjct: 168 RFKFARKLSLSPFLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE 225

Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
              +  PN     +QP   PF    +   ++           M E+C     W     T 
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAIMAFRNYSFKDSLMLETCKTPTIWQKPMLTN 278

Query: 234 LLNPNSGIDGVKLKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSY 291
           L N N  ++ + L +  L L +N   +S    ++ LSK +  +    L + +  A E SY
Sbjct: 279 LKNLNDALNLINLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSY 336

Query: 292 PAVIQASDGSVHITYTYNRTQIKHV 316
           P+    +   + I YTYNR+ IKH+
Sbjct: 337 PSY-SLNPHFIDIVYTYNRSHIKHI 360


>gi|386754302|ref|YP_006227520.1| hypothetical protein HPSH112_04130 [Helicobacter pylori Shi112]
 gi|384560560|gb|AFI01027.1| hypothetical protein HPSH112_04130 [Helicobacter pylori Shi112]
          Length = 372

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 39/326 (11%)

Query: 14  EFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPI 71
           +FT P       S HAS ++ +   + L AYF G+ EGA DV I    F  K  RW    
Sbjct: 51  QFTIPKPDGAL-SVHASALISLPNDNLLSAYFSGTKEGARDVTISANLFDSKINRWSEAF 109

Query: 72  IADEEPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV--- 119
           I   +  +  +         NP+LF L  + +LLF  +G  +  W+          +   
Sbjct: 110 IILTKEELSYYSHEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPI 167

Query: 120 --TWSEREQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRK 175
              + ++  L P   +    +NKP+   +G  +     E    +   ++   D   + R+
Sbjct: 168 HFKFVQKLSLSPFLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE 225

Query: 176 YGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
              +  PN     +QP   PF    +   V+    +      M E+C     W     T 
Sbjct: 226 ---LLRPNALNHQLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTN 278

Query: 234 LLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFS 290
           L N N  ++ + L +  L L +N     + +  + LSK ++ +S+     L++  A E S
Sbjct: 279 LKNLNDALNLINL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVS 335

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
           YP+    +   + I YTYNR+ IKH+
Sbjct: 336 YPSY-SLNSHFIDIVYTYNRSHIKHI 360


>gi|254428400|ref|ZP_05042107.1| hypothetical protein ADG881_1630 [Alcanivorax sp. DG881]
 gi|196194569|gb|EDX89528.1| hypothetical protein ADG881_1630 [Alcanivorax sp. DG881]
          Length = 425

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 80/368 (21%), Positives = 132/368 (35%), Gaps = 73/368 (19%)

Query: 19  ANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEE 76
           A+ A     HA+++VE+  G     +F G+ EGA DV I    F  + G W    ++   
Sbjct: 44  ASDADTSMVHAASMVELPDGRLRAFWFAGTREGAADVSINSAIFDPESGAWSDETVSVTR 103

Query: 77  PNVPM-WNPVLFKL-------PSNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSERE 125
             +   W   + KL         +G +  + +      W+     +  S + G  W    
Sbjct: 104 EQISRGWGRHIRKLGNAVPVLDDDGHMRLFVVAVSFGGWAASRLVVMESSDLGANWDFDT 163

Query: 126 QLPP----GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
            L       I    K  PI   +G +      E    +G  + +  D     R  G   I
Sbjct: 164 ALVATPFLNISTLVKTPPIHYSDGSIGLPVYHEMIGKFGEILRLDQDN----RILGKARI 219

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFN--GIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
                  IQP+     +      +R+ N    G +  S S DGG  WS  +   + NP+S
Sbjct: 220 -GHGRKAIQPLVLVDDSNHAVSFLRNENEPNNGYLYQSSSSDGGWRWSPLEYAPVDNPSS 278

Query: 240 GIDGVKLKDGRLLLAY--NTVSRGVLKVALSKDDGDSWHD-------------------- 277
            + GV +  G  L+A   N   R  L +  ++D G  W D                    
Sbjct: 279 AVGGVAMGPGHWLVATNCNREERDDLCIRETRDAGQHWEDRWFFHNREQWRGNALKPDAF 338

Query: 278 -----------ALTLEENLAME----------------FSYPAVIQASDGSVHITYTYNR 310
                      A+  ++ L +E                + YP +++  +G +H+ YT+N+
Sbjct: 339 ADLIDDEFVGKAIPEDQRLFLERVKANKCQPHRGCEFQYDYPYMVRTRNGDLHLLYTWNK 398

Query: 311 TQIKHVVL 318
           T I+H  L
Sbjct: 399 TAIRHAWL 406


>gi|116071744|ref|ZP_01469012.1| hypothetical protein BL107_06329 [Synechococcus sp. BL107]
 gi|116065367|gb|EAU71125.1| hypothetical protein BL107_06329 [Synechococcus sp. BL107]
          Length = 406

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 83/360 (23%), Positives = 135/360 (37%), Gaps = 75/360 (20%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMWNPV 85
           HA  +  +        +  G  EG   + ++L     + G W  P    ++      NP+
Sbjct: 41  HAPQLCWIRPDLLGCVWMAGGGEGTAGMSVYLSLLSSEQGSWGVPQCISQDLERSEQNPL 100

Query: 86  LFKLPSNGLLLFYKIGQEVQ----------------KWSGCMKRSYNK------------ 117
           LF   S+G+L      Q V+                +W+  ++    K            
Sbjct: 101 LFL--SDGVLHLIHSAQRVRDPSDRTWAEAQSSFSMQWTAVLRHQTLKIDVIDLGRPESW 158

Query: 118 -GVTWSEREQLPPGILGPS--KNKPILLENG--LLLCGSSVESWNSWGSWMEVTVDAGRS 172
            G  WS  + L   I  P+  ++ P  L +G  LL    S+E   S+G      +    +
Sbjct: 159 EGQAWSAAKDL---IETPAFCRHPPHRLADGTWLLPIYRSLEQGGSFGHDHSEVLQLDAA 215

Query: 173 WRKYG-PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKP 231
               G P  +P  +  V   +       TL   +RS       C S S D GL+WS  + 
Sbjct: 216 GVPMGSPFAVPESTGRVHGSIVPSADGTTLHQFLRSRLADWVYC-STSEDAGLSWSSPQS 274

Query: 232 TQLLNPNSGIDGVKLKDGRLLLAYNTV----------------------SRGVLKVALSK 269
           T L N NS I   +L+ GRL L +N                        +R  L +A+S 
Sbjct: 275 TDLPNNNSSIQACRLESGRLALIFNRFGFLPDPNDPDASSAWGEARWPRTRWPLSIAISA 334

Query: 270 DDGDSW---HDALT-------LEENLAMEFSYPAVIQASDGSVHITYTY-NRTQIKHVVL 318
           DDG SW    D  T       +  +L  + +YP +++   G +H+ Y++  R  I+++ L
Sbjct: 335 DDGQSWPWIRDIDTGFGFCGPMNWDLNGQLAYPTLLEGRPGELHVAYSWAERQAIRYLSL 394


>gi|420398787|ref|ZP_14897998.1| hypothetical protein HPCPY1962_0970 [Helicobacter pylori CPY1962]
 gi|393013016|gb|EJB14193.1| hypothetical protein HPCPY1962_0970 [Helicobacter pylori CPY1962]
          Length = 372

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 79/316 (25%), Positives = 132/316 (41%), Gaps = 42/316 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVT-------WSEREQL 127
                   NP+LF L  + +LLF  +G  +  W+    + Y    T       +  +  L
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATS--KIYQLESTLEPIHFKFVRKLSL 177

Query: 128 PP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES 185
            P   +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN  
Sbjct: 178 SPFLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNAL 232

Query: 186 LSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG 243
              +QP   PF    +   ++    +      M E+C     W     T L N N  ++ 
Sbjct: 233 NYQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNL 288

Query: 244 VKLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDG 300
           + L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +  
Sbjct: 289 INL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSH 344

Query: 301 SVHITYTYNRTQIKHV 316
            + I YTYNR+ IKH+
Sbjct: 345 FIDIVYTYNRSHIKHI 360


>gi|406834050|ref|ZP_11093644.1| hypothetical protein SpalD1_20489 [Schlesneria paludicola DSM
           18645]
          Length = 400

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 103/216 (47%), Gaps = 33/216 (15%)

Query: 102 QEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENG-LLLCGSSVESWNS-W 159
           +EV  W   M RS + GVTWS R Q+P      S + P+ L +G LL  G+++ + +   
Sbjct: 178 REVGTW---MWRSTDGGVTWSARYQVPCN----SPHGPVALASGRLLYIGAALWADDRVV 230

Query: 160 GSWMEVTVDAGRSWR---KYGPIYIPN-ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVC 215
           G+W   + D G +W+   ++ P      ++   +  V   +     ++   +        
Sbjct: 231 GTWE--STDDGVTWQLLSRFTPRPTDQTKNYHELHGVECVSGKIIAQIRNHNTQNAHETL 288

Query: 216 MSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA-L 267
            SES DGG TWS       +  P+ LL         +LKDGRLL++Y          A +
Sbjct: 289 QSESTDGGKTWSIPHSIGVWGLPSHLL---------RLKDGRLLMSYGYRRAPYGNQARM 339

Query: 268 SKDDGDSWHDALTL-EENLAMEFSYPAVIQASDGSV 302
           S+D+G +W D +T+ ++ +  +  YP+ ++  DG++
Sbjct: 340 SQDEGATWSDPITISDDGIRSDLGYPSTVECDDGTL 375


>gi|385215823|ref|YP_005775779.1| hypothetical protein HPF32_0555 [Helicobacter pylori F32]
 gi|317180351|dbj|BAJ58137.1| hypothetical protein HPF32_0555 [Helicobacter pylori F32]
          Length = 372

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KN P+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ANEVSYPSY-SLNSHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|384887761|ref|YP_005762272.1| hypothetical protein HPKB_0764 [Helicobacter pylori 52]
 gi|261839591|gb|ACX99356.1| hypothetical protein HPKB_0764 [Helicobacter pylori 52]
          Length = 372

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 128/313 (40%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI +  F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISVNLFDSKINRWSEAFIILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R    +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRD---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +    +    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAAMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420458413|ref|ZP_14957223.1| hypothetical protein HPHPA26_0579 [Helicobacter pylori Hp A-26]
 gi|393075934|gb|EJB76688.1| hypothetical protein HPHPA26_0579 [Helicobacter pylori Hp A-26]
          Length = 372

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW        + ++  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFTILTKEDLSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KN P+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNNPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDNLMLETCKTPTIWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNSHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|384897290|ref|YP_005772718.1| hypothetical protein HPLT_02885 [Helicobacter pylori Lithuania75]
 gi|317012395|gb|ADU83003.1| hypothetical protein HPLT_02885 [Helicobacter pylori Lithuania75]
          Length = 310

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/310 (25%), Positives = 129/310 (41%), Gaps = 34/310 (10%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------ADEE 76
           HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I            E
Sbjct: 2   HASALISLPNDNLLSAYFSGTKEGAKDVKISANLFDSKTNRWSEAFILLTKEELSHHSHE 61

Query: 77  PNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP-- 129
               + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P  
Sbjct: 62  YIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSPFL 119

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVI 189
            +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN     +
Sbjct: 120 NLSHLIKNKPLSTTDGGFVLPLYHELATQYTLLLK--FDKNNNPRE---LLRPNALNHQL 174

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDG 249
           QP    T  +   ++    +      M E+C     W     T L N N  ++ + L + 
Sbjct: 175 QPSL--TPLKDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLINLNE- 231

Query: 250 RLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
            L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   + I Y
Sbjct: 232 ELYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFIDIVY 288

Query: 307 TYNRTQIKHV 316
           TYNR+ IKH+
Sbjct: 289 TYNRSHIKHI 298


>gi|384892614|ref|YP_005766707.1| hypothetical protein HPCU_03165 [Helicobacter pylori Cuz20]
 gi|308061911|gb|ADO03799.1| hypothetical protein HPCU_03165 [Helicobacter pylori Cuz20]
          Length = 337

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 130/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DV I    F  K  RW    I   +  +  + 
Sbjct: 27  SVHASALISLPNDNLLSAYFSGTKEGARDVTISANLFDSKINRWSEAFIILTKEELSYYS 86

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 87  HEYIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 144

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 145 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 199

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N N  ++ + 
Sbjct: 200 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 255

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 256 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 311

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 312 DIVYTYNRSHIKHI 325


>gi|308182736|ref|YP_003926863.1| hypothetical protein HPPC_02875 [Helicobacter pylori PeCan4]
 gi|308064921|gb|ADO06813.1| hypothetical protein HPPC_02875 [Helicobacter pylori PeCan4]
          Length = 310

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 132/313 (42%), Gaps = 40/313 (12%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV---- 79
           HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +     
Sbjct: 2   HASALISLPNDNLLSAYFSGTKEGARDVKISANLFDGKINRWSEAFIILTKEELSCYSHE 61

Query: 80  ---PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP-- 129
               + NP+LF L  + +LLF  +G  +  W+          +      + ++  L P  
Sbjct: 62  YIKKLGNPLLF-LHGDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSPFL 119

Query: 130 GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNESLS 187
            +    +NKP+   +G    G  +  ++   +   + +  D   + R+   +  PN    
Sbjct: 120 NLSHLIRNKPLNTTDG----GFMLPLYHELATQYPLLLKFDKHNNPRE---LLRPNALNH 172

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 173 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 228

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +  + LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 229 L-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKILKILDK-ADEVSYPSY-SLNSHFID 285

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 286 IVYTYNRSHIKHI 298


>gi|237751695|ref|ZP_04582175.1| neuraminidase [Helicobacter bilis ATCC 43879]
 gi|229373061|gb|EEO23452.1| neuraminidase [Helicobacter bilis ATCC 43879]
          Length = 398

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 127/329 (38%), Gaps = 45/329 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD-----GRWQSPIIADEEPNV- 79
           S HAS IV  D    +  ++ G+ EGA DVKI+ Q+F D      +W  P++      + 
Sbjct: 53  SAHASAIVAKD-SKLMTLFYAGTREGAFDVKIY-QSFLDTKTLDSKWSEPLVLLTPERLT 110

Query: 80  --------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGI 131
                    + NP++FK   + + LF  +G  +  W+      +      +    +    
Sbjct: 111 SLSGRYVKKLGNPIVFKDSRDRVHLFV-VGVSLGGWATSRIYQFYFSDDLATIHYVRELH 169

Query: 132 LGPSKNKPIL-------LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE 184
           L P  N   L       LENG  +   S E W+ +   +    D   +      I   N 
Sbjct: 170 LNPFGNYSHLVRTPALPLENGGFILAFSHEMWHKFP--LIAFFDENGNMLTTKAI---NH 224

Query: 185 SLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDG-GLTWSYAKPTQLLNPNSGIDG 243
             + +QP       +      R+ N        ++C+    TW   + + + N +S I  
Sbjct: 225 LKAQLQPSIIALNEKECAAFYRTNNRYNTNTYYQTCENYAKTWHDIETSNMKNYDSSILL 284

Query: 244 VKLKDGRLLLAYNTVSRGVLKVALSKDDGDS---WHDALTLEENLAM-----------EF 289
           VKL D  +L+  +     + +    K D  S   ++ A  ++                E 
Sbjct: 285 VKLGDEVVLITNDGKHNPIWQNDTKKGDRQSLSLYYLADKIKGQFVYIDTIDTAKKGGEV 344

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           SYPA I   +  +HITYTYNR  IKH  L
Sbjct: 345 SYPAAI-LDNNLLHITYTYNRKTIKHTAL 372


>gi|207091785|ref|ZP_03239572.1| hypothetical protein HpylHP_01328 [Helicobacter pylori
           HPKX_438_AG0C1]
          Length = 372

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLNAYFSGTKEGARDVKISANLFDSKINRWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      + E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELLLSKLEKSNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420404615|ref|ZP_14903795.1| hypothetical protein HPCPY6271_0106 [Helicobacter pylori CPY6271]
 gi|393024485|gb|EJB25595.1| hypothetical protein HPCPY6271_0106 [Helicobacter pylori CPY6271]
          Length = 372

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 40/315 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    +   +  +  + 
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKINRWSEAFVILTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKHNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFN-GIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
            +QP   PF         LM   N       M E+C     W     T L N N  ++ +
Sbjct: 235 QLQPSLTPFKDC-----ALMAFRNYSFKDNLMLETCKTPTAWQKPMLTNLKNLNDALNLI 289

Query: 245 KLKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
            L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   
Sbjct: 290 NL-NKELYLIHNPSDSSLRRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHF 345

Query: 302 VHITYTYNRTQIKHV 316
           + I YTYNR+ IKH+
Sbjct: 346 IDIVYTYNRSHIKHI 360


>gi|412988631|emb|CCO17967.1| sialidase [Bathycoccus prasinos]
          Length = 516

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 82/293 (27%), Positives = 119/293 (40%), Gaps = 65/293 (22%)

Query: 60  QTFKD-GRWQSPIIADEEPNVPMWNPVLFKLP--SNGLLLFYKIGQEVQKWSGCMKR--- 113
           + F D G +  P  A E  N  +W PVLF  P  ++ + LFY       + +GC  R   
Sbjct: 192 KIFADVGSYSVPTDAGE--NGAVWGPVLFANPKKNHEIWLFYS----QSRTNGCGSRVPG 245

Query: 114 -SYNKGV------------TWSEREQL------PPGILGPSKNKPILLENG--------- 145
             Y  G              W+  E++        GI   + NKPI+L            
Sbjct: 246 MDYAPGGDILAKKFNVVSGAWTHTEKIVLSQDFDDGIPKVTANKPIVLRRSGRWVLPYWN 305

Query: 146 --LLLCGSSVESWN------SWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTA 197
              +L   SV            G+ + ++ D G SW     I   N  L     V   T 
Sbjct: 306 ERAMLESPSVNPEKCSKLNGKEGAGVLLSDDHGESWYPSETIRGRNTWLIENAVVELETD 365

Query: 198 NRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL-NPNSGIDGVKLK-DGRLLLAY 255
           +  L V        G V  SES DGG TW+ A   + + NPNS ID ++L+  G L LA+
Sbjct: 366 DHLLMVFR---TKAGYVFRSESLDGGQTWTDASAMKDVPNPNSKIDMIRLQPSGLLALAF 422

Query: 256 N----------TVSRGVLKVALSKDDGDSWHDALTLEENL--AMEFSYPAVIQ 296
           N          +  R  LKVA+S +DG++W     LE ++  ++   YP ++Q
Sbjct: 423 NDHAKHPAKGCSRCRTFLKVAVSSNDGNTWTRIRELEMDVDGSLRSHYPTLLQ 475


>gi|260220195|emb|CBA27491.1| hypothetical protein Csp_A02540 [Curvibacter putative symbiont of
           Hydra magnipapillata]
          Length = 348

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 55/113 (48%), Gaps = 3/113 (2%)

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV 262
            LMR     G+V    + DGG  W       L NP+S +    L   ++LLA+N    G 
Sbjct: 226 ALMRDERHNGKVTAVSTEDGGAHWQDLPDLALGNPDSAVAAFGLGPQQMLLAHNPTPEGR 285

Query: 263 LKVALSK-DDGDSWHDALTLEE-NLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
            K+ LS+  DG  W    TL + +   EFSYPA+  A D ++ I YT +R  +
Sbjct: 286 AKLELSRSQDGLQWTPQQTLRQGSEEAEFSYPAMAWADD-ALWIAYTVDRATL 337


>gi|149175559|ref|ZP_01854179.1| probable sialidase [Planctomyces maris DSM 8797]
 gi|148845544|gb|EDL59887.1| probable sialidase [Planctomyces maris DSM 8797]
          Length = 371

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/304 (24%), Positives = 135/304 (44%), Gaps = 37/304 (12%)

Query: 32  IVEVDKGHFLVAY---FGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV- 85
           ++ +D G  LV Y   +G +       ++      DG   W  P    E  N+   N + 
Sbjct: 53  LLTLDNGRLLVVYTQWYGATGNDHDPARLVEIHSDDGGKTWSEPQTVQE--NIGKMNVMS 110

Query: 86  --LFKLPSNGLLLFYKIGQEVQKWSGC-MKRSYNKGVTWSEREQLP---PGILGPSKNKP 139
             L K  S  +LL Y I  +  +++    K S ++G TWSE +QL     G++    +  
Sbjct: 111 ASLVKTTSGKILLTY-IRIDSNRFANLWFKESTDEGNTWSEPKQLSHGKKGLIFTVNSAA 169

Query: 140 ILLENGLLLCGS--SVESWNS---WGSWMEVTVDAGRSW-RKYGPIYIPNESLSVIQPVP 193
           I L++G +L  +  S  +W     + S+   T D G++W R    +  P      ++P  
Sbjct: 170 IRLKSGRILLAAFGSPSAWQKDEHFVSFSFYTDDEGKTWHRSANEVDCPLRG--AMEPEI 227

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRL 251
               +  + +L+R+     R+  + S DGG TWS A+ T +++P +   +     K+  L
Sbjct: 228 EQLNDGRILMLIRT--QTTRMYRAYSSDGGETWSPAEKTDIVHPEAPMLLQRAPGKNAPL 285

Query: 252 LLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           +L +N            R  L + LS D+G +W +   +E++    +SY A +   + S+
Sbjct: 286 ILIWNNAVVPGADHQGPRTPLTLGLSYDEGKTWENLTKIEDDPKGSYSY-ASMDFRNDSL 344

Query: 303 HITY 306
           H+ Y
Sbjct: 345 HLVY 348


>gi|384890956|ref|YP_005765089.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 908]
 gi|385223630|ref|YP_005783556.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 2017]
 gi|385231481|ref|YP_005791400.1| BNR/Asp-box repeat-containing protein [Helicobacter pylori 2018]
 gi|307637265|gb|ADN79715.1| BNR/Asp-box repeat protein [Helicobacter pylori 908]
 gi|325995858|gb|ADZ51263.1| BNR/Asp-box repeat protein [Helicobacter pylori 2018]
 gi|325997452|gb|ADZ49660.1| BNR/Asp-box repeat protein [Helicobacter pylori 2017]
          Length = 372

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 129/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K   W    I   +  +  + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNCWSEAFILLTKEELSHYS 121

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  N +LLF  +G  +  W+          +      ++++  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFAQKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    + E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLILETCKTPTAWQKPMLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N     + +    LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420450232|ref|ZP_14949096.1| hypothetical protein HPHPH45_0633 [Helicobacter pylori Hp H-45]
 gi|393068091|gb|EJB68896.1| hypothetical protein HPHPH45_0633 [Helicobacter pylori Hp H-45]
          Length = 310

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 132/315 (41%), Gaps = 42/315 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW-- 82
            HAS+++ +   + L  YF G+ EGA DVKI    F  K  RW    I   +  +  +  
Sbjct: 1   MHASSLISLPNDNLLSTYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHYSH 60

Query: 83  -------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP- 129
                  NP+LF L  + +LLF  +G  +  W+          +      +  +  L P 
Sbjct: 61  EYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSPF 118

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSW--MEVTVDAGRSWRKYGPIYIPNESL 186
             +    KNKP+   +G    G  +  ++   +   + +  D   + R+   +  PN   
Sbjct: 119 LNLSHLIKNKPLSTTDG----GFVLPLYHELATQYPLLLKFDQQNNPRE---LLRPNALN 171

Query: 187 SVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGV 244
             +QP   PF    +   ++    +      M E+C     W     T L N N  ++ +
Sbjct: 172 HQLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLI 227

Query: 245 KLKDGRLLLAYN--TVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
            L +  L L +N   +S    ++ LSK ++ +S+     L++  A E SYP+    +   
Sbjct: 228 NLNE-ELYLIHNPSDLSLRRKELLLSKLENPNSFKTLKVLDK--ANEVSYPSY-SLNPHF 283

Query: 302 VHITYTYNRTQIKHV 316
           + I YTYNR+ IKH+
Sbjct: 284 IDIVYTYNRSHIKHI 298


>gi|336453827|ref|YP_004608293.1| BNR/Asp-box repeat protein [Helicobacter bizzozeronii CIII-1]
 gi|335333854|emb|CCB80581.1| BNR/Asp-box repeat protein [Helicobacter bizzozeronii CIII-1]
          Length = 381

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 89/343 (25%), Positives = 135/343 (39%), Gaps = 72/343 (20%)

Query: 25  KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPII---------- 72
           KS HA+T+  +D  + + AYFGGS EG  DVKI+  L   K   W   +           
Sbjct: 53  KSMHAATLSVLDTDYLMGAYFGGSAEGVGDVKIYAHLYNLKTQEWGQALPLLSAPLLSQK 112

Query: 73  ADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWS--------GCMKR-----SYNKGV 119
           A E   + + NPVL+   ++  L  + +G  +  W+          +KR      +N+  
Sbjct: 113 AHEFVKI-LGNPVLYA--THNTLYLFVVGASLGGWATSKIYLLCAPLKRVLEQLQHNQSP 169

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS---WRKY 176
                + LP   L P  N   L+    L       S +  G  + +  +  +      K+
Sbjct: 170 QLDFIQTLP---LSPFLNISHLVRTSPL-------STDDGGFVLPIYQELAKKTPLLLKF 219

Query: 177 GP------IYIPNESLSVIQP--VPF-HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
            P      +  PN     +QP   P+ H A    R   R F+       +++C   L W 
Sbjct: 220 DPHARLESVIQPNLLKQQLQPSLAPYQHCALMAFRSYKR-FD-----LYTQTCQTPLKWD 273

Query: 228 YAKPTQLLNPNSGIDGVKL--KDGRLLLAYNTV------SRGVLKVA--LSKDDGDSWHD 277
              P QL N  +  D + L   DG L L YN        SR  L +A      D  S+  
Sbjct: 274 ---PPQLSNLKNYDDSLNLFNVDGTLYLIYNAPLQEGYNSRSTLMLAQFAPTPDKPSYFK 330

Query: 278 ALTLEENLAM--EFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            L + +      E SYPA +   D  VH+ YT +R  I+H++ 
Sbjct: 331 PLCILDFTQQGGEVSYPATLIFKD-QVHVLYTKDRQHIQHIIF 372


>gi|217033525|ref|ZP_03438954.1| hypothetical protein HP9810_905g44 [Helicobacter pylori 98-10]
 gi|216944050|gb|EEC23481.1| hypothetical protein HP9810_905g44 [Helicobacter pylori 98-10]
          Length = 347

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 132/314 (42%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPNV-- 79
           S HAS ++ +     L AYF G+ EGA DVKI    F  K   W     I+  EE +   
Sbjct: 37  SVHASALISLPNDKLLSAYFSGTKEGARDVKISANLFDSKINCWSEAFVILTKEELSYYS 96

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      + ++  L P
Sbjct: 97  HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIRFKFVQKLSLSP 154

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 155 FLNLSHLIRNKPLSTTDGGFMLPLYHELATQYPLLLKF--DKHNNPRE---LLRPNALNH 209

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    + +    M E+C     W     T L N N  ++ + 
Sbjct: 210 QLQPSLTPF----KDCAIMAFRNHSLKDSLMLETCKTPTAWQKPMLTNLKNLNDALNLIN 265

Query: 246 LKDGRLLLAYNTVSRGVLK--VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +  + LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 266 L-NQELYLIHNPSDSSLHRKELLLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNSHFI 321

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 322 DIVYTYNRSHIKHI 335


>gi|403378213|ref|ZP_10920270.1| hypothetical protein PJC66_00095 [Paenibacillus sp. JC66]
          Length = 670

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 129/300 (43%), Gaps = 33/300 (11%)

Query: 34  EVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPVLFKLPS 91
           ++  G  LV Y+  +     D +I +    D    W  P I  + P     +P +  L S
Sbjct: 53  QLPNGKLLVVYYNATAHQGTDGRIAMVESTDHGLTWSEPRIIMDTPEDDR-DPSITVL-S 110

Query: 92  NGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-LENGLLLCG 150
           +G LL     +E  K    + RS ++G+TWS   ++   +   + +  I+ L+ G LL  
Sbjct: 111 DGTLLLNWFMRENGKNQVFVSRSTDEGLTWSPPSKVQTNLEEAAVSSKIVELDTGELLLP 170

Query: 151 SSVESWNSWGSWMEVTV---DAGRSWRKYG--------PIYIPNE--SLSVIQPVPFHTA 197
                 +  G +  V V   D G +W            P+  PN   ++  ++PV     
Sbjct: 171 IYGFYASEQGVYRSVVVRSSDGGMTWDPSQESTMAEPVPLPQPNPFPAIGFVEPVIADLG 230

Query: 198 NRTLRVLMRSF----NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLL 253
           +  L  + RS+    NGIG   MSES D G TW+ A+ T +    S  D + L +G +L 
Sbjct: 231 SGHLISMHRSYSGSDNGIG--WMSESFDYGHTWTPAERTTMYAHCS--DLLVLSNGTVLH 286

Query: 254 AYN------TVSRGVL-KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
            +       +  R V+ +V     D + W DAL  +   + + +YP+ ++  DG + I +
Sbjct: 287 TWGDRKFEFSSGRAVVGRVMEPGTDWEDWEDALIYKSPGSTDMAYPSAVELPDGRIFIVF 346


>gi|403382038|ref|ZP_10924095.1| hypothetical protein PJC66_19676 [Paenibacillus sp. JC66]
          Length = 359

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 96/208 (46%), Gaps = 16/208 (7%)

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRS 172
           RS N G  W E  Q    +    +  P+ L++G LL  +  E+ N     +  + D G S
Sbjct: 138 RSTNGGSDW-EAPQPAANVGQAVRGVPVQLKDGSLLVSTYGEADNRGRVIISKSEDLGAS 196

Query: 173 WRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSF-----NGIGR---VCMSESCDGGL 224
           W +   +   NE+    +P  + T +  +  L+RS      +  GR   +  SES DGGL
Sbjct: 197 WTRVAQL--ENENYHFYEPFLYVTPSGKIVALLRSNSLHGPDASGRSHPLFTSESLDGGL 254

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSWHDALTL-- 281
           +WS     ++ +P S    ++L+ G++L+ Y    +   ++  L   + + W D   +  
Sbjct: 255 SWSDPVMRRVFSP-SPFHALRLQSGKVLINYGYRYQPAGVRAFLLDSECEGWDDIEEIVL 313

Query: 282 -EENLAMEFSYPAVIQASDGSVHITYTY 308
            ++ +A +  Y + +Q ++G + I Y +
Sbjct: 314 RDDGIAGDLGYTSAVQLANGDILIAYYF 341


>gi|420412153|ref|ZP_14911282.1| hypothetical protein HPNQ4228_1044 [Helicobacter pylori NQ4228]
 gi|393027811|gb|EJB28899.1| hypothetical protein HPNQ4228_1044 [Helicobacter pylori NQ4228]
          Length = 372

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFNNKTNRWSKAFTILTKEDLSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+          +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNILNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPTLTNLKNLNDALNLIN 290

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           L +  L L +N     + +    LSK ++ +S+     L++  A E SY +    +   +
Sbjct: 291 L-NKELYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYSSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|373852611|ref|ZP_09595411.1| hypothetical protein Opit5DRAFT_3465 [Opitutaceae bacterium TAV5]
 gi|372474840|gb|EHP34850.1| hypothetical protein Opit5DRAFT_3465 [Opitutaceae bacterium TAV5]
          Length = 468

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 23/168 (13%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           V+ D GR+W + G +  PN      + +     + ++ +  R+  G+     S S DGG 
Sbjct: 238 VSSDQGRTWERRGGVAFPNPRFD--EHMFIERRDGSIWMTARTRTGL---WESTSTDGGR 292

Query: 225 TWS--YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------RGVLKVALSKD 270
           TWS   A P +  +    I   +L+ G ++L  + V             R  L   LS D
Sbjct: 293 TWSKPTASPVRTASARHFI--RRLQSGNIVLVKHGVEIDKAPMSKGEPFRSHLTAFLSDD 350

Query: 271 DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           DG +W   L L+E      SYP   QA DG++++++  +R     +++
Sbjct: 351 DGHTWKGGLLLDER--RRVSYPDGFQAPDGTIYVSWDRDRNGKGEILM 396


>gi|412985815|emb|CCO17015.1| probable sialidase [Bathycoccus prasinos]
          Length = 635

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 47/163 (28%), Positives = 76/163 (46%), Gaps = 24/163 (14%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           V+ + G++W+  G I   N SL +   +      R ++V       +G +  S S D G 
Sbjct: 449 VSENRGKTWQPRGQITQSNTSL-IEGSIAELNDGRIIQVFR---TRVGCLYKSYSKDEGK 504

Query: 225 TWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYNTVS---------------RGVLKVALS 268
           +W+  +P  + NPNS +  ++L+ +G LLLA+N +                R  L VA+S
Sbjct: 505 SWTEPEPMPVPNPNSKVFLMRLEPNGELLLAFNNMKNQYRSRRGATKCRACRTHLHVAIS 564

Query: 269 KD-DGDSWHDALTLEENLAME---FSYPAVIQASDGSVHITYT 307
           +D DG+ W     LE+ +  E     YP + Q    SV + YT
Sbjct: 565 RDGDGNEWETIARLEDEIGYEAPRIHYPYMAQKDASSVLVAYT 607


>gi|225155980|ref|ZP_03724464.1| hypothetical protein ObacDRAFT_8593 [Diplosphaera colitermitum
           TAV2]
 gi|224803333|gb|EEG21572.1| hypothetical protein ObacDRAFT_8593 [Diplosphaera colitermitum
           TAV2]
          Length = 475

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 78/166 (46%), Gaps = 19/166 (11%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           V+ D G++W + G +  P       + +     + ++ +  R+ NG+     S S DGG 
Sbjct: 262 VSSDQGKTWTRRGGVKFPGPRFD--EHMFIERRDGSIWMTARTRNGL---WESTSADGGR 316

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------------TVSRGVLKVALSKDDG 272
           TW+  K + +   +S     +L+ G +LL  +             +SR  L   LS+DDG
Sbjct: 317 TWARPKESPIKTASSRHFIRRLQSGNILLIKHGGKIHEAPIWQGRLSRSRLAAFLSEDDG 376

Query: 273 DSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
            +W   L L++   +  SYP   QA DG+++I++ + R     +++
Sbjct: 377 LTWTGGLMLDKRGGV--SYPDGFQAPDGTLYISWDHQRNTEGEILM 420


>gi|420509188|ref|ZP_15007690.1| hypothetical protein HPHPH24C_0661 [Helicobacter pylori Hp H-24c]
 gi|420538012|ref|ZP_15036342.1| hypothetical protein HPHPM4_0783 [Helicobacter pylori Hp M4]
 gi|420539835|ref|ZP_15038152.1| hypothetical protein HPHPM5_0913 [Helicobacter pylori Hp M5]
 gi|393120614|gb|EJC21103.1| hypothetical protein HPHPH24C_0661 [Helicobacter pylori Hp H-24c]
 gi|393145567|gb|EJC45898.1| hypothetical protein HPHPM4_0783 [Helicobacter pylori Hp M4]
 gi|393146052|gb|EJC46381.1| hypothetical protein HPHPM5_0913 [Helicobacter pylori Hp M5]
          Length = 302

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 125/300 (41%), Gaps = 38/300 (12%)

Query: 40  FLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW---------NPVLFK 88
            L AYF G+ EGA DVKI    F  K  RW    I   +  +  +         NP+LF 
Sbjct: 6   LLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSHYSHEYIKKLGNPLLF- 64

Query: 89  LPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP--GILGPSKNKPIL 141
           L  N +LLF  +G  +  W+      +   +      ++ +  L P   +    KNKP+ 
Sbjct: 65  LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSPFLNLSHLIKNKPLN 123

Query: 142 LENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQP--VPFHTANR 199
             +G  +     E    +   + +  D   + R+   +  PN     +QP   PF    +
Sbjct: 124 TTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNHQLQPSLTPF----K 174

Query: 200 TLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN--T 257
              V+    +      M E+C     W     T L N N  ++ + L +  L L +N   
Sbjct: 175 DCAVMAFRNHSFKDSLMLETCKTPTDWQKPMLTNLKNLNDALNLINLNE-ELYLIHNPSD 233

Query: 258 VSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           +S    ++ LSK ++ +S+     L++  A E SYP+    +   + I YTYNR+ IKH+
Sbjct: 234 LSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIVYTYNRSHIKHI 290


>gi|196229021|ref|ZP_03127887.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
           flavus Ellin428]
 gi|196227302|gb|EDY21806.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
           flavus Ellin428]
          Length = 441

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 145/349 (41%), Gaps = 45/349 (12%)

Query: 12  AEEFTFPANSAPFKSC---HASTIVEVDKGHFLVAY---FGGSCEGAPDVKIWLQTFKDG 65
           AEE T   N  P K        + V +  G  L  Y   +GG+ +G+      + +   G
Sbjct: 90  AEENTVVLNIEPTKEHPRNSEGSFVRLKSGRILFYYSQFYGGAEDGSAARIAAIHSDDSG 149

Query: 66  R-WQSP-IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSE 123
           R W +P ++ +      + +  L +L S  L  FY +          ++ S ++  TWSE
Sbjct: 150 RTWSAPEVVLENTAGNNVMSVTLLRLASGRLAFFYILKNSWLDARPYVRFSTDEAATWSE 209

Query: 124 REQL--PPGILGPSKNKPILLENGLLLC-----GSSVESWNSWGSWMEVTVD-------A 169
            + +   PG    + ++ + L +G L+       S ++  +S  S+    +D        
Sbjct: 210 PKLVVAAPGYFVLNNDRVVQLSSGRLVAPVAFHRSRLDDPHSAKSFDGRAIDLWYYSDDE 269

Query: 170 GRSWRKYGPIY-IPNESLSVIQ-PVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           G++W++    + IP  S S +Q P     A+  L    R+  G+       S D G  W+
Sbjct: 270 GQTWQQSWSWWGIPVNSNSGLQEPGVVELADGQLFSWARTDQGVQYGF--SSADHGKDWT 327

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLA-YNTVS---------RGVLKVALSKDDGDSWHD 277
              PT+L +P S     +L D   LLA YN  S         R  L  A+S D+G +W  
Sbjct: 328 APAPTELKSPTSPASIKRLPDSSNLLAIYNDHSGRFPFPPKRRAPLVAAISTDNGLTWPK 387

Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITYT--------YNRTQIKHVVL 318
           A  +EE+L   + Y A+    D +V + Y          NR +I+ + L
Sbjct: 388 AKLIEEDLNGWYCYTAMYFVDD-AVLLAYCAGDDKVGGLNRLRIRRITL 435


>gi|171909757|ref|ZP_02925227.1| probable sialidase [Verrucomicrobium spinosum DSM 4136]
          Length = 385

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 125/315 (39%), Gaps = 39/315 (12%)

Query: 40  FLVAYFGGSCEGAPDVKIWLQTFKDGR-WQSPI-IADEEPNVPMWNPVLFKLPSNGLLLF 97
           +   ++GG+ + +P   + + +  DGR W  P+ I +      + +  L +L S  L  F
Sbjct: 68  YYTQFYGGAADESPARIVGIHSDDDGRTWSQPVTIVENNAGKNIMSVSLLRLASGKLAFF 127

Query: 98  YKIGQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLC------ 149
           Y +    Q     M  S + G TWS    +   PG    + ++ I  + G L+       
Sbjct: 128 YCLKNSWQDCRPQMCLSSDDGATWSSPRNILEAPGYFVLNNDRVIQTKAGRLIAPLAFHR 187

Query: 150 --GSSVESWNSWGS-----WMEVTVDAGRSWRKYGPIY-IPNESLSVIQ-PVPFHTANRT 200
              S   S  S+ +     W+  + D G SW +    + IP  S S +Q P     A+ +
Sbjct: 188 SRNSDPHSSKSFDARAITMWL-YSDDEGASWHEAATWWAIPVRSGSGLQEPGVVELADGS 246

Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLA-YNTVS 259
           L    R+  G      S+  DGG  W+  +PT L +P S      L     LLA +N  S
Sbjct: 247 LFSWSRTDQGAQYGFTSK--DGGTNWTAPEPTALKSPTSPASIKTLPGTTHLLAIFNDHS 304

Query: 260 RGV---------LKVALSKDDGDSWHDALTLEENL-------AMEFSYPAVIQASDGSVH 303
            G          L  A+S D G +W     LE+N        A+ F+  AV+        
Sbjct: 305 GGFPFPQGKRTPLVAAVSSDGGKTWPKVKLLEDNPDGWYCYNAIHFTKDAVLLGYCAGDK 364

Query: 304 ITYTYNRTQIKHVVL 318
                NR +I+ + +
Sbjct: 365 TVGGLNRLRIRRITM 379


>gi|257484992|ref|ZP_05639033.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           tabaci str. ATCC 11528]
          Length = 56

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 22/37 (59%), Positives = 30/37 (81%)

Query: 282 EENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +E L  EFSYP++IQA+DGS+H+ +TY R +IKHV L
Sbjct: 14  QEKLNREFSYPSIIQAADGSLHVAFTYFRQKIKHVHL 50


>gi|196232313|ref|ZP_03131167.1| hypothetical protein CfE428DRAFT_4333 [Chthoniobacter flavus
           Ellin428]
 gi|196223681|gb|EDY18197.1| hypothetical protein CfE428DRAFT_4333 [Chthoniobacter flavus
           Ellin428]
          Length = 387

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 54/209 (25%), Positives = 89/209 (42%), Gaps = 20/209 (9%)

Query: 112 KRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
           +RS N G TW +    P   +G + + PI L++G LL              +E + D GR
Sbjct: 164 RRSVNDGKTWEK----PVKTIGSAPHGPIQLQDGRLLYVGLSTHEKVLKLGVEESRDDGR 219

Query: 172 SWRKYGPIYIPNESLS--------VIQPVPFHTANRTLRVLMRSF---NGIGRVCMSESC 220
           SW+    I +  E             +P      +  L  ++R+    +    +  +ES 
Sbjct: 220 SWQYLAEIPVAEEDQRDPDWRVRMQDEPHVVELPDGKLVAMIRAEPKDHSQTYLRQTEST 279

Query: 221 DGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA-LSKDDGDSW--HD 277
           DGG TW+    T +      +  + LK+G LL+ Y        + A +SKD G +W    
Sbjct: 280 DGGKTWTTTHKTPIYGYPPHL--LMLKNGWLLVVYGVRREPFGEHACISKDGGKTWDVEH 337

Query: 278 ALTLEENLAMEFSYPAVIQASDGSVHITY 306
            +TL      +  YP+ +Q +DGS++  Y
Sbjct: 338 EITLRTAPNSDLGYPSSVQLADGSIYTVY 366


>gi|149176442|ref|ZP_01855056.1| probable sialidase [Planctomyces maris DSM 8797]
 gi|148844794|gb|EDL59143.1| probable sialidase [Planctomyces maris DSM 8797]
          Length = 394

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 106/241 (43%), Gaps = 28/241 (11%)

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPP----GILGPSKNKPIL 141
           L +L    + LFY     +      M+ S ++G  WSE  ++ P    G    + ++ I 
Sbjct: 124 LLRLQDGRIALFYARKNSLHDLLPVMRLSVDEGAHWSEPVEIIPQKEVGYYVLNNDRVIQ 183

Query: 142 LENGLLLCGSSV-------ESWNSWGSWM-EVTVDAGRSWRKYGPIYIPNE----SLSVI 189
           L+NG ++   ++       + +N    ++   + D G++W++   + +  +       + 
Sbjct: 184 LKNGRIVVPLALHQNLPGSDRFNPNAKFLCYYSDDQGKTWQRGTEVVVETQPGGKQPYMQ 243

Query: 190 QPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KD 248
           +P      + TL    R+  G   V  S+  DGG T+S  KP+Q+++P S     ++   
Sbjct: 244 EPGLVELKDGTLMGFCRTNGGSQYVAFSQ--DGGQTFSELKPSQIISPVSPASIERIPST 301

Query: 249 GRLLLAYN---------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASD 299
           G LLL +N         +  R  L +A+SKD+G +W     +E N    + Y A+    D
Sbjct: 302 GDLLLVWNNHQGIDAKLSGKRTPLTLAISKDEGKTWQQIQNIETNPNGWYCYTAIEFTKD 361

Query: 300 G 300
           G
Sbjct: 362 G 362


>gi|288925246|ref|ZP_06419181.1| putative alpha-rhamnosidase [Prevotella buccae D17]
 gi|288338011|gb|EFC76362.1| putative alpha-rhamnosidase [Prevotella buccae D17]
          Length = 1043

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 41/69 (59%), Gaps = 4/69 (5%)

Query: 9   GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--R 66
            +V +EF +   SA F   HASTI ++  G  + AYFGG+ E  P+V IW+     G  R
Sbjct: 918 AIVEDEFLY--ESAGFPQAHASTITQLKNGDLVAAYFGGTWERNPNVCIWVSRKPKGSDR 975

Query: 67  WQSPIIADE 75
           W++P++A +
Sbjct: 976 WEAPVLAGD 984


>gi|373851343|ref|ZP_09594143.1| hypothetical protein Opit5DRAFT_2197 [Opitutaceae bacterium TAV5]
 gi|372473572|gb|EHP33582.1| hypothetical protein Opit5DRAFT_2197 [Opitutaceae bacterium TAV5]
          Length = 431

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/212 (26%), Positives = 85/212 (40%), Gaps = 32/212 (15%)

Query: 133 GPSKNKPILLENGLLLCGSSV-------------------ESWNSWGSWMEVTVDAGRSW 173
           G S NKPI+L +G  L  +S+                   E  N  G+ +  + D G++W
Sbjct: 168 GCSLNKPIVLSSGDWLLPASLWDRNKLRHPTSAPFIEAFHELDNERGAGVLASTDQGKTW 227

Query: 174 RKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQ 233
            + G + IP  +   +  V      R   V + + +  G +  S S D G TW     T 
Sbjct: 228 ARRGGVRIPRPNFDEVNLV----ERRDGSVWLTARSASGDIWQSTSSDRGATWVPPSETG 283

Query: 234 LLNPNS-------GIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLA 286
           +LN ++           + L      +   T  R  L   LS DDG +W   L L+E   
Sbjct: 284 ILNVSARHFVRRLRSGRLLLVRHGEQIGVRTEKRSHLAAFLSDDDGLTWRGGLLLDEREG 343

Query: 287 MEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           +  SYP   Q  DG + I+Y  NR     V+L
Sbjct: 344 V--SYPDGGQFDDGVIFISYDRNRDTDGEVLL 373


>gi|421721593|ref|ZP_16160868.1| BNR repeat-like domain protein [Helicobacter pylori R055a]
 gi|407224928|gb|EKE94703.1| BNR repeat-like domain protein [Helicobacter pylori R055a]
          Length = 372

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SVHASSLINLPNNNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+          +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQLESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDKQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N N  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMALRNHSFKDSLMLETCKTPTAWQKPMLTNLKNLNDSLNLIN 290

Query: 246 LKDGRLLLAYN--TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L +  L L +N   +S    ++ LSK +  +    L + +  A E SYP+    +   + 
Sbjct: 291 LNE-ELFLIHNPSDLSLRRKELLLSKLENPNSFKTLKILDK-ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|67458778|ref|YP_246402.1| hypothetical protein RF_0386 [Rickettsia felis URRWXCal2]
 gi|67004311|gb|AAY61237.1| unknown [Rickettsia felis URRWXCal2]
          Length = 149

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 67/133 (50%), Gaps = 23/133 (17%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVAYFGG----SCEGAPDVKIWLQT-FKDG---RWQ 68
           F  N++  K  HA+TI   D    + ++FG     S +   +  IW+   FK     +W 
Sbjct: 11  FILNNSDLKDVHAATIFAFDDKKLITSFFGKTRSESGKSEENNSIWISLGFKSNNHYQWT 70

Query: 69  SP--IIADE---EPNVPMW---------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRS 114
           +P  I++ +   + ++P+          NPV+    +N LL+F KIG   + WSG + RS
Sbjct: 71  TPERIVSPQYFRDHHIPLKQDNGIISCGNPVI-TFFNNQLLIFSKIGPYPRTWSGILSRS 129

Query: 115 YNKGVTWSEREQL 127
           Y++G+TW + E L
Sbjct: 130 YDQGLTWQKPELL 142


>gi|296124013|ref|YP_003631791.1| hypothetical protein Plim_3780 [Planctomyces limnophilus DSM 3776]
 gi|296016353|gb|ADG69592.1| hypothetical protein Plim_3780 [Planctomyces limnophilus DSM 3776]
          Length = 823

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 78/175 (44%), Gaps = 22/175 (12%)

Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
            NG+L+     E+W  +G+    T D    W +   + + +                T++
Sbjct: 644 RNGVLISSDQGETWTEYGNIRLTTDDRYHGWAENNIVELDDG---------------TIK 688

Query: 203 VLMRSFNGIGRVCMSESCDGGLTW-SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-- 259
           +++R     G + M+ES DGG TW ++A+ T++ NP S     KL+D  + + +N  S  
Sbjct: 689 MIIRGDRLGGVLYMAESIDGGKTWPNFAQKTEIPNPGSKATIYKLRDNGVAMLHNPNSSH 748

Query: 260 RGVLKVALSKDDGDSW--HDALTLEENLAME--FSYPAVIQASDGSVHITYTYNR 310
           R  L + +S D   +W     L  E +   +   +YP    + DG +H  Y  NR
Sbjct: 749 RSPLSLWVSYDGMKTWPYRRVLVKESSDGSQGRLNYPDGFVSEDGYLHFAYDDNR 803



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 69/160 (43%), Gaps = 14/160 (8%)

Query: 165 VTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGL 224
           V+ D G++W   G +  P       + +     + TL +L R+  G+     S S D G 
Sbjct: 238 VSQDQGKTWNYRGGVAFPGPQFD--EHMIIERKDGTLWMLARTTEGM---FESLSQDQGK 292

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYN------TVSRGVLKVALSKDDGDSWHDA 278
           TWS  +      P       +L  G LLL  +      T  R  L   LS D+G +W   
Sbjct: 293 TWSQPEMAMPHIPTR-FHIRRLASGSLLLVRHGQVDERTKHRTRLMAFLSHDEGQTWGGG 351

Query: 279 LTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           L L++   +  SYP   QA DG ++I+Y   R + + V L
Sbjct: 352 LMLDDRGGV--SYPDGFQAPDGRIYISYDRERAKEREVQL 389


>gi|255533465|ref|YP_003093837.1| hypothetical protein Phep_3584 [Pedobacter heparinus DSM 2366]
 gi|255346449|gb|ACU05775.1| hypothetical protein Phep_3584 [Pedobacter heparinus DSM 2366]
          Length = 583

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 98/227 (43%), Gaps = 40/227 (17%)

Query: 114 SYNKGVTWSEREQLPPGILGPS------KNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
           + N G  WSE  ++  G +G        KNK  LL     +   S +  N+   +    +
Sbjct: 163 TLNNGKDWSEPTEIYHGYVGSIRGFIQLKNKRFLLVFAKAIPARSQKPSNNSTDYGLHDI 222

Query: 168 DA------GRSWRKY-GPIYIPNESLSV-----IQPVPFHTANRTLRVLMRSFNGIGRVC 215
            A      G+SW+     + IP ES        ++P     ++  + +L+R+  G+    
Sbjct: 223 GALYSDNMGKSWQSSENDLKIPIESTQTTRYGGVEPNIIELSDGKIWMLIRTNKGV--TY 280

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS------------RGVL 263
            S S D G +W   +PT+ ++ +S    ++L D R+++ +N+              R  L
Sbjct: 281 QSFSTDSGTSWHQPEPTKFISSDSPAATLRLADNRIVMFWNSNQRWDNKRTYALGGRETL 340

Query: 264 KVALSKDDGDSW---HDALTLEENLAME-----FSYPAVIQASDGSV 302
             A+S D+G +W    + LT      ME      +YP+ IQ++DG +
Sbjct: 341 HAAISSDEGKTWKGFREVLTSPSTKQMEKGDRGTAYPSAIQSADGKI 387


>gi|412985201|emb|CCO20226.1| BNR/Asp-box repeat protein [Bathycoccus prasinos]
          Length = 602

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 72/156 (46%), Gaps = 17/156 (10%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+ YG +  P   L  I+       + ++ +L R+   IG +  S S D G+TW+
Sbjct: 423 DNGKTWKAYGKLDHPRTWL--IENTLSERDDGSVLMLFRT--KIGVLYASASYDEGITWT 478

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNT---------VSRGVLKVALSKDDGDSWHDA 278
             + T + NP+S I   +L  G L +  N           +R  L VA+S D G +W   
Sbjct: 479 PPESTAIPNPDSKIVLTRLHSGALAMVLNAHGKLKGQKRHARTFLDVAVSGDGGATWRRI 538

Query: 279 LTLEENLA---MEFSYPAVIQA-SDGSVHITYTYNR 310
             LE+      +   YP  I+A  +G   +  TY++
Sbjct: 539 ARLEDESQSPLVRSHYPTAIEAFREGGTRLLITYSK 574


>gi|343085110|ref|YP_004774405.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
 gi|342353644|gb|AEL26174.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
           marinum DSM 745]
          Length = 371

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 70/291 (24%), Positives = 124/291 (42%), Gaps = 32/291 (10%)

Query: 31  TIVEVDKGHFLVAYF---GGSCEGAPDVKIWLQTFKDG--RW--QSPIIADEEPNVPMWN 83
           + +++  G  L AY    GGS + A  V I  +  +DG   W  +  +I   E  + + +
Sbjct: 49  SFIQLKDGRILFAYSKFEGGSGDHASGV-IAGRYSEDGGNTWTGKDEVIISNEGGMNVMS 107

Query: 84  PVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKPI 140
               +L    + LFY     +      M+ S ++  TWS+   +     G    + ++ I
Sbjct: 108 VSFLRLIDGSIALFYARKNSLDDCIPYMRFSTDEAKTWSDPFPVITDKKGYFVLNNDRVI 167

Query: 141 LLENGLLLCGSSVESWNSWGSWMEVTV------DAGRSWRKYGPIYIPNESLSVIQ-PVP 193
            L  G LL   S+         M  ++      D G++W  +  + +PN++  V+Q P  
Sbjct: 168 QLPGGRLLAPVSLHKTAGVDFRMRGSIFCYYSDDNGKTW--HSGMEMPNKNRVVVQEPGV 225

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD-GRLL 252
               N  + + +R+  G+  +  SE  D GLTW    P+ + +P +     ++ + G L+
Sbjct: 226 VLLENGKILMWLRTDRGVQYLSKSE--DQGLTWEEVYPSDIASPRAPASLKRIPETGDLM 283

Query: 253 LAYN---------TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAV 294
           L +N            R  L  A+SKD+G SW     LE++    F Y A+
Sbjct: 284 LIWNNNKSLVKETAAYRTPLTAAISKDEGKSWTHLKVLEDDPDGFFCYTAI 334


>gi|83951271|ref|ZP_00960004.1| Glycosyl hydrolase, BNR repeat protein [Roseovarius nubinhibens
           ISM]
 gi|83839170|gb|EAP78466.1| Glycosyl hydrolase, BNR repeat protein [Roseovarius nubinhibens
           ISM]
          Length = 379

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 6/128 (4%)

Query: 186 LSVIQPVPFHTANRTLRVLMRSFNGIGRVCM--SESCDGGLTWSYAKPTQLLNPNSGIDG 243
           L+ IQP+      R     +R F+ + R C+   E+ DGG +WS  + T + NP++ +  
Sbjct: 228 LNPIQPMIVPLDGRRAVAFLRDFD-VKRGCLLRCETEDGGQSWSPVETTVIANPSAPVAA 286

Query: 244 VKLKDGRLLLAYNT--VSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           + L DGR+L+A N    +   L +++S D+G +W      E   +    YP +   ++G 
Sbjct: 287 LPLGDGRILMAGNDDPDAPADLCLSISADEGRTWQRCRRFEGK-SGGLRYPMLRLLAEGE 345

Query: 302 VHITYTYN 309
           + + Y++ 
Sbjct: 346 IALCYSHG 353


>gi|385222020|ref|YP_005771153.1| neuraminidase (sialidase) [Helicobacter pylori SouthAfrica7]
 gi|317010799|gb|ADU84546.1| neuraminidase (sialidase) [Helicobacter pylori SouthAfrica7]
          Length = 332

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 134/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAST++ +  G+ L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 22  SVHASTLISLPNGNLLSAYFSGTKEGARDVKISANLFDGKSNRWGEAFIILTKEKLSKNA 81

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGP 134
                 + NP+LF L ++ +LLF  +G  +  W+      +   +   + + +    L P
Sbjct: 82  HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIDFKFVQKLSLNP 139

Query: 135 SKN-------KPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
             N       KP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 140 FLNLSHLVRTKPLNTTDGGFVLPIYHELATQYPLLLK--FDKHHNPRE---LLRPNALNH 194

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C    TW    PT L N +  ++ + 
Sbjct: 195 QLQPSLTPF----KDCAIMAFRNHSFKDHLMLETCKNPTTWQKVMPTNLKNLDDSLNLLN 250

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 251 LNGSLYLIHNPSDLSLRRKELLLSKLENINSFQTLKILDK--ANEVSYPSS-SLNPYFID 307

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 308 IVYTYNRSNIKHI 320


>gi|374850878|dbj|BAL53855.1| glycosyl hydrolase [uncultured Acidobacteria bacterium]
          Length = 365

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 129/324 (39%), Gaps = 40/324 (12%)

Query: 20  NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIA---- 73
            +A  +  HA  ++    G  L+  +     G  D +I     +DG   W +P+      
Sbjct: 28  EAASPRERHAYPVIARTAGGQLLTVWTVMEAGERDGRIVGSFSRDGGRTWSAPVTLIDVP 87

Query: 74  ---DEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE--QLP 128
              D +PN+ +  P +    +       ++   + +    M  S ++G +WS     ++P
Sbjct: 88  GKLDADPNIIIDGPRILVYSTT-----VEVPNRIDRSEVWMTVSEDEGKSWSPPVLLRMP 142

Query: 129 PGILGPSKNKPILLENGLLLCGSSVESWNSWG------------SWMEVTVDAGRSWRKY 176
              +   ++  + L +G L+   S + W   G            S + ++ D G +W   
Sbjct: 143 FRYVVGKRHLGLKLRDGTLVMPVSWDLWAEAGTPARTEGEMDLKSGVLLSRDGGVTWTPS 202

Query: 177 GPIYI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAK 230
           G ++I      P     V +P     AN  L +L+R+  G   +  + S DGG TW+   
Sbjct: 203 GELHIFERKVHPGAIWGVDEPAIVELANGELYMLLRT--GTEWLYEARSPDGGRTWTPPA 260

Query: 231 PTQLLNPNSGIDGVKLKD--GRLLLAYNTVSRGV--LKVALSKDDGDSWHDALTLEENLA 286
            + LL  NS     +L      +++ +N   R    L VALS D G SW     +  +  
Sbjct: 261 RSPLLGHNSPAALWRLDQHPQEIIVIWNNSPRHRYPLTVALSADGGRSWSRPKVVATSDG 320

Query: 287 MEFSYPAVIQASDGSVHITYTYNR 310
            + SYP + QA DG+    +   R
Sbjct: 321 PQVSYPGITQAPDGTFVAVWQQQR 344


>gi|196232221|ref|ZP_03131075.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
           flavus Ellin428]
 gi|196223589|gb|EDY18105.1| glycosyl hydrolase BNR repeat-containing protein [Chthoniobacter
           flavus Ellin428]
          Length = 397

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 94/225 (41%), Gaps = 34/225 (15%)

Query: 111 MKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGS--------------SVESW 156
           M RS + G+TWS R    P I+  S + PI L +G LL                 S +  
Sbjct: 181 MIRSNDGGITWSARY---PTIVN-SPHGPIQLSDGRLLYAGKQLWTPEKRVGVCVSNDDG 236

Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCM 216
            +W    E+   AG S  +Y  ++           V        + +   + NG G    
Sbjct: 237 TAWQWLAEIPARAGDSVAQYHELHA----------VEAGEGRLLVHIRNENKNGHGETLQ 286

Query: 217 SESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL--KVALSKDDGDS 274
           +ES DGG TWS      +    S ++  +L DGRLL+ Y    R  L  +  +S D G +
Sbjct: 287 TESTDGGKTWSEPHSIGVWGLPSFLN--RLHDGRLLMTYGH-RRAPLGNQARVSTDAGRT 343

Query: 275 WHDALTLE-ENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           W + L +  +  + +  YP+ ++  DGS+   +  +  +  + VL
Sbjct: 344 WSEPLIIYGDGPSGDLGYPSTVEFDDGSLLTVWYEHLKESPYAVL 388


>gi|408907236|emb|CCM11068.1| BNR/Asp-box repeat protein [Helicobacter heilmannii ASB1.4]
          Length = 365

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 84/328 (25%), Positives = 130/328 (39%), Gaps = 47/328 (14%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRW-QSPIIAD--- 74
           + P +S H++++  + K   L AYFGGS E   DV IW  L   + G+W Q+  + D   
Sbjct: 47  TPPVQSVHSASLSALGKHTLLAAYFGGSKEAKSDVCIWGNLYNTQTGQWSQAFKLLDPRT 106

Query: 75  -----EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG----CMKRSYNK--GVTWSE 123
                 E    + NPVL+    N L LF  +G  +  W+      +    NK   + + +
Sbjct: 107 LSHQAHEFVKILGNPVLYT-HQNRLYLFV-VGASLGGWATSKIYVLSAPANKPQALHYEQ 164

Query: 124 REQLPP--GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYI 181
              L P   I    +N P    +G  +     E       +++        +     +  
Sbjct: 165 TLHLSPLLNISHLVRNPPTPTTDGGFVLPIYHELAKKSPVFLKFDSHTHLQF-----LLR 219

Query: 182 PNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS 239
           PN     +QP  VP+        V  RS+        ++SC     W + KP  LLN  S
Sbjct: 220 PNSLKQQLQPSLVPY---QHCAFVAYRSYK--TERFYTQSCLD--FWHWQKP--LLNNLS 270

Query: 240 GIDGVKL---KDGRLLLAYNTV------SRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
             D        +G L L YN        +RG L++A  +    ++     L+ +   E S
Sbjct: 271 NHDSANTLFNANGVLYLIYNQALPHHKNARGALRLARLEPKTGAFMPLKILDRSQKGEVS 330

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHVVL 318
           YP V+   D  VH+ YT  R  I H+V 
Sbjct: 331 YPFVM-LFDNQVHVLYTKERKAIAHMVF 357


>gi|83747403|ref|ZP_00944443.1| Hypothetical Protein RRSL_02925 [Ralstonia solanacearum UW551]
 gi|83725990|gb|EAP73128.1| Hypothetical Protein RRSL_02925 [Ralstonia solanacearum UW551]
          Length = 81

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 40/69 (57%), Gaps = 12/69 (17%)

Query: 260 RGVLKVALSKDDGDSWH--------DALTLEENLA----MEFSYPAVIQASDGSVHITYT 307
           R  L +A+S+D G +W         D   +  N A     E+SYP+++Q +DG++HI +T
Sbjct: 4   RAPLSLAISEDGGQTWRRRRNLEIGDGYCMTNNSADQRNREYSYPSIVQGTDGALHIAFT 63

Query: 308 YNRTQIKHV 316
           Y R +IK+V
Sbjct: 64  YFRQRIKYV 72


>gi|430741382|ref|YP_007200511.1| BNR/Asp-box repeat-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430013102|gb|AGA24816.1| BNR/Asp-box repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 642

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 137/328 (41%), Gaps = 36/328 (10%)

Query: 17  FPANSAPFKSCHASTIVEVDKGHFLVA---YFGGSCEGAPDVKIWLQTFKDGR-WQSPII 72
           FPA +   +    S +V+ D G  + A   + G   + A    I + +   GR W    +
Sbjct: 304 FPATAQNPRYSEGSVLVQRD-GSLIYATTEFQGSGSDFAKARIIAVDSTDQGRTWSERRV 362

Query: 73  ADEEPNVPMWNPV---LFKLPSNG-----LLLFYKIGQEVQKWSGCMKRSYNKGVTWSER 124
             E  NV   N +   L +L +N      + LFY +          ++ S ++  T+ E 
Sbjct: 363 LQE--NVGQQNVMSVTLRRLKANNPFDGPIGLFYLVKNSPTDLDVYLRISNDEAKTFGEP 420

Query: 125 EQLP--PGILGPSKNKPILLENGLLLC--GSSVESWNS--WGSWMEVTVDAGRSW-RKYG 177
             +   PG    + ++  +L  G +L    ++ ++  S  + S    + D G +W R  G
Sbjct: 421 RLVTTDPGYHVLNNDRVTVLSTGRILVPIATTEDAVKSPKYASSCYFSDDQGETWKRSRG 480

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
            I+ P      ++P     A+   R+LM+    +  + +SES DGG +WS AK  ++  P
Sbjct: 481 LIHYPGPG--AMEPEVLEQADG--RLLMQIRTKVDHIAVSESTDGGESWSEAKSWEIAGP 536

Query: 238 NSGIDGVKL-KDGRLLLAYNTV--------SRGVLKVALSKDDGDSWHDALTLEENLAME 288
            S     ++   G LLL +N           R  L  ALS D+G +W     LE +    
Sbjct: 537 ESPATLRRIPSTGNLLLIWNNTLRDGKVSSVRTPLTSALSADEGRTWSFHRNLENDPTKV 596

Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHV 316
           ++Y +V+    G   +TY    T+   +
Sbjct: 597 YAYTSVV-FDRGRALLTYYVGDTKTGRI 623


>gi|412992711|emb|CCO18691.1| Putative Glycosyl hydrolase, BNR repeat [Bathycoccus prasinos]
          Length = 583

 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 60/117 (51%), Gaps = 16/117 (13%)

Query: 212 GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT----------VSRG 261
           G + +S S D G +W     T L NPNS +  V L +G +++ +N           V+RG
Sbjct: 442 GAIYVSHSADNGESWDTPHATPLQNPNSKVSAVLLPNGVIVVCHNAHAFMKHPRIGVTRG 501

Query: 262 VLKVALSKDDGDSWHDA-LTLEENL--AMEFSYPAVIQASDG-SVHITYTYNRTQIK 314
            L ++ S DDG +W +A L +E++L       YP ++   DG  + I+Y+ +   +K
Sbjct: 502 KLALSTSSDDGRTWKEAVLFIEDSLEPGKHVHYPTMVL--DGCDLRISYSVSGEGVK 556


>gi|315452521|ref|YP_004072791.1| neuraminidase [Helicobacter felis ATCC 49179]
 gi|315131573|emb|CBY82201.1| neuraminidase [Helicobacter felis ATCC 49179]
          Length = 358

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 82/338 (24%), Positives = 127/338 (37%), Gaps = 54/338 (15%)

Query: 18  PANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRWQSPIIADE 75
           P    P +S H++ +  +  GH LV YFGGS EG PDV I+  L   K   W SP     
Sbjct: 30  PIPPPPVQSVHSANLGALPSGHLLVVYFGGSEEGLPDVCIYGNLYDPKTKVW-SPAFKLL 88

Query: 76  EPNV----------PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSERE 125
            P             + NPVL  L  +  +  + +G  +  W+    R Y          
Sbjct: 89  SPTQLSTLSKSYIETLGNPVLAIL--DHKVYLFVVGVSLGGWAT--SRIYTLQADLPSLL 144

Query: 126 QLPPGI-------LGP-------SKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGR 171
              P +       L P        +N P+  E+G    G  +  ++   +   + +    
Sbjct: 145 TQKPHLHFISTLSLSPLLNISHLVRNAPLPTEDG----GFILPIYHELATKFPLLLKFDS 200

Query: 172 SWRKYGPIYIPNESLSVIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
             R    I  PN   S +QP  VP+         LM   +       +++C   L W+  
Sbjct: 201 KARLQRVIK-PNHLTSQLQPTLVPYQQC-----ALMAFRSSKDTHLYTQTCKDLLHWNPP 254

Query: 230 KPTQLLNPNSGIDGVKLKDGRLLLAYNT--------VSRGVLKVALSKDDGDSWHDALT- 280
            P+ L N +  +      +    L YN          S   L          S+ D L  
Sbjct: 255 IPSNLHNYDDALALFN-ANHETYLVYNAPLPNASSSRSSLRLARFAPTSKRSSYFDPLAI 313

Query: 281 LEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           L+++L  E SYP+V+   D ++H+ YT +R  I H+ L
Sbjct: 314 LDQSLEGEVSYPSVLVLQD-TIHLVYTRHRRTIMHLTL 350


>gi|301385787|ref|ZP_07234205.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato Max13]
          Length = 289

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/284 (24%), Positives = 101/284 (35%), Gaps = 69/284 (24%)

Query: 94  LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
           L LFY +   V  W+G       S + G +WS   QL   +  P  N        P+   
Sbjct: 2   LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 57

Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
           +G +      E    +  ++ ++ D G    K+      N     + P+    A   LR 
Sbjct: 58  DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 116

Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
              +F+   RV  S + DGG TWS   P    NPNS +  V +    LL+A N +  G  
Sbjct: 117 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVGVPGKGLLVALNDLREGRF 173

Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
           K++L   D            D   D L                       L + +  EF 
Sbjct: 174 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 233

Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
                             YP  I++ DG  H+ Y++N T IKHV
Sbjct: 234 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 277


>gi|302061461|ref|ZP_07253002.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato K40]
          Length = 290

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/284 (24%), Positives = 101/284 (35%), Gaps = 69/284 (24%)

Query: 94  LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
           L LFY +   V  W+G       S + G +WS   QL   +  P  N        P+   
Sbjct: 3   LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 58

Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
           +G +      E    +  ++ ++ D G    K+      N     + P+    A   LR 
Sbjct: 59  DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 117

Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
              +F+   RV  S + DGG TWS   P    NPNS +  V +    LL+A N +  G  
Sbjct: 118 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAVGVPGKGLLVALNDLREGRF 174

Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
           K++L   D            D   D L                       L + +  EF 
Sbjct: 175 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 234

Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
                             YP  I++ DG  H+ Y++N T IKHV
Sbjct: 235 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 278


>gi|149177156|ref|ZP_01855763.1| hypothetical protein PM8797T_27150 [Planctomyces maris DSM 8797]
 gi|148844048|gb|EDL58404.1| hypothetical protein PM8797T_27150 [Planctomyces maris DSM 8797]
          Length = 373

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 89/223 (39%), Gaps = 33/223 (14%)

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKP-------------ILLENGLLLCGSSVESWNSW 159
           RS + G +W++ E  P    G S+  P             +    G     S       W
Sbjct: 148 RSSDGGRSWTKSEDFPAAEAGWSEYIPFGDIWAGSDGALHVSCYQGQFKDPSQSTKTKGW 207

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE- 218
            S    + D GR+W            +SVI P   H       +  +++    R+   E 
Sbjct: 208 KSSHLKSEDDGRTW----------SVVSVIGPA--HNETDLFYLGEKNWLAAARIDKMEL 255

Query: 219 --SCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVA--LSKDDGDS 274
             S D G+TW   +P    N  +G    +LKDGRLLL+Y     G   V+  LS D+G +
Sbjct: 256 IRSNDNGVTWQKPQPVTGRNEING-HLTRLKDGRLLLSYGIRVNGRRGVSAKLSSDEGQT 314

Query: 275 WHDALTLEENL-AMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           W + + +       +  YP+ IQ  +G + +T  Y+    +HV
Sbjct: 315 WSEPIRISHTADGGDCGYPSSIQKENGEI-VTAWYSSESPQHV 356


>gi|422591606|ref|ZP_16666246.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           morsprunorum str. M302280]
 gi|330879173|gb|EGH13322.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           morsprunorum str. M302280]
          Length = 288

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 68/282 (24%), Positives = 100/282 (35%), Gaps = 69/282 (24%)

Query: 96  LFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLENG 145
           LFY +   V  W+G       S + G +WS   QL   +  P  N        P+   +G
Sbjct: 3   LFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHADG 58

Query: 146 LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
            +      E    +  ++ ++ D G    K+      N     + P+    A   LR   
Sbjct: 59  SIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRYAG 117

Query: 206 RSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKV 265
            +F+   RV  S + DGG TWS   P    NPNS +  V +    LL+A N +  G  K+
Sbjct: 118 ETFH---RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAVGVPGKGLLVALNDLREGRFKL 174

Query: 266 ALSKDDG-----------DSWHDAL----------------------TLEENLAMEF--- 289
           +L   D            D   D L                       L + +  EF   
Sbjct: 175 SLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFLSN 234

Query: 290 ---------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
                           YP  I++ DG  H+ Y++N T IKHV
Sbjct: 235 LDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 276


>gi|343085111|ref|YP_004774406.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
 gi|342353645|gb|AEL26175.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
           marinum DSM 745]
          Length = 388

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 101/230 (43%), Gaps = 29/230 (12%)

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL---PPGILGPSKNKPILL 142
           L +L +  + +FY            M+ S ++  TWS+         G    + ++ + L
Sbjct: 126 LLRLQNGDIAMFYLRKNNTDDCIPYMRISKDEAQTWSDPIACITDDEGYFVLNNDRVVQL 185

Query: 143 ENGLLL------CGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT 196
           E+G L+       G  +E       +   + D G++W +   +  P E + + +P     
Sbjct: 186 EDGTLMLAVAKHAGPGMEWRGRADLYAYSSTDNGQTWDRSEMVPNPKE-IVLQEPGLVEL 244

Query: 197 ANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY- 255
           A+ ++ +++R+  G+   C S S D G TWS  + + L++P S     ++     LLA  
Sbjct: 245 ADGSILMVIRTDAGVQ--CYSYSKDNGKTWSAVEESTLVSPVSPATIERIPGTGDLLAVW 302

Query: 256 -NTVS--------RGVLKVALSKDDGDSWHDALTLEEN-------LAMEF 289
            N VS        R  L  A+SKD+G +W +  TLE +       +AMEF
Sbjct: 303 NNNVSEDPDIAKLRTPLSSAISKDEGKTWTNYKTLENDPDGWYCYIAMEF 352


>gi|109947931|ref|YP_665159.1| neuraminidase (sialidase) [Helicobacter acinonychis str. Sheeba]
 gi|109715152|emb|CAK00160.1| predicted neuraminidase (sialidase) [Helicobacter acinonychis str.
           Sheeba]
          Length = 347

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 80/315 (25%), Positives = 136/315 (43%), Gaps = 40/315 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAST++ +   + L AYF G+ EGA DVKI    F  K  RW     ++  EE +   
Sbjct: 37  SVHASTLISLSNNNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFTLLTKEELSKNA 96

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L ++ +LLF  +G  +  W+      +   +      + ++  L P
Sbjct: 97  HEYIKKLGNPLLF-LHNDKILLFV-VGVSMGGWATSKIYQFESALEPIDFKFVQKLSLSP 154

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    + KP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 155 FLNLSHLVRTKPLNTTDGGFVLPLYHELATQYPLLLK--FDKHHNPRE---LLRPNALNH 209

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C    TW    PT L N N  ++ + 
Sbjct: 210 QLQPSLTPF----KDCAIMTFRNHFFKDYLMLETCKHPTTWQKPMPTNLKNLNDALNLLN 265

Query: 246 LKDGRLLLAYNTVSRGVLK---VALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGS 301
           L     L+  +  S   L+   + LSK ++ +S+     L++  A E SYP+    +   
Sbjct: 266 LNGSLYLI--HNPSDLFLRRKELLLSKLENANSFQTLKILDK--ANEVSYPSS-SLNSHF 320

Query: 302 VHITYTYNRTQIKHV 316
           + I YTYNR+ IKH+
Sbjct: 321 IDIVYTYNRSNIKHI 335


>gi|420502896|ref|ZP_15001432.1| hypothetical protein HPHPP41_1082 [Helicobacter pylori Hp P-41]
 gi|393150768|gb|EJC51073.1| hypothetical protein HPHPP41_1082 [Helicobacter pylori Hp P-41]
          Length = 372

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L+AYF G+ EGA DVKI    F  K  RW    I         + 
Sbjct: 62  SAHASSLISLPNDNLLIAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSQSS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|386747305|ref|YP_006220513.1| hypothetical protein HCD_01355 [Helicobacter cetorum MIT 99-5656]
 gi|384553547|gb|AFI05303.1| hypothetical protein HCD_01355 [Helicobacter cetorum MIT 99-5656]
          Length = 372

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 48/94 (51%), Gaps = 13/94 (13%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +  G+ L AYF GS EGA DVKI    F  K  RW    I   + N+    
Sbjct: 62  SVHASSLISLPNGNLLSAYFSGSKEGARDVKISANLFDSKTNRWSEAFILLTKENLFKNS 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWS 108
                 + NP+LF L  + +LLF  +G  +  W+
Sbjct: 122 HEFIKKLGNPLLF-LHKDKILLFV-VGVSMGGWA 153


>gi|420461945|ref|ZP_14960732.1| hypothetical protein HPHPH3_0770 [Helicobacter pylori Hp H-3]
 gi|393080108|gb|EJB80837.1| hypothetical protein HPHPH3_0770 [Helicobacter pylori Hp H-3]
          Length = 372

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 132/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I         + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSQSS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|373849242|ref|ZP_09592043.1| hypothetical protein Opit5DRAFT_0097 [Opitutaceae bacterium TAV5]
 gi|372475407|gb|EHP35416.1| hypothetical protein Opit5DRAFT_0097 [Opitutaceae bacterium TAV5]
          Length = 443

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/215 (23%), Positives = 88/215 (40%), Gaps = 36/215 (16%)

Query: 133 GPSKNKPILLENGLLLCGSSV-------------ESWNSWGSWMEVTV----DAGRSWRK 175
           G + NKPI+L NG  L   S+             +++    S     V    D GR+W +
Sbjct: 183 GSTLNKPIVLANGDWLLPISLWHRGLIHKPFAERDAFRELDSLRGANVFASGDEGRTWER 242

Query: 176 YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLL 235
            G +  P  +    + +     + T+ +  R+  G+     S S DGG TWS    + + 
Sbjct: 243 RGRVAFPRPNFD--EHMFIERRDGTIWMTARTRRGL---YESISADGGRTWSAPVESAIK 297

Query: 236 NPNSGIDGVKLKDGRLLLAY------------NTVSRGVLKVALSKDDGDSWHDALTLEE 283
             ++     +L  G +LL                  R  L   LS DDG +W   L L++
Sbjct: 298 TASARHFIRRLASGNILLVKYGEKIDEVPTRDGKTVRSHLTAFLSDDDGHTWKGGLLLDD 357

Query: 284 NLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
              +  +YP   QA DG++++++  +R     V++
Sbjct: 358 RTPV--TYPDGFQAPDGTLYVSWDKSRNAEGEVLM 390


>gi|255533458|ref|YP_003093830.1| glycosyl hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255346442|gb|ACU05768.1| glycosyl hydrolase BNR repeat-containing protein [Pedobacter
           heparinus DSM 2366]
          Length = 383

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 56/245 (22%), Positives = 109/245 (44%), Gaps = 25/245 (10%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL--- 127
           +I ++E  + + +  L +L +  + LFY            ++ S ++  TW++       
Sbjct: 105 VIVEKEGTMNVMSVSLLRLKNGKIALFYLKKNSTSDCIPMVRFSSDEAKTWTDPIACITD 164

Query: 128 PPGILGPSKNKPILLENGLLLCGSSVES------WNSWGS-WMEVTVDAGRSWRKYGPIY 180
             G    + ++ I L++G +L   ++        W++ G+ W   + D G++W+    + 
Sbjct: 165 KKGYFVLNNSRVIQLKSGRILLAVALHQTPEQAKWSNSGTLWSYYSDDNGKTWKPGQQVA 224

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNS- 239
            P+    V+   P     +   VLM      G   +S S + G +WS A+P+ + +P S 
Sbjct: 225 NPD---GVVTQEPGVVELKNGNVLMFIRTDAGVQYLSYSKNKGQSWSPAQPSTIKSPVSP 281

Query: 240 -GIDGVKLKDGRLLLAYNTV---------SRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
             I  + L  G L+L +N            R  L VA+SKD+G +W +  T+E++    +
Sbjct: 282 ASIKRIPLT-GDLILVWNNNGGDNPLIKGKRTPLNVAISKDEGKTWQNIKTVEDDPNGWY 340

Query: 290 SYPAV 294
            Y A+
Sbjct: 341 CYIAI 345


>gi|420472054|ref|ZP_14970750.1| hypothetical protein HPHPH18_0671 [Helicobacter pylori Hp H-18]
 gi|393091409|gb|EJB92041.1| hypothetical protein HPHPH18_0671 [Helicobacter pylori Hp H-18]
          Length = 372

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 79/313 (25%), Positives = 134/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|26988313|ref|NP_743738.1| BNR repeat-containing protein [Pseudomonas putida KT2440]
 gi|24983060|gb|AAN67202.1|AE016347_3 BNR domain protein [Pseudomonas putida KT2440]
          Length = 663

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 3/110 (2%)

Query: 210 GIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRLLLAYNT-VSRGVLKVA 266
           G   V MS S D   TW+       L P+S   I   +   GRLL+ +N  + R  L +A
Sbjct: 526 GDSGVLMSVSTDMMKTWTPWVNYLALAPSSSSRIWLGRSPSGRLLICWNNDIERKTLTLA 585

Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           LS D+G +W   + LE N   + +YP V    DG + I Y   RT  + +
Sbjct: 586 LSDDEGVTWPHKVVLEPNSVGQVTYPIVTFGDDGLILIAYDNERTAKRQI 635


>gi|225158831|ref|ZP_03725147.1| hypothetical protein ObacDRAFT_7801 [Diplosphaera colitermitum
           TAV2]
 gi|224802584|gb|EEG20840.1| hypothetical protein ObacDRAFT_7801 [Diplosphaera colitermitum
           TAV2]
          Length = 386

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 60/254 (23%), Positives = 104/254 (40%), Gaps = 45/254 (17%)

Query: 83  NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSK------ 136
           +P L +LP   L + Y       K       S ++G TW+     PP ++   K      
Sbjct: 88  SPALRRLPDGRLGMLYSFRLGTTKAMRHFVTSKDEGKTWT-----PPVVVASGKYVTGCH 142

Query: 137 NKPILLENGLLLCG-SSVESWNSWGSWMEVTV--DAGRSWRKYGPIYIPN---------- 183
           ++  +L++G  +      E+W     +  V    D GRSWR   P+ +P           
Sbjct: 143 DRFTILQSGRFVAPLHCSENWEDHYLYTRVATSDDNGRSWRVSEPLELPYVGGLGGLGDH 202

Query: 184 ------ESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
                 ES    +P      + +L + MR+   +G    SES D GLTWS  +  ++++P
Sbjct: 203 WKGAFYES-GCYEPGVVQRVDGSLLMTMRT--AMGTQFCSESFDEGLTWSPLRSLEVVSP 259

Query: 238 NSGIDGVKLKDGRLLLA-----YNTV-----SRGVLKVALSKDDGDSW--HDALTLEENL 285
            +  + V++    +LL      Y        +R  +   LS DDG +W  H    + ++ 
Sbjct: 260 QAPANLVRIPGTPVLLLLWSSDYGATIGTGGNRRCISCGLSMDDGRTWPHHLRRHVIQDT 319

Query: 286 AMEFSYPAVIQASD 299
              + YP+V+  +D
Sbjct: 320 VSSWDYPSVLFHND 333


>gi|374373452|ref|ZP_09631112.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
           19437]
 gi|373234425|gb|EHP54218.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
           19437]
          Length = 387

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 69/267 (25%), Positives = 103/267 (38%), Gaps = 24/267 (8%)

Query: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127
           Q   I D E  + + +  L +L +  + LFY            ++ S ++  TWS     
Sbjct: 107 QDVKIIDREGGMNVMSVSLLRLRNGTIALFYLRKNSTVDCIPVVRFSNDEAKTWSAPVDC 166

Query: 128 ---PPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYG 177
                G    + ++ I L +G LL   +  S  +   W E        + D G++W    
Sbjct: 167 ITDQQGYFVLNNDRVIQLSDGRLLLAVAKHSSPADTKWQEKGALFAYYSDDNGKTWTAGK 226

Query: 178 PIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNP 237
            +  P    ++I   P   A +  R+LM      G    S S D G TWS A P  L +P
Sbjct: 227 QVPTP---ATIITQEPGLVALKDGRILMYIRASGGSQYASYSKDRGATWSDALPFTLKSP 283

Query: 238 NSGIDGVKL-KDGRLLLAYNTVS---------RGVLKVALSKDDGDSWHDALTLEENLAM 287
            +     ++   G LL  +N            R  L VA+SKDDG SW     LE +   
Sbjct: 284 IAPATIERIPSTGDLLAVWNNNDGSIATIKGKRTPLTVAISKDDGISWIHIKNLEADPDG 343

Query: 288 EFSYPAVIQASDGSVHITYTYNRTQIK 314
            + Y A I     SV ++Y     Q K
Sbjct: 344 WYCYTA-IHFYKNSVLLSYCAGSQQAK 369


>gi|15611594|ref|NP_223245.1| hypothetical protein jhp0527 [Helicobacter pylori J99]
 gi|4155072|gb|AAD06108.1| putative [Helicobacter pylori J99]
          Length = 372

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTDWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|412988849|emb|CCO15440.1| hypothetical protein CHLNCDRAFT_134209 [Bathycoccus prasinos]
          Length = 676

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 79/190 (41%), Gaps = 37/190 (19%)

Query: 166 TVDAGRSWRKYGPIYIPNESLS--VIQPVPFHTA-NRTLRVLMRSFNGIGRVCMSESCDG 222
           + D G +W+  G I     +    VI+   F +  N  L +LMRS  G+     S S D 
Sbjct: 447 STDKGETWKANGAIQFQTGAKGRWVIEGSIFESGENHHLTMLMRSTEGV--AFKSISSDL 504

Query: 223 GLTWSYAKPTQLLNPNSGIDG-VKLKDGRLLLAYNTVS---------------------- 259
           G+TWS   P+ L+NP+S ++  V  K  RL +A+N  +                      
Sbjct: 505 GVTWSAPLPSALVNPDSKLNTIVNRKTKRLYVAFNDETIAMTASEGGADAIKGTSEDKEQ 564

Query: 260 ------RGVLKVALSKDDGDSWHDALTLEENL---AMEFSYPAVIQASDGSVHITYTYNR 310
                 R  L +A S DDG++ +    ++E +    +   YP +    D  + +TY+ + 
Sbjct: 565 TAGGKKRDELVIAHSDDDGENLYIFARIDEGVTPPGVMIHYPTMAFYDDQELFVTYSISY 624

Query: 311 TQIKHVVLLP 320
               + V  P
Sbjct: 625 NNATNFVEKP 634


>gi|302133069|ref|ZP_07259059.1| BNR/Asp-box repeat protein [Pseudomonas syringae pv. tomato NCPPB
           1108]
          Length = 291

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 100/284 (35%), Gaps = 69/284 (24%)

Query: 94  LLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPGILGPSKN-------KPILLE 143
           L LFY +   V  W+G       S + G +WS   QL   +  P  N        P+   
Sbjct: 4   LWLFY-VSVSVGGWAGSSVNAMVSSDMGASWSAPRQL---VTSPFLNISTLVRGAPVFHA 59

Query: 144 NGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRV 203
           +G +      E    +  ++ ++ D G    K+      N     + P+    A   LR 
Sbjct: 60  DGSIGLPVYHEFLGKFAEYLYLSAD-GEVIDKFRISRGKNSLQPTVVPLDGQRAVALLRY 118

Query: 204 LMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVL 263
              +F+   RV  S + DGG TWS   P    NPNS +    +    LL+A N +  G  
Sbjct: 119 AGETFH---RVLASRTEDGGQTWSEPYPLAPSNPNSSLAAGGVPGKGLLVALNDLREGRF 175

Query: 264 KVALSKDDG-----------DSWHDAL----------------------TLEENLAMEF- 289
           K++L   D            D   D L                       L + +  EF 
Sbjct: 176 KLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFSLEAYKEVIGQGFRASSGALRQPMEAEFL 235

Query: 290 -----------------SYPAVIQASDGSVHITYTYNRTQIKHV 316
                             YP  I++ DG  H+ Y++N T IKHV
Sbjct: 236 SNLDQRVCAPQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKHV 279


>gi|420413529|ref|ZP_14912653.1| hypothetical protein HPNQ4099_0770 [Helicobacter pylori NQ4099]
 gi|420418517|ref|ZP_14917609.1| hypothetical protein HPNQ4076_0654 [Helicobacter pylori NQ4076]
 gi|393029516|gb|EJB30597.1| hypothetical protein HPNQ4099_0770 [Helicobacter pylori NQ4099]
 gi|393033343|gb|EJB34406.1| hypothetical protein HPNQ4076_0654 [Helicobacter pylori NQ4076]
          Length = 372

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALESIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    KNKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLIKNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNALNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   ++    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAIMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELLLSKLENSNSFKTLKILDK--ADEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420531064|ref|ZP_15029439.1| hypothetical protein HPHPP28B_0548 [Helicobacter pylori Hp P-28b]
 gi|393139090|gb|EJC39471.1| hypothetical protein HPHPP28B_0548 [Helicobacter pylori Hp P-28b]
          Length = 372

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I         + 
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFIILTKEELSQSS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK +  +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420432150|ref|ZP_14931169.1| hypothetical protein HPHPH16_0883 [Helicobacter pylori Hp H-16]
 gi|393049743|gb|EJB50709.1| hypothetical protein HPHPH16_0883 [Helicobacter pylori Hp H-16]
          Length = 372

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK +  +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|424663009|ref|ZP_18100046.1| hypothetical protein HMPREF1205_03395 [Bacteroides fragilis HMW
           616]
 gi|404576699|gb|EKA81437.1| hypothetical protein HMPREF1205_03395 [Bacteroides fragilis HMW
           616]
          Length = 360

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 66/144 (45%), Gaps = 8/144 (5%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G SW K   + I       ++P     ++  + + MR+  G       +  +G L   
Sbjct: 197 DEGNSWDKSDQVVIDESKYGGLEPTIVELSSDKVLMSMRTDLGFQYFAYFDH-NGKLVKG 255

Query: 228 YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS---RGVLKVALSKDDGDSWHDALTLEE- 283
             + + L + +S    + LK G LL  +N      R  L ++LS D+GDSW++   LEE 
Sbjct: 256 PVR-SSLPSSDSPAHLISLKHGELLAVHNNCDVDMRNRLSISLSLDEGDSWNEEFILEES 314

Query: 284 -NLAMEFSYPAVIQASDGSVHITY 306
            N    FSYP++ +  D  + ITY
Sbjct: 315 ANEKNSFSYPSLKKVGD-RLLITY 337


>gi|420477675|ref|ZP_14976330.1| hypothetical protein HPHPH23_1232 [Helicobacter pylori Hp H-23]
 gi|393092354|gb|EJB92975.1| hypothetical protein HPHPH23_1232 [Helicobacter pylori Hp H-23]
          Length = 372

 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 128/312 (41%), Gaps = 34/312 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           L     L+   + +S    ++ LSK +  +    L + +  A E SYP+    +   + I
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 348

Query: 305 TYTYNRTQIKHV 316
            YTYNR+ IKH+
Sbjct: 349 VYTYNRSHIKHI 360


>gi|373957578|ref|ZP_09617538.1| hypothetical protein Mucpa_6016 [Mucilaginibacter paludis DSM
           18603]
 gi|373894178|gb|EHQ30075.1| hypothetical protein Mucpa_6016 [Mucilaginibacter paludis DSM
           18603]
          Length = 418

 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 4/106 (3%)

Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPN--SGIDGVKLKDGRLLLAYNTV-SRGVLKV 265
           N  G +  S S D G +W   +  + + PN  S +   KLK G +LL  N    R  L V
Sbjct: 281 NNAGNIQYSNSYDNGNSWDTIRAFKKIFPNAASRVFIRKLKSGNILLIINNSWYRTNLTV 340

Query: 266 ALSKDDGDSW-HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
            LS ++G SW +  + +  ++    SYP  +Q +DG +H+    +R
Sbjct: 341 FLSVNNGVSWKYKTVLVANDIVASPSYPDAVQTTDGVIHMVNDADR 386


>gi|313148318|ref|ZP_07810511.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|423278118|ref|ZP_17257032.1| hypothetical protein HMPREF1203_01249 [Bacteroides fragilis HMW
           610]
 gi|313137085|gb|EFR54445.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
 gi|404586128|gb|EKA90701.1| hypothetical protein HMPREF1203_01249 [Bacteroides fragilis HMW
           610]
          Length = 568

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ ++NG  +  ++       G +  V+ D G +W+    +  P+            +  
Sbjct: 144 PVFIKNGKRIVVAAHGGVTPKGCYTYVSDDDGLTWKCSNTVTSPDHRGGGFHKGIRWNHG 203

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+     R   + S DGGLTWS ++P+      +     +L 
Sbjct: 204 AVEPTVVELKDGTLWMLMRTSQD--RHYQAFSQDGGLTWSESEPSPFYGTITMPTLGRLA 261

Query: 248 DGRLLLAY-NT-----------------VSRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 262 DGRLLLFWCNTTPLPEKEGTNGVWDDVFTNRDVTHVAVSDDDGKTW 307


>gi|420442053|ref|ZP_14940994.1| hypothetical protein HPHPH36_0841 [Helicobacter pylori Hp H-36]
 gi|393058853|gb|EJB59739.1| hypothetical protein HPHPH36_0841 [Helicobacter pylori Hp H-36]
          Length = 372

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFVLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK +  +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELLLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|385218843|ref|YP_005780318.1| hypothetical protein HPGAM_02985 [Helicobacter pylori Gambia94/24]
 gi|317014001|gb|ADU81437.1| hypothetical protein HPGAM_02985 [Helicobacter pylori Gambia94/24]
          Length = 340

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 129/312 (41%), Gaps = 34/312 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNVPMW- 82
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +  + 
Sbjct: 30  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHYS 89

Query: 83  --------NPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                   NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 90  HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESTLEPIHFKFARKLSLSP 147

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 148 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 202

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 203 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTDWQKPISTNLKNLDDSLNLLN 258

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           L     L+   + +S    ++ LSK +  +    L + +  A E SYP+    +   + I
Sbjct: 259 LNKELYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 316

Query: 305 TYTYNRTQIKHV 316
            YTYNR+ IKH+
Sbjct: 317 VYTYNRSHIKHI 328


>gi|424664116|ref|ZP_18101153.1| hypothetical protein HMPREF1205_04502 [Bacteroides fragilis HMW
           616]
 gi|404576152|gb|EKA80892.1| hypothetical protein HMPREF1205_04502 [Bacteroides fragilis HMW
           616]
          Length = 568

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 70/166 (42%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +  V+ D G +W+    +  P+            +  
Sbjct: 144 PVFIRNGKRIVVAAHGGVTPKGCYTYVSDDDGLTWKCSNTVTSPDHRGGGFHKGIRWNHG 203

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+     R   + S DGGLTWS ++P+      +     +L 
Sbjct: 204 AVEPTVVELKDGTLWMLMRTSQD--RHYQAFSQDGGLTWSESEPSPFYGTITMPTLGRLA 261

Query: 248 DGRLLLAY-NT-----------------VSRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 262 DGRLLLFWCNTTPLPEKEGTNGVWDDVFTNRDVTHVAVSDDDGKTW 307


>gi|420475418|ref|ZP_14974089.1| hypothetical protein HPHPH21_0602 [Helicobacter pylori Hp H-21]
 gi|393093525|gb|EJB94142.1| hypothetical protein HPHPH21_0602 [Helicobacter pylori Hp H-21]
          Length = 372

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 134/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|420485606|ref|ZP_14984224.1| hypothetical protein HPHPP4_0781 [Helicobacter pylori Hp P-4]
 gi|420516100|ref|ZP_15014563.1| hypothetical protein HPHPP4C_0821 [Helicobacter pylori Hp P-4c]
 gi|420517803|ref|ZP_15016257.1| hypothetical protein HPHPP4D_0819 [Helicobacter pylori Hp P-4d]
 gi|393103741|gb|EJC04304.1| hypothetical protein HPHPP4_0781 [Helicobacter pylori Hp P-4]
 gi|393123302|gb|EJC23771.1| hypothetical protein HPHPP4D_0819 [Helicobacter pylori Hp P-4d]
 gi|393124399|gb|EJC24867.1| hypothetical protein HPHPP4C_0821 [Helicobacter pylori Hp P-4c]
          Length = 372

 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 129/314 (41%), Gaps = 38/314 (12%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  N +LLF  +G  +  W+      +   +      +  +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALDPIHFKFVRKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFILPLYHELVTQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T  L        + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTN-LKNLDDSLNLL 289

Query: 246 LKDGRLLLAYNTVSRGVLKVA--LSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
             +G L L +N     + +    LSK ++ +S+     L++  A E SYP+    +   +
Sbjct: 290 NLNGILYLIHNPSDSSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFI 346

Query: 303 HITYTYNRTQIKHV 316
            I YTYNR+ IKH+
Sbjct: 347 DIVYTYNRSHIKHI 360


>gi|425432777|ref|ZP_18813325.1| hypothetical protein HMPREF1391_00919 [Helicobacter pylori
           GAM100Ai]
 gi|410714701|gb|EKQ72155.1| hypothetical protein HMPREF1391_00919 [Helicobacter pylori
           GAM100Ai]
          Length = 347

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 130/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 37  SVHASALIGLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 96

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + NP+LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 97  HEYIKKLGNPLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 154

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 155 FLNLSHLIRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDKQNNPRE---LLRPNTLNH 209

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 210 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 265

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 266 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-NLNPHFID 322

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 323 IVYTYNRSHIKHI 335


>gi|417511435|ref|ZP_12176057.1| hypothetical protein LTSESEN_3055, partial [Salmonella enterica
           subsp. enterica serovar Senftenberg str. A4-543]
 gi|353643055|gb|EHC87336.1| hypothetical protein LTSESEN_3055, partial [Salmonella enterica
           subsp. enterica serovar Senftenberg str. A4-543]
          Length = 61

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/59 (45%), Positives = 37/59 (62%), Gaps = 2/59 (3%)

Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           R  L +  S+D+G+SW   L LE +   E+SYPA+I +  G VHITYT+NR  I +  L
Sbjct: 2   RYPLSLIASQDNGESWLPLLDLESDRG-EYSYPAII-SEGGVVHITYTWNRKNIVYCRL 58


>gi|160890988|ref|ZP_02071991.1| hypothetical protein BACUNI_03435 [Bacteroides uniformis ATCC 8492]
 gi|270294310|ref|ZP_06200512.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|156859209|gb|EDO52640.1| BNR/Asp-box repeat protein [Bacteroides uniformis ATCC 8492]
 gi|270275777|gb|EFA21637.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 398

 Score = 46.2 bits (108), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 74/162 (45%), Gaps = 18/162 (11%)

Query: 109 GCMKRSYNKGVTWSE--------REQLPPGILG--PSKNKPILLE--NGLL---LCGSSV 153
           G + RSY+ G TW++         E    G+ G  P+ N+  L E  +G +   +  S  
Sbjct: 161 GYLLRSYDDGRTWNDLIIPCSVPSEINYSGLGGKLPAYNRGALYEGKDGRIFWAVVASDT 220

Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
           E+  +   ++ ++ D G++W +YG     ++ +S  +   + T N  L   MRS     +
Sbjct: 221 EALRNTSVYLLISKDKGKTW-EYGTKIAKDKKVSFNETSMYETPNGDLIAFMRSEAFGDQ 279

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
            C++ S DGG T+   K          +  ++L D R+LL Y
Sbjct: 280 ACIARSIDGGKTFGEWKSMGF--KGHPLQAMRLPDNRVLLTY 319


>gi|395798866|ref|ZP_10478149.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. Ag1]
 gi|395337100|gb|EJF68958.1| BNR/Asp-box repeat-containing protein [Pseudomonas sp. Ag1]
          Length = 267

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 42/180 (23%), Positives = 66/180 (36%), Gaps = 52/180 (28%)

Query: 189 IQPVPFHTANRTLRVLMR-SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
           +QP       R    ++R + +   +V  S + D G TWS   P    NPNS +  V   
Sbjct: 77  LQPTVVPMDERNAVAMLRYAGDTFHKVLASRTEDAGHTWSEPYPLDPANPNSSLAAVATP 136

Query: 248 DGRLLLAYNTVSRGVLKVALSKDDG--DSWHDALTLEE---------------------- 283
           +  LL+A N +  G  ++ L   +   D W   + L+E                      
Sbjct: 137 NNGLLVALNDLKDGRFRLRLMGTNSRLDFWKPLINLDESPDPEGNPFSAEAYREIIGDKF 196

Query: 284 -----------------NLA----------MEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
                            NL            E+ YP  I++++G  H+ Y++N T IKHV
Sbjct: 197 RLSSGPRREMLVDKFLVNLDQRVCEETGCNFEYEYPYFIKSAEGLYHLVYSWNNTFIKHV 256


>gi|317480936|ref|ZP_07940016.1| hypothetical protein HMPREF1007_03135 [Bacteroides sp. 4_1_36]
 gi|316902829|gb|EFV24703.1| hypothetical protein HMPREF1007_03135 [Bacteroides sp. 4_1_36]
          Length = 370

 Score = 46.2 bits (108), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 74/162 (45%), Gaps = 18/162 (11%)

Query: 109 GCMKRSYNKGVTWSE--------REQLPPGILG--PSKNKPILLE--NGLL---LCGSSV 153
           G + RSY+ G TW++         E    G+ G  P+ N+  L E  +G +   +  S  
Sbjct: 133 GYLLRSYDDGRTWNDLIIPCSVPSEINYSGLGGKLPAYNRGALYEGKDGRIFWAVVASDT 192

Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
           E+  +   ++ ++ D G++W +YG     ++ +S  +   + T N  L   MRS     +
Sbjct: 193 EALRNTSVYLLISKDKGKTW-EYGTKIAKDKKVSFNETSMYETPNGDLIAFMRSEAFGDQ 251

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
            C++ S DGG T+   K          +  ++L D R+LL Y
Sbjct: 252 ACIARSIDGGKTFGEWKSMGF--KGHPLQAMRLPDNRVLLTY 291


>gi|289626890|ref|ZP_06459844.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
           syringae pv. aesculi str. NCPPB 3681]
          Length = 279

 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 84/218 (38%), Gaps = 23/218 (10%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGCEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQLPPG-- 130
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL     
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQLVTSPF 185

Query: 131 --ILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             I    +  P+   +G +      E    +  ++ ++ D G    K+      N     
Sbjct: 186 LNISTLVRGAPVFHTDGSIGLPVYHEFLGKFAEYLYLSAD-GELIDKFRISRGKNSLQPT 244

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
           + P+    A   LR    +F+   RV  S + DGG TW
Sbjct: 245 VVPLDGQRAVALLRYAGETFH---RVLASRTEDGGQTW 279


>gi|386749930|ref|YP_006223137.1| hypothetical protein HCW_06235 [Helicobacter cetorum MIT 00-7128]
 gi|384556173|gb|AFI04507.1| hypothetical protein HCW_06235 [Helicobacter cetorum MIT 00-7128]
          Length = 377

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 71/312 (22%), Positives = 127/312 (40%), Gaps = 36/312 (11%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPII---------ADE 75
            HAS+++ +     LVAYF G+ EGA DV+I    +     +W    I         +  
Sbjct: 65  VHASSLISLPNKTLLVAYFSGTREGAKDVQINANIYNPNTKKWSEAFILLTRETLSKSAH 124

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCM-----KRSYNKGVTWSEREQLPP- 129
           E    + NP+LF    N +LLF  +G  +  W+          S +K         L P 
Sbjct: 125 EYIKKLGNPLLF-WHQNKILLFV-VGVSMGGWATSKIYQLESPSIHKPFKLVRSLHLSPF 182

Query: 130 -GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSV 188
             +    + KP+  ++G  +     E  + +   ++    A  +      ++ PN   + 
Sbjct: 183 LNLSHLVRTKPLNTDDGGFILPLYHELASQYPLLLKFDFQANPT-----ELFRPNLLHAQ 237

Query: 189 IQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKD 248
           +QP    T  +   ++    +      M ++C     W    PT L         +   +
Sbjct: 238 LQPSI--TPYKKCAIMAFRNHHFSNALMLQTCLSPTKWQSLLPTNL-KNLDDSLNLLNLN 294

Query: 249 GRLLLAYN----TVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
             L L +N    ++ R  L ++  K+D +++H    L++  A E SYP+ +   +  + I
Sbjct: 295 NTLYLIHNPDNLSLRRQKLLISQLKND-NTFHTLSVLDK--ADEVSYPSSL-IDNNFIDI 350

Query: 305 TYTYNRTQIKHV 316
           TYTYNR  I+H+
Sbjct: 351 TYTYNRQTIRHI 362


>gi|26990557|ref|NP_745982.1| hypothetical protein PP_3852 [Pseudomonas putida KT2440]
 gi|24985537|gb|AAN69446.1|AE016580_5 hypothetical protein PP_3852 [Pseudomonas putida KT2440]
          Length = 518

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 3/119 (2%)

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG--VKLKDGRLLLAYNT-VS 259
            L+RS     ++  S S D   TW+  +   +  P++       +   G LLL +N  + 
Sbjct: 374 ALLRSLGSNSQMQYSVSRDLMRTWTPWQDYTVTAPSASSRAWLGRTPSGLLLLCWNNDLV 433

Query: 260 RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHVVL 318
           RG L V LS DDG ++     +E + +++ SYP V    DG + + Y   R   K + +
Sbjct: 434 RGTLTVGLSDDDGATYRYKKIIEPDSSIQVSYPVVSFGDDGEILVIYDNGRDSHKQIRI 492


>gi|32472278|ref|NP_865272.1| sialidase [Rhodopirellula baltica SH 1]
 gi|32443514|emb|CAD72956.1| probable sialidase [Rhodopirellula baltica SH 1]
          Length = 1153

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 72/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)

Query: 32  IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
            V++  G  L+ Y  F G+ + AP   +   +  +G  W +    +I   + +  + +  
Sbjct: 57  FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
           L +L    + LFY     ++K+               M+ S ++G TWSE  ++ P    
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170

Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
           PS      ++ I L +G L+   +V     W  + +       ++ D G +W++     +
Sbjct: 171 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 229

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
            +ESL+  +P     ++  + +  RS N      +S S D G TWS  KP+    P    
Sbjct: 230 TSESLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVSP 285

Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
             ++     G LL+ +N           V R     A+SKDDG +W +   +  +    +
Sbjct: 286 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 345

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            Y A+    D  +     Y R     +  +P
Sbjct: 346 CYTAIEFVDDHVLLAHCEYPRLNSLQLTRIP 376


>gi|420487269|ref|ZP_14985874.1| hypothetical protein HPHPP8_0748 [Helicobacter pylori Hp P-8]
 gi|420521053|ref|ZP_15019484.1| hypothetical protein HPHPP8B_0642 [Helicobacter pylori Hp P-8b]
 gi|393102390|gb|EJC02955.1| hypothetical protein HPHPP8_0748 [Helicobacter pylori Hp P-8]
 gi|393127640|gb|EJC28085.1| hypothetical protein HPHPP8B_0642 [Helicobacter pylori Hp P-8b]
          Length = 372

 Score = 45.1 bits (105), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 129/312 (41%), Gaps = 34/312 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV---- 79
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I   +  +    
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSESFILLTKEELSKNA 121

Query: 80  -----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALEPIRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           L     L+   + +S    ++ LSK +  +    L + +  A E SYP+    +   + I
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKVLKVLDK-ANEVSYPSY-SLNPHFIDI 348

Query: 305 TYTYNRTQIKHV 316
            YTYNR+ IKH+
Sbjct: 349 VYTYNRSHIKHI 360


>gi|373953332|ref|ZP_09613292.1| hypothetical protein Mucpa_1672 [Mucilaginibacter paludis DSM
           18603]
 gi|373889932|gb|EHQ25829.1| hypothetical protein Mucpa_1672 [Mucilaginibacter paludis DSM
           18603]
          Length = 418

 Score = 45.1 bits (105), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 53/106 (50%), Gaps = 4/106 (3%)

Query: 209 NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG--IDGVKLKDGRLLLAYN-TVSRGVLKV 265
           N  G +  S S + G +W      + + PN+   +   KLK G +LL  N +  R  L V
Sbjct: 281 NNAGNIQYSNSYNNGSSWDTISGFKTIFPNAASRVFIGKLKSGNILLIINNSFYRTNLTV 340

Query: 266 ALSKDDGDSW-HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
            LS ++G SW + ++ +E ++    SYP  +Q +D  +H+    +R
Sbjct: 341 FLSVNNGVSWKYKSVLVENDIVAAPSYPDAVQTTDNVIHMVNDADR 386


>gi|420445202|ref|ZP_14944116.1| hypothetical protein HPHPH42_0609 [Helicobacter pylori Hp H-42]
 gi|393063396|gb|EJB64243.1| hypothetical protein HPHPH42_0609 [Helicobacter pylori Hp H-42]
          Length = 372

 Score = 44.7 bits (104), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 131/313 (41%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SVHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
            E    + N +LF L  N +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNSLLF-LHDNKILLFV-VGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    + +    M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSLKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKILDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YTYNR+ IKH+
Sbjct: 348 IVYTYNRSHIKHI 360


>gi|449134458|ref|ZP_21769959.1| sialidase [Rhodopirellula europaea 6C]
 gi|448887088|gb|EMB17476.1| sialidase [Rhodopirellula europaea 6C]
          Length = 1153

 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 134/332 (40%), Gaps = 55/332 (16%)

Query: 32  IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RW---QSPIIADEEPNVPMWNPV 85
            V++  G  L+ Y  F G+ + AP   +   +  +G  W    + +I   + +  + +  
Sbjct: 57  FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSSDNGITWTTEDASVIERGDDDANLMSVS 116

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
           L +L    + LFY     ++K+               M+ S ++G TWSE  ++ P    
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPEAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170

Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
           PS      ++ I L +G L+   +V     W  + +       ++ D G++W++     +
Sbjct: 171 PSYSVLNNDRVIQLRSGRLIVPLAVHYRVGWPGYRKSAEIVCYLSDDQGKTWKRSQSA-L 229

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPN--- 238
            +ESL+  +P     ++  L +  RS N      +S S D G  WS   P+    P    
Sbjct: 230 TSESLAQ-EPGVVELSDDRLMMFCRSSNA---QLLSYSDDQGDFWSDFTPSSFTQPTVSP 285

Query: 239 SGIDGVKLKDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAME 288
           + I+ ++   G LL+ +N           V R     A+SKDDG +W +   +  +    
Sbjct: 286 ASIERIQ-STGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGW 344

Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           + Y A+    D  +     Y R     +  +P
Sbjct: 345 YCYTAIEFVGDHVLLAHCEYPRLNSLQITRIP 376


>gi|422406818|ref|ZP_16483837.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
           syringae pv. glycinea str. race 4]
 gi|330882049|gb|EGH16198.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. race 4]
          Length = 235

 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 47/181 (25%), Positives = 68/181 (37%), Gaps = 54/181 (29%)

Query: 189 IQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL 246
           +QP  VP     R + +L  +     RV  S + DGG TWS  +P    NPNS +  V +
Sbjct: 44  LQPTVVPL-DGQRAVALLRYAGETFHRVLASRTEDGGQTWSEPQPLTPSNPNSSLAAVGV 102

Query: 247 KDGRLLLAYNTVSRGVLKVALSKDDGDS--WHDALTLEEN-------LAME--------- 288
               LL+A N +  G  K++L   D     W     L+++        A+E         
Sbjct: 103 PGKGLLVALNDLREGRFKLSLYSTDEQMKVWRPLPDLDKSPDPLGTPFALEAYKAAIGEG 162

Query: 289 -------------------------------FSY--PAVIQASDGSVHITYTYNRTQIKH 315
                                          F Y  P  I++ DG  H+ Y++N T IKH
Sbjct: 163 FRASSGARRQPMEAEFLSNLDQRVCSAQGCDFEYEYPYFIRSPDGLYHLVYSWNNTFIKH 222

Query: 316 V 316
           V
Sbjct: 223 V 223


>gi|421611766|ref|ZP_16052898.1| sialidase [Rhodopirellula baltica SH28]
 gi|408497479|gb|EKK02006.1| sialidase [Rhodopirellula baltica SH28]
          Length = 1153

 Score = 44.7 bits (104), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 71/332 (21%), Positives = 133/332 (40%), Gaps = 55/332 (16%)

Query: 32  IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
            V++  G  L+ Y  F G+ + AP   +   +  +G  W +    +I   + +  + +  
Sbjct: 57  FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPP---- 129
           L +L    + LFY     ++K+               M+ S ++G TWSE  ++ P    
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPEDTP 171

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIY 180
              +L  + ++ I L +G L+   +V     W  + +       ++ D G +W++     
Sbjct: 172 SYSVL--NNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA- 228

Query: 181 IPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSG 240
           + +ESL+  +P     ++  + +  RS N      +S S D G TWS  KP+    P   
Sbjct: 229 LTSESLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVS 284

Query: 241 IDGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAME 288
              ++     G LL+ +N           V R     A+SKDDG +W +   +  +    
Sbjct: 285 PASIERFPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGW 344

Query: 289 FSYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
           + Y A+    D  +     Y R     +  +P
Sbjct: 345 YCYTAIEFVDDHVLLAHCEYPRLNSLQLTRVP 376


>gi|87310349|ref|ZP_01092479.1| probable sialidase [Blastopirellula marina DSM 3645]
 gi|87286848|gb|EAQ78752.1| probable sialidase [Blastopirellula marina DSM 3645]
          Length = 373

 Score = 44.3 bits (103), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 119/314 (37%), Gaps = 44/314 (14%)

Query: 32  IVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD-GR-WQSPIIADEEPNVP----MWNPV 85
           +V+   G  L  Y     E  P+  I  ++  D GR W    I   +P  P      +P 
Sbjct: 52  MVQRKDGSLLFCY----TEYGPEGGIVAKSSTDQGRTWTEAKILVPQPVSPDPGRFNHPS 107

Query: 86  LFKLPSNGLLLFYKIGQEVQK---WSGCMKRSYNKGVTWSEREQLPP--GILGPSKNKPI 140
           L +L +  LLL Y       K        +RS + G TW E+  L P  G      +K +
Sbjct: 108 LLRLTNGDLLLSYNYTTHPTKPYYAVTLYRRSTDDGATWGEQFPLTPYSGYTLIHNDKLL 167

Query: 141 LLENGLLLCGSSVESW-------NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVP 193
            LE+G ++  +  + +       N +   +  + D G SW     +    ES  +     
Sbjct: 168 TLEDGRIIAVAETKKYLPSSQDHNGYVGLVFYSDDQGYSWYPSKNVVDLYESRKIDVQEA 227

Query: 194 FHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLN-PNSGIDGVKL--KDGR 250
                R  R++M + +  G    + S D G TWS  K    L  P +G+  V+     G 
Sbjct: 228 DAVELRDGRLMMFARSYSGHPVKAFSTDRGETWSQGKMIPALKMPYAGLPTVRRIPSTGD 287

Query: 251 LLLAY-----------NTVSRGVLKVALSKDDGDSWHDALTLEENLAME----FSYPAVI 295
           LL  +               R  L  A+SKD+G+++     L+ N+  E      Y  V 
Sbjct: 288 LLFVWISERSALKSDPKVPVRSTLTSAISKDEGETF----ILQRNIVKEPGNDLGYQCVE 343

Query: 296 QASDGSVHITYTYN 309
              DG+  I Y  N
Sbjct: 344 FLEDGTALIAYHAN 357


>gi|420420528|ref|ZP_14919614.1| hypothetical protein HPNQ4161_1038 [Helicobacter pylori NQ4161]
 gi|393036219|gb|EJB37259.1| hypothetical protein HPNQ4161_1038 [Helicobacter pylori NQ4161]
          Length = 372

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 74/311 (23%), Positives = 131/311 (42%), Gaps = 32/311 (10%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  EE +   
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFIILTKEELSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF    +  +L + +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLFS--HDDKILLFVVGVSMGGWATSKIYQFESALEPIHFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   + +  D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYP--LLLKFDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            +QP    T+ +   V+    +      M E+C     W     T L N +  ++ + L 
Sbjct: 235 QLQPSL--TSFKDCAVMAFRNHSFKDNLMLETCKTPTAWQKPISTNLKNLDDSLNLLNLN 292

Query: 248 DGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHIT 305
               L+   + +S    ++ LSK +  +S+     L++  A E SYP+    +   + I 
Sbjct: 293 GILYLIHNPSDLSLRRKELWLSKLETSNSFKTLKVLDK--ANEVSYPSY-SLNPHFIDIV 349

Query: 306 YTYNRTQIKHV 316
           YTYNR+ IKH+
Sbjct: 350 YTYNRSHIKHI 360


>gi|430743658|ref|YP_007202787.1| BNR/Asp-box repeat-containing protein [Singulisphaera acidiphila
           DSM 18658]
 gi|430015378|gb|AGA27092.1| BNR/Asp-box repeat protein [Singulisphaera acidiphila DSM 18658]
          Length = 410

 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 110/247 (44%), Gaps = 44/247 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNG---LLLFYKIGQ--EVQKWSGCMK--------RSYNK 117
           +IAD+ P+  M NP        G   +L+    GQ  E Q  +G  +        +S ++
Sbjct: 126 VIADDGPHT-MGNPCPVVDRETGTIWMLMTRNHGQDKESQILNGTSQETRSVWVMKSTDE 184

Query: 118 GVTWSEREQLPPGILGPS-------KNKPILLENGLLL--CGSSVESWNSWGSWMEVTVD 168
           G+TWS R ++ P +  P            I LE+G ++  C   +       S +  + D
Sbjct: 185 GLTWSPRVEITPQVKLPDWTWYATGPGVSIQLESGRMVVPCDHFLAKSKVSRSHVIYSDD 244

Query: 169 AGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRS---FNGIGRVCMSESCDGGLT 225
            G +W+  G   +  E ++  Q      +NRTL + MR      G GR  +S S DGGLT
Sbjct: 245 HGATWKIGG---VGGERVNECQVAEL--SNRTLLLNMRHQPPRRGEGRA-ISTSRDGGLT 298

Query: 226 WSYAKPT---QLLNP--NSGIDGVKLKDGRL---LLAYNTVS--RGVLKVALSKDDGDSW 275
           W+   PT    L+ P   + + G+K + GR    LL  N  S  R ++ V LS+D G +W
Sbjct: 299 WT--PPTLDPSLVEPACQASLIGLKDEQGRASNRLLFSNPASSKRELMTVRLSEDGGKTW 356

Query: 276 HDALTLE 282
               +L 
Sbjct: 357 SQHKSLH 363


>gi|440717535|ref|ZP_20898020.1| sialidase [Rhodopirellula baltica SWK14]
 gi|436437339|gb|ELP30987.1| sialidase [Rhodopirellula baltica SWK14]
          Length = 1121

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 71/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)

Query: 32  IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQSP---IIADEEPNVPMWNPV 85
            V++  G  L+ Y  F G+ + AP   +   +  +G  W +    +I   + +  + +  
Sbjct: 25  FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 84

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
           L +L    + LFY     ++K+               M+ S ++G TWSE  ++ P    
Sbjct: 85  LLRLQDGRIGLFY-----IRKYDPTPDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 138

Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
           PS      ++ I L +G L+   +V     W  + +       ++ D G +W++     +
Sbjct: 139 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 197

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
            ++SL+  +P     ++  + +  RS N      +S S D G TWS  KP+    P    
Sbjct: 198 TSKSLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSELKPSSFTQPTVSP 253

Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
             ++     G LL+ +N           V R     A+SKDDG +W +   +  +    +
Sbjct: 254 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 313

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            Y A+    D  +     Y R     +  +P
Sbjct: 314 CYTAIEFVDDHVLLAHCEYPRLNSLQLTRVP 344


>gi|78485668|ref|YP_391593.1| BNR repeat-containing glycosyl hydrolase [Thiomicrospira crunogena
           XCL-2]
 gi|78363954|gb|ABB41919.1| Glycosyl hydrolase [Thiomicrospira crunogena XCL-2]
          Length = 354

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 51/232 (21%), Positives = 92/232 (39%), Gaps = 38/232 (16%)

Query: 113 RSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVE-SWNSWGSWMEVTVDAGR 171
           RS + G TW    ++  G  G +     L    LL+   +++       S   ++ D G+
Sbjct: 124 RSLDGGQTWEAPVRVQKGYCGATTTMISLSSGALLMTAQNLDYEAGRHYSLTYLSEDEGQ 183

Query: 172 SWRKYGPIYIPNESL--SVIQPVPFHTANRTLRVLMRS-----FNGIGRVCMSESCDGGL 224
           SW     I I  +       +       +  +  L+R+     +NG        S + G 
Sbjct: 184 SWHASNKIDIGGQGHHGGCYEGTLLELKDGRVWYLIRTNLDWFWNGY-------SNNQGK 236

Query: 225 TWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS----------------------RGV 262
           TW++ +P      + G+   +L+ GRLLLAYN V+                      R  
Sbjct: 237 TWTHLEPGLSAGSSPGML-CRLQSGRLLLAYNQVTKEGETLAPRIAGLFSEVPASWQREE 295

Query: 263 LKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIK 314
           L + LS+DDG +W D + +        SY ++ +   G + +T   ++ ++K
Sbjct: 296 LSIRLSEDDGKTWSDPIVIAACKGAWLSYCSLFEPHAGEIWLTTMQSQLKLK 347


>gi|406832273|ref|ZP_11091867.1| hypothetical protein SpalD1_11554 [Schlesneria paludicola DSM
           18645]
          Length = 397

 Score = 43.5 bits (101), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 42/95 (44%), Gaps = 3/95 (3%)

Query: 218 ESCDGGLTWSY-AKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGV-LKVALSKDDGDSW 275
            S D G TW   + P       +    ++L DGR+ L Y   +    ++  LS D G SW
Sbjct: 282 RSADDGKTWKLDSTPVPDAGEGNPPAMIRLADGRVCLTYGVRAEPYEMRARLSSDGGRSW 341

Query: 276 HDALTLE-ENLAMEFSYPAVIQASDGSVHITYTYN 309
           H  + L  +    +  YPA +Q  DG +   Y ++
Sbjct: 342 HKEIVLRGKGGGRDIGYPATVQRPDGKIVTIYYFH 376


>gi|417303213|ref|ZP_12090274.1| sialidase [Rhodopirellula baltica WH47]
 gi|327540483|gb|EGF27066.1| sialidase [Rhodopirellula baltica WH47]
          Length = 1153

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 71/331 (21%), Positives = 132/331 (39%), Gaps = 53/331 (16%)

Query: 32  IVEVDKGHFLVAY--FGGSCEGAPDVKIWLQTFKDG-RWQS---PIIADEEPNVPMWNPV 85
            V++  G  L+ Y  F G+ + AP   +   +  +G  W +    +I   + +  + +  
Sbjct: 57  FVQLKDGRLLLVYTKFIGTGDHAPAALVSRHSNDNGITWTTEDDSVIERGDDDANLMSVS 116

Query: 86  LFKLPSNGLLLFYKIGQEVQKWSGC------------MKRSYNKGVTWSEREQLPPGILG 133
           L +L    + LFY     ++K+               M+ S ++G TWSE  ++ P    
Sbjct: 117 LLRLQDGRIGLFY-----IRKYDPTLDAKHLFLDDILMRTSSDEGDTWSEPTRIVPKDT- 170

Query: 134 PS-----KNKPILLENGLLLCGSSVESWNSWGSWME-------VTVDAGRSWRKYGPIYI 181
           PS      ++ I L +G L+   +V     W  + +       ++ D G +W++     +
Sbjct: 171 PSYSVLNNDRVIQLSSGRLIVPLAVHYRVGWPGYRKSAEMVCYLSDDQGATWKRSQSA-L 229

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGI 241
            ++SL+  +P     ++  + +  RS N      +S S D G TWS  KP+    P    
Sbjct: 230 TSKSLAQ-EPGVVELSDGRVMMFCRSSNA---QLLSYSDDQGDTWSDLKPSSFTQPTVSP 285

Query: 242 DGVKL--KDGRLLLAYNT----------VSRGVLKVALSKDDGDSWHDALTLEENLAMEF 289
             ++     G LL+ +N           V R     A+SKDDG +W +   +  +    +
Sbjct: 286 ASIERIPSTGDLLMLWNNGDDELAKKQPVGRRPFTAAISKDDGKTWQNIQNVGTDPEGWY 345

Query: 290 SYPAVIQASDGSVHITYTYNRTQIKHVVLLP 320
            Y A+    D  +     Y R     +  +P
Sbjct: 346 CYTAIQFVDDHVLLAHCEYPRLNSLQLTRVP 376


>gi|412988324|emb|CCO17660.1| hypothetical protein PTSG_00864 [Bathycoccus prasinos]
          Length = 554

 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 65/303 (21%), Positives = 111/303 (36%), Gaps = 62/303 (20%)

Query: 20  NSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVK---IWLQTFKDG-RWQSPIIADE 75
           + + FK  H +   E+  G   VA+     E    ++    W  +  +G  +  P +  +
Sbjct: 139 SESEFKYMHMAMPTELPDGSIAVAFQASPTEYEGSIRQALYWAISRDNGVTFSRPEMIAK 198

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSG--------------CMKRSYNKGVTW 121
           +  +P+W PVL K+    + L Y +     ++                  K S + G TW
Sbjct: 199 DDQLPVWTPVL-KVSGQRVFLLYTVSSRKCRYYDQSRNALRHSPGGDVVYKVSDDNGKTW 257

Query: 122 SEREQL-----PPGILGPSKNKPILLENG-------------------------LLLCGS 151
           S  + +       G+     N  + LE+G                         LL  GS
Sbjct: 258 SSGQVILSYGSEDGMPKVIANTLVELEDGSWVLPFWREPGKTCPQMKSEVKDASLLRKGS 317

Query: 152 -----SVESWNSWGSWMEVTVDAGRSWRKYGPIY---IPNESLSVIQPVPFHTANRTLRV 203
                S++   +W  +  +T   G SW     +          +  +    +   RT  V
Sbjct: 318 AGVIRSMDQGLTWEVFGNLTATKGESWLIENTVVERKSERRKTTKKKENNNNKGGRTNNV 377

Query: 204 LMRSF---NGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR 260
           L++ F   +G+  V +S   D G TW+ A  T L NPNS +  +++ DG    AY+ V  
Sbjct: 378 LVQHFRTKSGVAFVSLSR--DNGKTWTEASETTLPNPNSKMHTIRIDDGDTYDAYDNVDE 435

Query: 261 GVL 263
             L
Sbjct: 436 KTL 438


>gi|420482145|ref|ZP_14980782.1| hypothetical protein HPHPP2_0672 [Helicobacter pylori Hp P-2]
 gi|420512568|ref|ZP_15011051.1| hypothetical protein HPHPP2B_0676 [Helicobacter pylori Hp P-2b]
 gi|393099379|gb|EJB99960.1| hypothetical protein HPHPP2_0672 [Helicobacter pylori Hp P-2]
 gi|393157631|gb|EJC57892.1| hypothetical protein HPHPP2B_0676 [Helicobacter pylori Hp P-2b]
          Length = 372

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 12/83 (14%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPII---------AD 74
           S HAS+++ +   + L AYF G+ EGA DVKI    F  K  RW    I           
Sbjct: 62  SAHASSLISLPNDNLLSAYFSGTKEGARDVKISANLFDGKTNRWSEAFILLTKEELSHHS 121

Query: 75  EEPNVPMWNPVLFKLPSNGLLLF 97
            E    + NP+LF L  + +LLF
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLF 143


>gi|320537217|ref|ZP_08037181.1| hypothetical protein HMPREF9554_01919 [Treponema phagedenis F0421]
 gi|320145930|gb|EFW37582.1| hypothetical protein HMPREF9554_01919 [Treponema phagedenis F0421]
          Length = 525

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 76/191 (39%), Gaps = 35/191 (18%)

Query: 146 LLLCGSSVESWNSWGSWMEVTV------DAGRSWRKYGPIYIPNESLSVIQPVPFHTANR 199
           LL       + NSWG+     V      D  ++W+    I  P   +S  +       + 
Sbjct: 318 LLFSAYGGSTANSWGAHFAERVAVFWTDDEAKTWQHSRYIKTPVHKMS--ESAIIEAPDG 375

Query: 200 TLRVLMRS-FNGIGRVCMSESCDGGLTWSYAKPTQLLNP--NSGIDGVKLK-----DGRL 251
            L +  RS F G   V MS+S DGG TWS A+P        N+ I  + L+       + 
Sbjct: 376 ALIIFSRSDFQG---VMMSQSNDGGATWSEAQPLAGFTATVNNLISVINLERLSELQNKT 432

Query: 252 LLAYN-------TVSRGVLKVALSKDDGDSWHDALTLEENLAME---------FSYPAVI 295
           L+A++       T S G +K+     +GD W       E   +E         F+Y  + 
Sbjct: 433 LVAFSYANGLDRTRSAGTVKIGALVKNGDKWQLDFLWNEGSPLELAVTAADKFFAYSCLA 492

Query: 296 QASDGSVHITY 306
           +  DGS+ + Y
Sbjct: 493 ELPDGSLALAY 503


>gi|400293104|ref|ZP_10794989.1| BNR/Asp-box repeat protein [Actinomyces naeslundii str. Howell 279]
 gi|399901810|gb|EJN84680.1| BNR/Asp-box repeat protein [Actinomyces naeslundii str. Howell 279]
          Length = 771

 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 51/211 (24%), Positives = 85/211 (40%), Gaps = 44/211 (20%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
           +  S + GVTWS+   L P +       LG      I L NG       + L  ++ ++W
Sbjct: 507 IAHSDDDGVTWSKPRDLNPQLKQPWMRFLGTCPGNGIALRNGPHAGRLVVPLYFNNDQNW 566

Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
            +  + +  + D G +W+              +  P    +E+ S+ +       +  L 
Sbjct: 567 LAMCATVAYSDDHGETWQLGRSPNEGRQTPEGELDPQTFVDETWSLHEAAVVERRDGVLL 626

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAK-----PTQLLNPNS-------GIDGVKLKDGR 250
           + MR+ +  GRV +SES D G TW   +     P     PN+       G D +   +  
Sbjct: 627 LFMRNQHPRGRVAVSESHDAGQTWGAIRFDKELPEIWCQPNAISLPSTEGEDRIVFANAS 686

Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTL 281
           L+L +    RG   + LS D G +W  + TL
Sbjct: 687 LMLPF----RGCGVIRLSLDGGRTWKYSRTL 713


>gi|255035678|ref|YP_003086299.1| hypothetical protein Dfer_1900 [Dyadobacter fermentans DSM 18053]
 gi|254948434|gb|ACT93134.1| hypothetical protein Dfer_1900 [Dyadobacter fermentans DSM 18053]
          Length = 384

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 38/160 (23%), Positives = 72/160 (45%), Gaps = 22/160 (13%)

Query: 157 NSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHT-----ANRTLRVLMRSFNGI 211
            +WG  M ++ D G++W   GP+    E  ++  P+   T      + TL   +R+   +
Sbjct: 202 KAWGGVM-LSRDKGKTW---GPVITIGEKANL--PLAAETDVILLKDGTLYAALRAQQEV 255

Query: 212 GRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVS-----RGVLKVA 266
             +  ++S D G TWS  +       +      +LK G +LL     +     +G   + 
Sbjct: 256 -HMHFAKSKDMGKTWSDVEDIGFRGHSPSF--TRLKSGEILLTTRAFTDAPSKKGYTGLR 312

Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
           +S+D+  +W     ++E L    +YP+ ++  DGSV + Y
Sbjct: 313 ISRDEAKTWEGPYMVDETLG---AYPSTVELKDGSVLVVY 349


>gi|406835791|ref|ZP_11095385.1| hypothetical protein SpalD1_29249 [Schlesneria paludicola DSM
           18645]
          Length = 405

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 3/106 (2%)

Query: 167 VDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTW 226
            DA  +W+ + P+   +  L+  +P+ +   +  L  L R     G +  S S D G TW
Sbjct: 240 TDAIDAWKSF-PVLGSSSELAAEEPLWWVLPDNRLLALFRDNRHSGFLYRSFSRDNGQTW 298

Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT--VSRGVLKVALSKD 270
                T   +  S I G++L+DGR +L  N     R  L ++LS D
Sbjct: 299 DTPVRTNFPDATSKIHGLRLRDGRYVLVSNANPKKRDPLVISLSND 344


>gi|223935468|ref|ZP_03627385.1| sialidase [bacterium Ellin514]
 gi|223895878|gb|EEF62322.1| sialidase [bacterium Ellin514]
          Length = 402

 Score = 42.7 bits (99), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 76/302 (25%), Positives = 124/302 (41%), Gaps = 33/302 (10%)

Query: 40  FLVAYF-GGSCEGAPDVKIWLQTFKDG-RW-QSP-IIADEEPNVPMWNPVLFKLPSNGLL 95
           FL + F GG+ + +P   + + +   G  W Q P ++ D   N  + +  L +L S  + 
Sbjct: 79  FLYSQFRGGAADESPSRIVSVYSDDAGLTWNQKPELVVDTGTNQNVMSVSLLRLKSGKIA 138

Query: 96  LFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL--PPGILGPSKNKPILLENGLLLC---- 149
           LFY I Q +      ++ S ++  TWS  + +   PG    + ++ I L NG L+     
Sbjct: 139 LFYLIKQNLHDCHPWIQLSSDEAKTWSSPKLVVDAPGYFVLNNDRVIQLNNGRLIVPVAF 198

Query: 150 -GSSVESWNSWGSWMEVTV-------DAGRSWRKYGPIY-IPNESLSVIQ-PVPFHTANR 199
             S     N + ++    +       D G+ W +    + +P  S + +Q P     A+ 
Sbjct: 199 HRSRNSKLNDYSTFDPRAIALWYLSDDEGQIWHEAETWWAMPLASKTGLQEPGVVERADG 258

Query: 200 TLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK-DGRLLLAYNTV 258
           T  VL  +    G      S + G TWS    T+L +P S     +L     LL  +N  
Sbjct: 259 T--VLSWARTDAGAQYGFASANSGATWSAPIATELKSPVSPASIKRLPGSAELLAVFNDH 316

Query: 259 S---------RGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           S         R  L  A+S D G +W     LE + A  + Y A I  ++ +V   Y   
Sbjct: 317 SGQFPFPKGKRTPLVAAISPDGGRTWKKRKILENDPAGSYCYTA-IHFTENAVLFAYLDF 375

Query: 310 RT 311
           RT
Sbjct: 376 RT 377


>gi|410099125|ref|ZP_11294098.1| hypothetical protein HMPREF1076_03276 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409219606|gb|EKN12567.1| hypothetical protein HMPREF1076_03276 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 706

 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 50/236 (21%), Positives = 99/236 (41%), Gaps = 26/236 (11%)

Query: 101 GQEVQKWSGCMKRSYNKGVTWSEREQLPPGILG-----------PSKNKPILLE--NGL- 146
           G E+    G + RS++KG TW E   +PP +             P+ N+  + E  NG  
Sbjct: 466 GGEIAFLGGYILRSFDKGKTW-EDPIIPPTLDSEIYISATGEPLPTYNRGAMCEGKNGKI 524

Query: 147 --LLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVL 204
             ++ G          + + V+ D G +W +Y  +   ++S++  +   + T    +   
Sbjct: 525 YWIVAGHDPAPLGKTSNHLLVSEDKGETW-QYSGLVATDDSVAFNEASVYETPKGDIVGF 583

Query: 205 MRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR--GV 262
           +R+       C++ S DGG T+ +       +P   +  ++L D R+LL Y    +  G+
Sbjct: 584 LRT-TLYDHACIARSTDGGKTFEWKSMQFQGHP---LAALRLPDNRVLLTYGYRHKPFGI 639

Query: 263 LKVALSKDDGDSW--HDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
               L+ +  D     + +  ++    +  YP  I   D  V + Y YN+   +++
Sbjct: 640 RARILNAECTDYAISEEFILRDDGDGPDLGYPWPIMLEDNRVLVVYYYNKNGHRNI 695


>gi|420414952|ref|ZP_14914068.1| hypothetical protein HPNQ4053_0541 [Helicobacter pylori NQ4053]
 gi|393033919|gb|EJB34980.1| hypothetical protein HPNQ4053_0541 [Helicobacter pylori NQ4053]
          Length = 372

 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 133/313 (42%), Gaps = 36/313 (11%)

Query: 26  SCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP--IIADEEPN--- 78
           S HAS ++ +   + L AYF G+ EGA DVKI    F  K  RW     I+  E+ +   
Sbjct: 62  SVHASALISLPNDNLLSAYFSGTKEGARDVKISANLFDSKTNRWSEAFTILTKEDLSHHS 121

Query: 79  ----VPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGV-----TWSEREQLPP 129
                 + NP+LF L  + +LLF  +G  +  W+      +   +      ++ +  L P
Sbjct: 122 HEYIKKLGNPLLF-LHDDKILLFV-VGVSMGGWATSKIYQFESALESVRFKFARKLSLSP 179

Query: 130 --GILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187
              +    +NKP+   +G  +     E    +   ++   D   + R+   +  PN    
Sbjct: 180 FLNLSHLVRNKPLNTTDGGFMLPLYHELATQYPLLLK--FDQQNNPRE---LLRPNTLNH 234

Query: 188 VIQP--VPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVK 245
            +QP   PF    +   V+    +      M E+C     W     T L N +  ++ + 
Sbjct: 235 QLQPSLTPF----KDCAVMAFRNHSFKDSLMLETCKTPTAWQKPISTNLKNLDDSLNLLN 290

Query: 246 LKDGRLLLAY-NTVSRGVLKVALSK-DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVH 303
           L     L+   + +S    ++ LSK ++ +S+     L++  A E SYP+    +   + 
Sbjct: 291 LNGILYLIHNPSDLSLRRKELWLSKLENSNSFKTLKVLDK--ANEVSYPSY-SLNPHFID 347

Query: 304 ITYTYNRTQIKHV 316
           I YT+NR+ IKH+
Sbjct: 348 IVYTHNRSHIKHI 360


>gi|421882034|ref|ZP_16313327.1| predicted neuraminidase (sialidase) [Helicobacter bizzozeronii
          CCUG 35545]
 gi|375315749|emb|CCF81323.1| predicted neuraminidase (sialidase) [Helicobacter bizzozeronii
          CCUG 35545]
          Length = 352

 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 2/45 (4%)

Query: 25 KSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIW--LQTFKDGRW 67
          KS HA+T+  +D  + + AYFGGS EG  DVKI+  L   K   W
Sbjct: 53 KSMHAATLSVLDTDYLMGAYFGGSAEGVGDVKIYAHLYNLKTQEW 97


>gi|326773741|ref|ZP_08233024.1| sialidase A [Actinomyces viscosus C505]
 gi|326636971|gb|EGE37874.1| sialidase A [Actinomyces viscosus C505]
          Length = 771

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 95/247 (38%), Gaps = 48/247 (19%)

Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
           S + G+TWS    L P +       LG      I L NG       + L  ++ ++W + 
Sbjct: 510 SDDDGLTWSRPRDLNPELKQQWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNDQNWLAM 569

Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
            + +  + D G +W+              +  P    +E+ S+ +       +  L + M
Sbjct: 570 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERQDGVLVLFM 629

Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
           R+ +  GRV +SES D G TW       A P     PN+       G D +   +  L+L
Sbjct: 630 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 689

Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
            +    RG   + LS D G SW  + T        + Y  +    DG + I +  N  Q 
Sbjct: 690 PF----RGCGVIRLSLDGGRSWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 741

Query: 314 KHVVLLP 320
            +   LP
Sbjct: 742 LYFSRLP 748


>gi|257484989|ref|ZP_05639030.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
          syringae pv. tabaci str. ATCC 11528]
          Length = 113

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 4/79 (5%)

Query: 23 PFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD---GRWQSPIIADEEPNV 79
          P+   HA+ + E+  G  L  +F G+ EG  D+ + L + +D   G W  P    E+   
Sbjct: 15 PYAQNHAANMHELADGTLLCTWFAGTQEGMADIFVLL-SRRDPATGVWSEPQKMSEDATR 73

Query: 80 PMWNPVLFKLPSNGLLLFY 98
             NP+LF+ P   L L +
Sbjct: 74 SEQNPILFQAPGGPLWLIW 92


>gi|329947419|ref|ZP_08294623.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 170 str.
           F0386]
 gi|328525169|gb|EGF52220.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 170 str.
           F0386]
          Length = 724

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 83/211 (39%), Gaps = 44/211 (20%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
           +  S + GVTWS+   L P +       LG      I L +G       + L  ++  +W
Sbjct: 460 IAHSDDDGVTWSKPRDLNPQLKQPWMRFLGTCPGNGIALRHGPHAGRLVVPLYFNNDRNW 519

Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
            +  + +  + D G +WR              +  P    +E+ S+ +       + TL 
Sbjct: 520 LAMCATVAYSDDHGETWRLSHGPNEGRETSDGQLDPQTFVDETWSLHEATVVERQDGTLL 579

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWSYAK-----PTQLLNPNS-------GIDGVKLKDGR 250
           + MR+ +  GRV +SES D G TW   +     P     PN+         D +   +  
Sbjct: 580 LFMRNQHPHGRVAVSESHDAGQTWGTVRFDEDLPEIWCQPNAIRLPSPEDEDRIVFANAS 639

Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTL 281
           L+L +    RG   + LS D G +W  + T 
Sbjct: 640 LMLPF----RGCGVIRLSLDGGRTWKRSRTF 666


>gi|346970813|gb|EGY14265.1| oxidoreductase [Verticillium dahliae VdLs.17]
          Length = 337

 Score = 42.0 bits (97), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 57/134 (42%), Gaps = 15/134 (11%)

Query: 166 TVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
           TVD G +W   GP   P ++    + V   +A + + +L        R+ ++E  DGG +
Sbjct: 51  TVDGGATWTSVGPRLAPEDAALQFRDVQAFSATKAV-ILSIGEGTYSRIYVTE--DGGTS 107

Query: 226 WSYA----KPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWH--DAL 279
           W       +PT   N       V  +D    LA +    G  ++  +KD G SW   DA 
Sbjct: 108 WDLVFTNEEPTAFYNC------VDFEDQERGLAVSDPVDGKFRLVETKDGGRSWGIVDAS 161

Query: 280 TLEENLAMEFSYPA 293
            +   LA EF + A
Sbjct: 162 GMAPALAGEFGFSA 175


>gi|420243216|ref|ZP_14747166.1| BNR/Asp-box repeat protein [Rhizobium sp. CF080]
 gi|398063066|gb|EJL54823.1| BNR/Asp-box repeat protein [Rhizobium sp. CF080]
          Length = 370

 Score = 41.6 bits (96), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 46/100 (46%), Gaps = 14/100 (14%)

Query: 219 SCDGGLTWSY-AKPTQLLNPNSGIDGV-----KLKDGRLLL--AYNTVSRGVLKVALSKD 270
           S D G  W    +P     PN+G  G      +L DGRL L   Y     G L+V +S D
Sbjct: 258 SSDEGKNWELIGRPV----PNTGYAGNPPTINRLNDGRLALVFGYRDAPYG-LRVRVSSD 312

Query: 271 DGDSWHDALTLEENLAM-EFSYPAVIQASDGSVHITYTYN 309
           +G +W D + + ++  M +  YP  +   DG +   Y YN
Sbjct: 313 EGKTWSDDIIIRDDGGMSDLGYPRTVVRPDGKLLSVYYYN 352


>gi|149179051|ref|ZP_01857624.1| probable sialidase [Planctomyces maris DSM 8797]
 gi|148842091|gb|EDL56481.1| probable sialidase [Planctomyces maris DSM 8797]
          Length = 358

 Score = 41.6 bits (96), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 100/229 (43%), Gaps = 31/229 (13%)

Query: 91  SNGLLLFYKIGQ------EVQKWSGCM-KRSYNKGVTWSEREQLPPGILGPSKNK---PI 140
           +NG LL  + G       E     G M K+S ++G TWSE   L P    P+K +   P 
Sbjct: 28  NNGDLLILQDGSILFCYTEYGDHGGIMAKKSVDQGKTWSEPWTLVPQPKLPAKGRYVHPS 87

Query: 141 LL--ENGLLLCGSSVESWNSWGSWMEV----TVDAGRSWRKYGPI-------YIPNESLS 187
           LL  +NG +L   +  +  +   +  V    + D G++W +  P+        + N+ L 
Sbjct: 88  LLRIKNGEILLAYNYSTHPAKPYYATVLYRRSADEGQTWTEQLPVSAYPGYNLVHNDKLL 147

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMS-ESCDGGLTWSYAKPTQLLNPNSGI----- 241
            ++           + +  S +  G V ++  S D G +W  +K T  L     I     
Sbjct: 148 SLRDGRIIAVGENKKYMPDSNDHNGYVGLTFYSDDQGHSWHSSKNTVDLYGTRKIEVQEA 207

Query: 242 DGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFS 290
           D V+LKDGRL++   + S G    A S D G++W +   L  +L M ++
Sbjct: 208 DAVELKDGRLMMFARSYS-GYPVRAYSTDRGETWSEG-KLIRDLKMPYA 254



 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 64/283 (22%), Positives = 110/283 (38%), Gaps = 49/283 (17%)

Query: 67  WQSPIIADEEPNVP----MWNPVLFKLPSNGLLLFYKIGQEVQK---WSGCMKRSYNKGV 119
           W  P     +P +P      +P L ++ +  +LL Y       K    +   +RS ++G 
Sbjct: 65  WSEPWTLVPQPKLPAKGRYVHPSLLRIKNGEILLAYNYSTHPAKPYYATVLYRRSADEGQ 124

Query: 120 TWSEREQLP----PGILGPSKNKPILLENGLLLCGSSVESW----NSWGSWMEVTV---D 168
           TW+E  QLP    PG      +K + L +G ++     + +    N    ++ +T    D
Sbjct: 125 TWTE--QLPVSAYPGYNLVHNDKLLSLRDGRIIAVGENKKYMPDSNDHNGYVGLTFYSDD 182

Query: 169 AGRSWRK-------YGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 221
            G SW         YG        + V +       +  L +  RS++G      + S D
Sbjct: 183 QGHSWHSSKNTVDLYG-----TRKIEVQEADAVELKDGRLMMFARSYSGY--PVRAYSTD 235

Query: 222 GGLTWSYAKPTQLLN-PNSGIDGVKL--KDGRLLLAY----NTVS--------RGVLKVA 266
            G TWS  K  + L  P +G+  V+     G LL  +    +T+         R  L  A
Sbjct: 236 RGETWSEGKLIRDLKMPYAGLPTVRRIPSTGDLLFIWVSEKSTIEENGRKLPMRSALTTA 295

Query: 267 LSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYN 309
           +S+D+G ++     + ++   +F Y  +    D    I Y  N
Sbjct: 296 ISRDEGKTFLHQRNIVQDPTNDFGYQCIEFIDDDLALIAYHTN 338


>gi|320534602|ref|ZP_08035050.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
           F0337]
 gi|320133178|gb|EFW25678.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
           F0337]
          Length = 760

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 57/250 (22%), Positives = 95/250 (38%), Gaps = 48/250 (19%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
           +  S + GVTWS    L P +       LG      I L NG       + L  ++  +W
Sbjct: 499 IAHSDDDGVTWSTPRDLNPQVKQPWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNERNW 558

Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
            +  + +  + D G +W+                 P    +E+ S+ +       +  L 
Sbjct: 559 LAMCATVAYSDDHGETWQLGHGPNEGRQTPEGALDPQTFVDETWSLHEAAVVERQDGVLL 618

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-----------YAKPTQLLNPN-SGIDGVKLKDGR 250
           + MR+ +  GRV +SES D G TW            + +P  +  P+  G D +   +  
Sbjct: 619 LFMRNQHPRGRVAVSESHDAGQTWGPIRFDEELPEIWCQPNAISLPSPEGEDRIVFANAS 678

Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
           L+L +    RG   + LS D G +W  + T        + Y  +    DG + I +  N 
Sbjct: 679 LMLPF----RGCGVIRLSLDGGRTWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NE 730

Query: 311 TQIKHVVLLP 320
            Q  +   LP
Sbjct: 731 CQGLYFSRLP 740


>gi|325068374|ref|ZP_08127047.1| exo-alpha-sialidase [Actinomyces oris K20]
          Length = 716

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 95/247 (38%), Gaps = 48/247 (19%)

Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
           S + G+TWS    L P +       LG      I L NG       + L  ++ ++W + 
Sbjct: 455 SDDDGLTWSRPRDLNPELKQPWMRFLGTCPGNGIALRNGPHAGRLIVPLYFNNDQNWLAM 514

Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
            + +  + D G +W+              +  P    +E+ S+ +       +  L + M
Sbjct: 515 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERHDGVLVLFM 574

Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
           R+ +  GRV +SES D G TW       A P     PN+       G D +   +  L+L
Sbjct: 575 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 634

Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
            +    RG   + LS D G +W  + T        + Y  +    DG + I +  N  Q 
Sbjct: 635 PF----RGCGVIRLSLDGGRTWKHSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 686

Query: 314 KHVVLLP 320
            +   LP
Sbjct: 687 LYFSRLP 693


>gi|149178198|ref|ZP_01856792.1| Sialidase (Precursor) [Planctomyces maris DSM 8797]
 gi|148842980|gb|EDL57349.1| Sialidase (Precursor) [Planctomyces maris DSM 8797]
          Length = 385

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 68/145 (46%), Gaps = 16/145 (11%)

Query: 170 GRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYA 229
           G++W    PI       +V++       +  L + MRS++G  R  ++ S DGG+TWS  
Sbjct: 227 GKTWELSEPIGEKTNECAVVE-----LTDGRLLMNMRSYHGENRRAIAFSSDGGMTWSDV 281

Query: 230 KPTQ-LLNP--NSGIDGVKLKD----GRLLLAY-NTVSRGVLKVALSKDDGDSWHDALTL 281
              Q L+ P   + +  V+  D    G++L +   +  R  + V LS+DDG +W  +  +
Sbjct: 282 SLDQALIEPVCQASLIRVRFPDVEKPGQILFSNPASKKREKMVVRLSEDDGRTWTASRLV 341

Query: 282 EENLAMEFSYPAVIQASDGSVHITY 306
               A   +Y ++    DG V + Y
Sbjct: 342 TPGSA---AYSSLAGIGDGKVGLLY 363


>gi|343082891|ref|YP_004772186.1| hypothetical protein [Cyclobacterium marinum DSM 745]
 gi|342351425|gb|AEL23955.1| hypothetical protein Cycma_0172 [Cyclobacterium marinum DSM 745]
          Length = 395

 Score = 41.6 bits (96), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 42/221 (19%), Positives = 87/221 (39%), Gaps = 22/221 (9%)

Query: 109 GCMKRSYNKGVTWS----------EREQLPPGILGPSKNKPILLENGL-----LLCGSSV 153
           G + +S + G TW           E    P G   P+ N+  + E        ++  S  
Sbjct: 163 GYLVKSVDGGETWKGPIYPPSIAPEVNHTPMGNKVPAYNRGAMYEGKSGRIFWIVAASDT 222

Query: 154 ESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGR 213
           E      +++ V+ D G +W     + + +E +   +   + T    +   +R+     +
Sbjct: 223 EKRGKTSNYLMVSDDKGNTWEHLSTVAVDDEVI-FNEASAYETPKGDIVAFIRTAKYDDQ 281

Query: 214 VCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSR--GVLKVALSKD- 270
             ++ S DGG T+   K   +      +  ++L D R+ L Y    +  G+    L+ + 
Sbjct: 282 AVIARSKDGGKTFD--KWESMGFQGHPLQALRLPDNRVFLVYGYRHKPYGIRARILNAEC 339

Query: 271 -DGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNR 310
            D  +  + +  E+  + +  YP  +   D  V +TY +N+
Sbjct: 340 TDFATAEEIIIREDGGSTDIGYPWAVNMGDDKVLVTYYFNK 380


>gi|422406819|ref|ZP_16483838.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
           syringae pv. glycinea str. race 4]
 gi|330882050|gb|EGH16199.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           glycinea str. race 4]
          Length = 203

 Score = 41.2 bits (95), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180


>gi|422605480|ref|ZP_16677493.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
           syringae pv. mori str. 301020]
 gi|330889135|gb|EGH21796.1| BNR/Asp-box repeat-containing protein [Pseudomonas syringae pv.
           mori str. 301020]
          Length = 216

 Score = 41.2 bits (95), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180


>gi|257482882|ref|ZP_05636923.1| BNR/Asp-box repeat-containing protein, partial [Pseudomonas
           syringae pv. tabaci str. ATCC 11528]
          Length = 271

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 48/115 (41%), Gaps = 15/115 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSPIIADEEPNV----- 79
            H+S +  +  G  +  +F GS EGA DV+I    F   +G W    +     +      
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDASNGEWGGEQVLATRASTQSGTG 126

Query: 80  ----PMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWSPPWQL 180


>gi|87310347|ref|ZP_01092477.1| probable sialidase [Blastopirellula marina DSM 3645]
 gi|87286846|gb|EAQ78750.1| probable sialidase [Blastopirellula marina DSM 3645]
          Length = 649

 Score = 41.2 bits (95), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 68/159 (42%), Gaps = 13/159 (8%)

Query: 147 LLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMR 206
           L   + V+  N +     ++ D G++WR  G   +       ++P          RV+M 
Sbjct: 458 LASTADVQRVNHFVCQCCLSDDGGKTWR-LGKESVDYPKRGAMEPEVVEIDGN--RVMMI 514

Query: 207 SFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKL-KDGRLLLAYNTV------- 258
             N +G +  + S DGG TWS  +   +  P +     ++   G LLL +N         
Sbjct: 515 FRNQLGHIAHAMSEDGGETWSEPQSLGVPGPEAPATIRRIPSTGDLLLIWNDNVEKGRHH 574

Query: 259 --SRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVI 295
             SR  L +A+S D+G +W     +E     E++Y +V+
Sbjct: 575 GGSRNPLSLAISSDEGKTWRLRRNVEAEKGHEYAYTSVL 613


>gi|432893410|ref|ZP_20105422.1| hypothetical protein A31K_02555 [Escherichia coli KTE165]
 gi|431424390|gb|ELH06486.1| hypothetical protein A31K_02555 [Escherichia coli KTE165]
          Length = 129

 Score = 41.2 bits (95), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 2/73 (2%)

Query: 28  HASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPNVPMWNPV 85
           HA+ I+ +  G  +  +F G+ EG  D+ +W      G  +W   +    +      NPV
Sbjct: 33  HAANILPLPDGALMCVWFAGTQEGIADISVWGSRLPAGGMQWSDAVKLSHDDTRSEQNPV 92

Query: 86  LFKLPSNGLLLFY 98
           LF  P N L L +
Sbjct: 93  LFLAPDNVLWLLW 105


>gi|336411775|ref|ZP_08592235.1| hypothetical protein HMPREF1018_04253 [Bacteroides sp. 2_1_56FAA]
 gi|335940119|gb|EGN01987.1| hypothetical protein HMPREF1018_04253 [Bacteroides sp. 2_1_56FAA]
          Length = 567

 Score = 40.8 bits (94), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|423283708|ref|ZP_17262592.1| hypothetical protein HMPREF1204_02130 [Bacteroides fragilis HMW
           615]
 gi|404580746|gb|EKA85454.1| hypothetical protein HMPREF1204_02130 [Bacteroides fragilis HMW
           615]
          Length = 567

 Score = 40.8 bits (94), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|412992355|emb|CCO20068.1| hypothetical protein PTSG_00864 [Bathycoccus prasinos]
          Length = 518

 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 69/173 (39%), Gaps = 42/173 (24%)

Query: 165 VTVDAGRSWRKYGPIYIPNESL---SVIQPVPFHTANRTL---------------RVLMR 206
           ++ D G++W+  G I +PN  L   SV++     T N                  R+ M 
Sbjct: 286 ISKDKGKTWKAQGSITLPNTWLIEQSVVELTSKATNNGGGGGGDENDHNNNISVNRIAMF 345

Query: 207 SFNGIGRVCMSESCDGGLTWSY-AKPTQLL-NPNSGI-------------DG-------V 244
                G V   +S D G TW + AKP   L NPN+ +             DG        
Sbjct: 346 CRTAAGHVFRIDSNDLGATWEHKAKPVLALPNPNAKLTMKTFGPSGRYRADGGGKNWIVA 405

Query: 245 KLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENL--AMEFSYPAVI 295
            + D + L      SR  L++ +S D+G SW    ++EE +   +   YP +I
Sbjct: 406 VMNDHKELPQPYRRSRSKLRLIVSHDNGKSWTRLASIEEGIKPGLRHHYPTLI 458


>gi|378766118|ref|YP_005194579.1| Signal peptidase I [Pantoea ananatis LMG 5342]
 gi|386016767|ref|YP_005935061.1| signal peptidase I LepB [Pantoea ananatis AJ13355]
 gi|386078352|ref|YP_005991877.1| signal peptidase I LepB [Pantoea ananatis PA13]
 gi|327394843|dbj|BAK12265.1| signal peptidase I LepB [Pantoea ananatis AJ13355]
 gi|354987533|gb|AER31657.1| signal peptidase I LepB [Pantoea ananatis PA13]
 gi|365185592|emb|CCF08542.1| Signal peptidase I [Pantoea ananatis LMG 5342]
          Length = 324

 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 47/97 (48%), Gaps = 12/97 (12%)

Query: 214 VCMSESCDGG--------LTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKV 265
           + ++ +CD G        +T+S  +P++ +   SG DG +  +G   L  N   RG L++
Sbjct: 165 LTINPACDNGKPCSKALPVTYSNIEPSKFVQTFSGFDGNEAGNGFYQLPLNDTMRGGLRL 224

Query: 266 ALSKDD-GDSWHDALTLEE---NLAMEFSYPAVIQAS 298
           A  K+  GD  HD L + E     +M +  P   QAS
Sbjct: 225 ATRKETLGDVTHDILLVTEAQSQASMYYQQPGQPQAS 261


>gi|289674394|ref|ZP_06495284.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
           syringae pv. syringae FF5]
          Length = 178

 Score = 40.8 bits (94), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 47/110 (42%), Gaps = 15/110 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  E       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRESTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWS 122
                + NPV+   P N L LFY +   V  W+G       S + G +WS
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNAMVSSDMGASWS 175


>gi|60682407|ref|YP_212551.1| hypothetical protein BF2937 [Bacteroides fragilis NCTC 9343]
 gi|60493841|emb|CAH08632.1| putative exported protein [Bacteroides fragilis NCTC 9343]
          Length = 567

 Score = 40.8 bits (94), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|53714386|ref|YP_100378.1| hypothetical protein BF3099 [Bacteroides fragilis YCH46]
 gi|265766163|ref|ZP_06094204.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|423260884|ref|ZP_17241786.1| hypothetical protein HMPREF1055_04063 [Bacteroides fragilis
           CL07T00C01]
 gi|423267019|ref|ZP_17246001.1| hypothetical protein HMPREF1056_03688 [Bacteroides fragilis
           CL07T12C05]
 gi|52217251|dbj|BAD49844.1| hypothetical protein [Bacteroides fragilis YCH46]
 gi|263253831|gb|EEZ25296.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
 gi|387774645|gb|EIK36755.1| hypothetical protein HMPREF1055_04063 [Bacteroides fragilis
           CL07T00C01]
 gi|392697722|gb|EIY90905.1| hypothetical protein HMPREF1056_03688 [Bacteroides fragilis
           CL07T12C05]
          Length = 567

 Score = 40.8 bits (94), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|237804114|ref|ZP_04591699.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
           syringae pv. oryzae str. 1_6]
 gi|331026095|gb|EGI06151.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae pv.
           oryzae str. 1_6]
          Length = 115

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 30/59 (50%)

Query: 213 RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDD 271
           RV  S + DGG TWS   P    NPNS +  V +    LL+A N +  G  K++L   D
Sbjct: 3   RVLASRTEDGGQTWSEPYPLTPSNPNSSLAAVGVPGKGLLVALNDLREGRFKLSLYGTD 61


>gi|66361505|pdb|2BER|A Chain A, Y370g Active Site Mutant Of The Sialidase From
           Micromonospora Viridifaciens In Complex With Beta-Neu5ac
           (Sialic Acid)
          Length = 601

 Score = 40.4 bits (93), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 86/209 (41%), Gaps = 36/209 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313

Query: 229 AKPTQLLNPN--SGIDGVKLKDGRLLLAY 255
           +K   +  P   SG     L DG   L Y
Sbjct: 314 SK---VFQPGSMSGSTLTALPDGTYGLLY 339


>gi|255532110|ref|YP_003092482.1| hypothetical protein Phep_2216 [Pedobacter heparinus DSM 2366]
 gi|255345094|gb|ACU04420.1| hypothetical protein Phep_2216 [Pedobacter heparinus DSM 2366]
          Length = 652

 Score = 40.4 bits (93), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 115/283 (40%), Gaps = 58/283 (20%)

Query: 84  PVLFKLPSNGLLLFY-----------KIGQE---VQKWSGCMKRSYNKGVTWSEREQLPP 129
           P L K P+  LLLF            ++ Q    V      MKRS + G TWS +  +  
Sbjct: 305 PALAKTPNGELLLFAEGRSTPTSAKERVAQNTMPVGNTDIIMKRSADGGNTWSNQLTVLA 364

Query: 130 G----------ILGPSKNKPILLE----NGLLLCGSSVESW--NSWGSWMEVTVDAGRSW 173
           G           +  + N  I+L+     G     S+  ++  NS   +   + D G SW
Sbjct: 365 GNGSNLTYCFPQVVTTDNGKIILQYSTIGGAFAANSNAFTYDVNSQRIYQIESNDNGLSW 424

Query: 174 RKYGPIYIP---NESLSVIQPVPFH-----TANRTLRVLMRSFNGIGRVCMSESCDGGLT 225
               P+ I    N +   IQ    H     +A    R++M        V M+ S DGG+T
Sbjct: 425 TI--PVEITAFKNTAAGYIQGGAVHGIALKSATYNKRLVMPLTYSTNVVRMALSDDGGIT 482

Query: 226 WSYAKP---TQLLNPNSGIDGVKLKDGRLLLA---YNTVSRGVLKVALSKDDGDSWHDAL 279
           W   K    ++L         V+L DGRL++     NT  +  L V+ S D GD+W  A 
Sbjct: 483 WRLGKAISGSRLTKATV----VELSDGRLMMVAGHTNTSPKNKL-VSYSSDGGDNWTAAA 537

Query: 280 TLEENLAM-EFSY---PAVIQASDGSVHITYTYNR---TQIKH 315
            +  ++   ++ +     +++  DG + +    NR   T++K+
Sbjct: 538 NISADVKTGDYGHLFGGIILKGKDGEILLVNPINRETDTEVKN 580


>gi|423271118|ref|ZP_17250089.1| hypothetical protein HMPREF1079_03171 [Bacteroides fragilis
           CL05T00C42]
 gi|423274942|ref|ZP_17253888.1| hypothetical protein HMPREF1080_02541 [Bacteroides fragilis
           CL05T12C13]
 gi|392699042|gb|EIY92224.1| hypothetical protein HMPREF1079_03171 [Bacteroides fragilis
           CL05T00C42]
 gi|392704221|gb|EIY97360.1| hypothetical protein HMPREF1080_02541 [Bacteroides fragilis
           CL05T12C13]
          Length = 567

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGQLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|171911563|ref|ZP_02927033.1| glycoside hydrolase family 33, candidate sialidase
           [Verrucomicrobium spinosum DSM 4136]
          Length = 391

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 58/237 (24%), Positives = 95/237 (40%), Gaps = 43/237 (18%)

Query: 76  EPNVPMWNPVLFKLPSNGLLLFYKIGQEVQ----KWSGCMKRSYNKGVTWSEREQLPPGI 131
           +PN  + +  L +LP    +LF   G + +     ++G   RS ++G TW+  E    G 
Sbjct: 55  KPNSMVCDTTLRELPDGSWILFILAGGDTEPSPLNYTGVT-RSKDQGRTWTPLEAFDVGF 113

Query: 132 LGPSKNKPI-------LLENG--LLLCGSSVESW-NSWGSWMEVTVDAGRSWRKYGPIYI 181
             P + K I       ++ +G   L   +  + W N W SW   + D+ ++W K  P  +
Sbjct: 114 --PREGKTIGQGPTELMIRDGRATLYFSTHSKHWANDWRSWFLTSDDSFKTWSK--PQEV 169

Query: 182 PNESLSVIQPVPFHTANRTLRVLMRSFNGIG-----------------RVCMSESCDGGL 224
           P   L     +  H   +  R+L+   + IG                 R  +  S DGG 
Sbjct: 170 PGR-LKERTFIRNHIITKDGRILLPFQHYIGPEAEQDKAPLDRAFTNPRNGVIMSSDGGK 228

Query: 225 TWSYAKPTQLLNPNSGI-----DGVKLKDGRLLLAYNTVSR-GVLKVALSKDDGDSW 275
           TW+     +L   +        +  +L DGR+ +        GVL  A SKD G +W
Sbjct: 229 TWTEHGNIRLTTDDKYFGWAENNLAELSDGRIAMVIRGDKLGGVLYYAESKDGGRTW 285


>gi|86141536|ref|ZP_01060082.1| hypothetical protein MED217_05942 [Leeuwenhoekiella blandensis
           MED217]
 gi|85832095|gb|EAQ50550.1| hypothetical protein MED217_05942 [Leeuwenhoekiella blandensis
           MED217]
          Length = 595

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 28/126 (22%)

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-------YAKPTQLLNPNSGIDGVKLKDGRLLLAY 255
            L+ ++   G++ +  S D G TWS       Y        P    D ++LK+G +LLAY
Sbjct: 90  ALVVTYESHGKILVKRSTDLGQTWSSLIVVADYKNGVSNTTP----DLIELKNGTILLAY 145

Query: 256 -----NTVS---RGVLKVALSKDDGDSWHD------ALTLEENLAMEFSYPAVIQASDGS 301
                NT S      +KV  S D G SWH+      A TL EN A E   P  +Q   G 
Sbjct: 146 NPRPGNTASVDEHFSIKVIQSSDGGRSWHNDQTVYTADTLFENGAWE---PTFLQLPSGE 202

Query: 302 VHITYT 307
           + + ++
Sbjct: 203 IQLYFS 208


>gi|422641291|ref|ZP_16704715.1| BNR repeat-containing glycosyl hydrolase, partial [Pseudomonas
           syringae Cit 7]
 gi|330953679|gb|EGH53939.1| BNR repeat-containing glycosyl hydrolase [Pseudomonas syringae Cit
           7]
          Length = 206

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 49/115 (42%), Gaps = 15/115 (13%)

Query: 27  CHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTF--KDGRWQSP-IIADEEPNVP--- 80
            H+S +  +  G  +  +F GS EGA DV+I    F    G W    ++A  +       
Sbjct: 67  VHSSAVTALPGGDLMSVWFAGSREGAGDVEIRTSRFDAASGEWGGEQVLATRDSTQSGTG 126

Query: 81  -----MWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKR---SYNKGVTWSEREQL 127
                + NPV+   P N L LFY +   V  W+G       S + G +WS   QL
Sbjct: 127 KYIRKLGNPVIALAPDNRLWLFY-VSVSVGGWAGSTVNGMVSSDMGSSWSPPWQL 180


>gi|343086452|ref|YP_004775747.1| glycosyl hydrolase family protein [Cyclobacterium marinum DSM 745]
 gi|342354986|gb|AEL27516.1| glycosyl hydrolase BNR repeat-containing protein [Cyclobacterium
           marinum DSM 745]
          Length = 385

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 74/175 (42%), Gaps = 22/175 (12%)

Query: 159 WGSWMEVTVDAGRSWRKYGPIYIP-NESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMS 217
           +G+ +  + D G +W   G I    NES    Q +       TL + MRS+ G  R  +S
Sbjct: 214 YGAHVIFSDDHGVTWELGGSIRPKMNES----QLIEMADGKGTLLMNMRSYYGKNRRALS 269

Query: 218 ESCDGGLTWSY-AKPTQLLNPNSGIDGVKL-------KDGRLLLAYNTVSRGVLKVALSK 269
            S DGG TWS   + T L+ P      ++        KD  L    ++  R  + + LSK
Sbjct: 270 VSKDGGETWSDPVEATSLIEPVCQAAFIRYSWPDEKGKDLVLFSNPSSTKRENMTIKLSK 329

Query: 270 DDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT------YNRTQIKHVVL 318
           D G +W  +  L +  +   +Y A+    +G +   Y       Y +  + H+ L
Sbjct: 330 DSGKTWIASKVLNQGPS---AYSALAALPNGDILCLYEKGEKNPYEKLVLSHLSL 381


>gi|440717064|ref|ZP_20897563.1| Exo-alpha-sialidase [Rhodopirellula baltica SWK14]
 gi|436437890|gb|ELP31484.1| Exo-alpha-sialidase [Rhodopirellula baltica SWK14]
          Length = 409

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 107/259 (41%), Gaps = 35/259 (13%)

Query: 91  SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
           +NG LL +  G++    S     MKRS + G TWS+   L   G L  +  + +LL++G 
Sbjct: 74  NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSFNNPQAVLLQSGR 133

Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
           +L             S     NS+ ++++ + D G++W   R           ++ +   
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKRERKVTSVAAG 193

Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
           P             R++M    G     RV  + S D G TW+     P       + + 
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253

Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVI---Q 296
            V+L DGR++L   T  +G     KVA+S + G +W   L +++ L       +++   +
Sbjct: 254 MVELSDGRVMLNSRTQGKGSTKHRKVAVSANGGQTWS-QLQIDQTLIEPTCQASILRYSR 312

Query: 297 ASDGSVHITYTYNRTQIKH 315
            S+G   I +    +Q K 
Sbjct: 313 PSEGKSRIIFANPASQTKR 331


>gi|365828125|ref|ZP_09369956.1| hypothetical protein HMPREF0975_01739 [Actinomyces sp. oral taxon
           849 str. F0330]
 gi|365264307|gb|EHM94117.1| hypothetical protein HMPREF0975_01739 [Actinomyces sp. oral taxon
           849 str. F0330]
          Length = 770

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 52/234 (22%), Positives = 90/234 (38%), Gaps = 47/234 (20%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESW 156
           +  S + G TWS+   L P +       LG      I L +G       + L  ++  +W
Sbjct: 506 IAHSDDDGATWSKPRDLNPQLKQPWMRFLGTCPGNGIALRHGPRPGRLVIPLYFNNDGNW 565

Query: 157 NSWGSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLR 202
            +  + +  + D G +W                  P    +E+ S+ +       + TL 
Sbjct: 566 LAMCATVAYSDDHGETWNLGHGPNDGRRTPDGALDPRTFVDETWSLHEAAVVERRDGTLV 625

Query: 203 VLMRSFNGIGRVCMSESCDGGLTWS-----------YAKPTQLLNPNS-GIDGVKLKDGR 250
           + MR+ +  GRV +SES D G TW            + +P  +  P++ G D +   +  
Sbjct: 626 LYMRNQHPCGRVAVSESHDAGQTWGPIDFDEDLPEIWCQPNAICLPSAEGEDRIVFANAS 685

Query: 251 LLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           L+L +    RG   + LS D G +W  + T        + Y  +    DG + I
Sbjct: 686 LMLPF----RGCGVIRLSLDGGRTWRRSRTFNPG---HYVYQCMCALPDGRIGI 732


>gi|289807542|ref|ZP_06538171.1| hypothetical protein Salmonellaentericaenterica_25287 [Salmonella
          enterica subsp. enterica serovar Typhi str. AG3]
          Length = 55

 Score = 39.7 bits (91), Expect = 1.7,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 25/49 (51%)

Query: 9  GLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKI 57
           LV  +   P +      CHAST+V +  G  + A+F G CEG+ D  I
Sbjct: 4  ALVNRQVILPESGTESFQCHASTLVRLPCGTLVAAWFAGLCEGSEDTAI 52


>gi|585539|sp|Q02834.1|NANH_MICVI RecName: Full=Sialidase; AltName: Full=Neuraminidase; Flags:
           Precursor
 gi|216784|dbj|BAA00852.1| neuraminidase [Micromonospora viridifaciens]
          Length = 647

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 189 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 246

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 247 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 299

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 300 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 359

Query: 229 AK 230
           +K
Sbjct: 360 SK 361


>gi|317501714|ref|ZP_07959904.1| hypothetical protein HMPREF1026_01848 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|336439652|ref|ZP_08619260.1| hypothetical protein HMPREF0990_01654 [Lachnospiraceae bacterium
           1_1_57FAA]
 gi|316896882|gb|EFV18963.1| hypothetical protein HMPREF1026_01848 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|336015616|gb|EGN45427.1| hypothetical protein HMPREF0990_01654 [Lachnospiraceae bacterium
           1_1_57FAA]
          Length = 1310

 Score = 39.7 bits (91), Expect = 1.9,   Method: Composition-based stats.
 Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 13/97 (13%)

Query: 216 MSESCDGGLTWSYAKPTQLLNPNSGI------DGVKLKDGRLLLAYNTVSRGVLKVA--L 267
           +++S D G TWS  +     N +  +       G+    GRL+      + GV + +   
Sbjct: 267 ITKSTDNGATWSAPQLVDAKNESEQVFLIGPGRGITTSTGRLIFPCYQYTSGVQRTSTIY 326

Query: 268 SKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHI 304
           S DDG +WH   T+  N     S  AVI  +DG +++
Sbjct: 327 SDDDGTTWHRGATVSGN-----SSEAVITEADGRLYL 358


>gi|157831017|pdb|1EUT|A Chain A, Sialidase, Large 68kd Form, Complexed With Galactose
 gi|157831018|pdb|1EUU|A Chain A, Sialidase Or Neuraminidase, Large 68kd Form
          Length = 605

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 147 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 204

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 205 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 257

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 258 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 317

Query: 229 AK 230
           +K
Sbjct: 318 SK 319


>gi|82407379|pdb|1WCQ|A Chain A, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
           Sialidase To Phenylalanine.
 gi|82407380|pdb|1WCQ|B Chain B, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
           Sialidase To Phenylalanine.
 gi|82407381|pdb|1WCQ|C Chain C, Mutagenesis Of The Nucleophilic Tyrosine In A Bacterial
           Sialidase To Phenylalanine
          Length = 601

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313

Query: 229 AK 230
           +K
Sbjct: 314 SK 315


>gi|55670665|pdb|1W8N|A Chain A, Contribution Of The Active Site Aspartic Acid To Catalysis
           In The Bacterial Neuraminidase From Micromonospora
           Viridifaciens.
 gi|56554377|pdb|1W8O|A Chain A, Contribution Of The Active Site Aspartic Acid To Catalysis
           In The Bacterial Neuraminidase From Micromonospora
           Viridifaciens
          Length = 601

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 201 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313

Query: 229 AK 230
           +K
Sbjct: 314 SK 315


>gi|320531595|ref|ZP_08032538.1| BNR/Asp-box repeat protein, partial [Actinomyces sp. oral taxon 171
           str. F0337]
 gi|320136172|gb|EFW28177.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 171 str.
           F0337]
          Length = 665

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D GR+W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 442 DHGRTWQAGQPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 496

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
              P Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 497 EPVPDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 554

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 555 GASWVASKVFHEPF---VGYTTIAVQSDGSI 582


>gi|311748209|ref|ZP_07721994.1| putative neuramidase [Algoriphagus sp. PR1]
 gi|126576699|gb|EAZ80947.1| putative neuramidase [Algoriphagus sp. PR1]
          Length = 388

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/140 (22%), Positives = 65/140 (46%), Gaps = 10/140 (7%)

Query: 150 GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFN 209
           G   ++++ + +    T D G+++    PI IP  + S+   +P    +R +  +     
Sbjct: 207 GDPQDNFSEYQAHGFFTDDHGKTYGISEPISIPGSNESIAAELP---NDRMIMSIRNQKG 263

Query: 210 GIGRVCMSESCDGGLTWSYAK-PTQLLNPNSGIDGVKLKD--GRLLLAY----NTVSRGV 262
            I +  ++ S DGG TW   +   +L +P      + L D  GR +L +    +T++R  
Sbjct: 264 DIRQRILAYSSDGGDTWDSEEFEEELPDPVCQGSIISLTDSRGRFILVHSNNADTLNRNN 323

Query: 263 LKVALSKDDGDSWHDALTLE 282
           L +  S D+G +W   + ++
Sbjct: 324 LSIKFSYDEGKTWPKTMLID 343


>gi|201067792|gb|ACH92894.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 51/107 (47%), Gaps = 10/107 (9%)

Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
           H      +  +R+ +G  +     S D G TW   +PT      +G+D    V+L DG L
Sbjct: 193 HAGRLVQQYTIRTADGTVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247

Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           +L +  + S G  KVA S D G +W + ++ ++NL        +I+A
Sbjct: 248 MLNSRASDSSGFRKVARSTDGGQTWSEPVS-DQNLPDSVDNAQIIRA 293


>gi|157831015|pdb|1EUR|A Chain A, Sialidase
 gi|157831016|pdb|1EUS|A Chain A, Sialidase Complexed With 2-Deoxy-2,3-Dehydro-N-
           Acetylneuraminic Acid
          Length = 365

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 77/182 (42%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 147 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 204

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+    +NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 205 TWRAGEAVGVGM---DENKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 257

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 258 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 317

Query: 229 AK 230
           +K
Sbjct: 318 SK 319


>gi|421609699|ref|ZP_16050887.1| Exo-alpha-sialidase [Rhodopirellula baltica SH28]
 gi|408499472|gb|EKK03943.1| Exo-alpha-sialidase [Rhodopirellula baltica SH28]
          Length = 409

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 60/259 (23%), Positives = 108/259 (41%), Gaps = 35/259 (13%)

Query: 91  SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
           +NG LL +  G++    S     MKRS + G TWS+   L   G L  +  + ++L++G 
Sbjct: 74  NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSLNNPQAVVLQSGR 133

Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
           +L             S     NS+ ++++ + D G++W   R         + ++ +   
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKREKKVTSVAAG 193

Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
           P             R++M    G     RV  + S D G TW+     P       + + 
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253

Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ--- 296
            V+L DGR++L   T  +G     KVA+S + G +W   L +++ L       ++++   
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQMDQTLIEPTCQASILRYSW 312

Query: 297 ASDGSVHITYTYNRTQIKH 315
            S G   I ++   +Q K 
Sbjct: 313 PSKGKSRIIFSNPASQTKR 331


>gi|198273964|ref|ZP_03206496.1| hypothetical protein BACPLE_00100 [Bacteroides plebeius DSM 17135]
 gi|198273042|gb|EDY97311.1| BNR/Asp-box repeat protein [Bacteroides plebeius DSM 17135]
          Length = 364

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 85/211 (40%), Gaps = 22/211 (10%)

Query: 114 SYNKGVTWSEREQLP-PGILGPSKNKPILLENGLLLCGSSVESWNSWGS-----WMEVTV 167
           S + G TWSE + L  P ++  S    + L N   L G     WN         W  ++ 
Sbjct: 131 SEDNGQTWSEIQSLKKPCVMAFSS--IVKLSNNDYL-GVYHRGWNDQDRAPLTLWSAISH 187

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANR-TLRVLMRSFNGIGRVCMSESCDGGLTW 226
           D G +W     I    E  S  +P  F++ ++  L  + R  N +G   M  S D G TW
Sbjct: 188 DGGLNWTPSVKI-AEVEGRSPCEPCVFYSPDKKQLVCIARENNRVGNSLMMFSNDEGKTW 246

Query: 227 SYAKPTQLLNPNSGIDGVKLKDGRLLLAYN-------TVSRGVLKVALSKD--DGDSWHD 277
           S  + T             LKDGRLL+A+        T    V  V   KD  +G S   
Sbjct: 247 SAMRETPWGVTGDRHVARYLKDGRLLIAFRDMAPDSPTKGHFVAWVGTYKDLIEGTSGQY 306

Query: 278 ALTLEENLA-MEFSYPAVIQASDGS-VHITY 306
            + L  + A  +  YP +    DG+ V ITY
Sbjct: 307 RIKLLHSYAGSDCGYPGLEVLPDGTIVAITY 337


>gi|358636892|dbj|BAL24189.1| hypothetical protein AZKH_1876 [Azoarcus sp. KH32C]
          Length = 325

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 84/203 (41%), Gaps = 38/203 (18%)

Query: 88  KLPSNGLLLFYKIGQEVQKWS----GCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-L 142
           K+P N  L       E   W+    G +  + + G TW+ +       +  S+++P+  +
Sbjct: 82  KVPVNATLTGVSFADERNGWAVGHWGVILHTEDGGETWTSQR------METSEDRPLFSV 135

Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
                  G++V  W    S +  T D G++W K      PN++           A+R L 
Sbjct: 136 HFTDAQHGTAVGLW----SLLLTTTDGGKTWDKVSLPAPPNDT----------RADRNLM 181

Query: 203 VLMRSFNG----IG-RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
            +     G    +G R  +  S D G TW+Y +PT      S   G  L DG LL+A   
Sbjct: 182 HVFADRRGDLFVVGERGTVLHSADRGQTWAY-RPTGY--RGSFWSGTALADGTLLVAG-- 236

Query: 258 VSRGVLKVALSKDDGDSWHDALT 280
             RG L    S D G++WH  +T
Sbjct: 237 -LRGSLY--RSSDGGETWHKVVT 256


>gi|83644164|ref|YP_432599.1| hypothetical protein HCH_01307 [Hahella chejuensis KCTC 2396]
 gi|83632207|gb|ABC28174.1| hypothetical protein HCH_01307 [Hahella chejuensis KCTC 2396]
          Length = 57

 Score = 39.3 bits (90), Expect = 2.6,   Method: Composition-based stats.
 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 281 LEENLAMEFSYPAVIQASDGSVHITYTYNRTQIKHV 316
           LEE  A+E+SYP    +  G +HI YT+ R  IKHV
Sbjct: 10  LEE--AVEYSYPGAYISRSGDLHIAYTWRRKTIKHV 43


>gi|288963051|ref|YP_003453330.1| hypothetical protein AZL_f00260 [Azospirillum sp. B510]
 gi|288915303|dbj|BAI76786.1| hypothetical protein AZL_f00260 [Azospirillum sp. B510]
          Length = 382

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 49/221 (22%), Positives = 81/221 (36%), Gaps = 24/221 (10%)

Query: 107 WSGCMKRSYNKGVTWSEREQLP-----PGIL--------GPSKNKPILLENGLLLCGSSV 153
           W G  + S + G +W+    LP     P I+        G  + + +   +G +L     
Sbjct: 147 WGGYSRHSDDGGRSWTPHRDLPPLPGHPDIVPGRRPFHGGAVRGRAVEAADGTILQTGYT 206

Query: 154 E--SWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH-TANRTLRVLMRSFNG 210
              S  ++ S +  + D G SW     I I  E+ +       H  A+  L    R+   
Sbjct: 207 HHPSTGTYASHLFASTDRGESWGHRAVIAIDPEAKAGFCETALHLDADGVLHAFHRTTGL 266

Query: 211 IGRVCMSESCDGGLTWS-YAKPTQLLNPNSGIDGVKLKDGRLLLA--YNTVSRGVLKVAL 267
              +  S S D G +W  + +   + +P    D   L DGRLLLA  Y     G+     
Sbjct: 267 DDHLATSRSTDLGHSWEPWRRHAVVGHP---YDACPLPDGRLLLAGGYRHAPYGIRARLY 323

Query: 268 S--KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
              +   D   + +  ++  A +  YP      DG   + Y
Sbjct: 324 DPRRQQPDEGTELVLRDDGPAPDLGYPWAAVLPDGRAMVAY 364


>gi|75766432|pdb|2BZD|A Chain A, Galactose Recognition By The Carbohydrate-Binding Module
           Of A Bacterial Sialidase.
 gi|75766433|pdb|2BZD|B Chain B, Galactose Recognition By The Carbohydrate-Binding Module
           Of A Bacterial Sialidase.
 gi|75766434|pdb|2BZD|C Chain C, Galactose Recognition By The Carbohydrate-Binding Module
           Of A Bacterial Sialidase
          Length = 601

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 76/182 (41%), Gaps = 31/182 (17%)

Query: 71  IIADEEPNVPMWNPVLFKLPSNGLLLFY--KIGQEVQKWS-----GCMKR----SYNKGV 119
           I AD  P+ P W    F     G+ L Y    G+ +Q+++     G  +     S + G 
Sbjct: 143 ITADITPD-PGWR-SRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVYSDDHGR 200

Query: 120 TWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPI 179
           TW   E +  G+     NK + L +G +L  +S +S  S    + V+ D G S   YGP+
Sbjct: 201 TWRAGEAVGVGM---DANKTVELSDGRVLL-NSRDSARSGYRKVAVSTDGGHS---YGPV 253

Query: 180 YI------PNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE-----SCDGGLTWSY 228
            I      P  + S+I+  P   A      ++   N   +   S+     SCD G TW  
Sbjct: 254 TIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPV 313

Query: 229 AK 230
           +K
Sbjct: 314 SK 315


>gi|149175555|ref|ZP_01854175.1| probable sialidase [Planctomyces maris DSM 8797]
 gi|148845540|gb|EDL59883.1| probable sialidase [Planctomyces maris DSM 8797]
          Length = 386

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 82/347 (23%), Positives = 132/347 (38%), Gaps = 66/347 (19%)

Query: 20  NSAP-FKSCHA--STIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKD------------ 64
           N+ P  KS HA   ++  +  G  L  Y  G    A D+K+ L   +D            
Sbjct: 14  NAKPHVKSVHAYFPSVASLPDGSLLSMYMLGEAFEAVDLKLHLSRSRDQGQTWEYQGSID 73

Query: 65  ----GRWQSPI--IADEEPNVPMWNPVLFK---LPSNGLLLFYKIGQEVQKWSGCMKRSY 115
               GR  S    +A  E    + N V F     P  GL     +G    +    + RS 
Sbjct: 74  TSVTGRQTSTFGRLAVTETGELIANLVRFDRTDFPGEGLCNPQTLGMVPSEL--LIIRSL 131

Query: 116 NKGVTWSEREQLPPGILGPSKN--KPI-LLENGLLLCGSSVESWNSWGSWME-------- 164
           ++G TWSE   + P + GP      P+ +L +G  +  +S  +W  W   +         
Sbjct: 132 DQGKTWSEPTLVTPPLEGPEFEMCSPLTILNDGRWIWPTS--TWRDWNGALPNGNRMLAL 189

Query: 165 VTVDAGRSWRKYGPI-YIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSE----S 219
           V+ D G SW +Y  I +  N +L   +          L  +   ++      +      S
Sbjct: 190 VSRDQGSSWDEYLDIMHSENNNLIFWESKVVEYPEGRLLAVAWCYDEASSTDLPNHYAIS 249

Query: 220 CDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSK-DDGDSW--- 275
            DGG +WS    TQ+      +    L+DG LL  Y  + +  L   LS+ DD  +W   
Sbjct: 250 DDGGTSWSEPASTQI--QGQTLTPCLLEDGSLLNIYRRMDQPGLWACLSRLDDAGNWING 307

Query: 276 -------HDAL--------TLEENLA-MEFSYPAVIQASDGSVHITY 306
                  H+ L         + +N A ++F  P + + SD  + +T+
Sbjct: 308 DQQPLWGHNQLEGITATGENMSDNFAVLKFGAPHITRLSDSELFVTF 354


>gi|424513260|emb|CCO66844.1| unknown protein [Bathycoccus prasinos]
          Length = 847

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 63/156 (40%), Gaps = 25/156 (16%)

Query: 165 VTVDAGRSWRKYGPIYIPNESL-----SVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
           ++ D G++W K G I  P  SL     + I  +P  +      + M      G V  S S
Sbjct: 487 ISSDNGKTWTKKGDIRHPKTSLLEGAIAEISALPQGSDEPQPALTMLFRTDCGCVFRSMS 546

Query: 220 CDGGLTWS--YAKPTQLLNPNSGIDGVKL-KDGRLLLAYNT---------------VSRG 261
            D G TWS  +  P  L N N+ +D   L   G L +A+N                + R 
Sbjct: 547 FDFGATWSDPHPSPGNLPNVNTKLDMTSLYPTGHLAMAFNNHRRWEHIGSKEHPCKLCRT 606

Query: 262 VLKVALSKDDGDSWHDALTLEENLA--MEFSYPAVI 295
            L  ALS + G+ W      ++ +   +   YP+++
Sbjct: 607 HLHFALSSNGGEDWTTVAAFDDLVQPDVRIHYPSIL 642


>gi|383119129|ref|ZP_09939868.1| hypothetical protein BSHG_1859 [Bacteroides sp. 3_2_5]
 gi|251946348|gb|EES86725.1| hypothetical protein BSHG_1859 [Bacteroides sp. 3_2_5]
          Length = 567

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 38/166 (22%), Positives = 66/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           +GRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 NGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|343522379|ref|ZP_08759345.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 175 str.
           F0384]
 gi|343401788|gb|EGV14294.1| BNR/Asp-box repeat protein [Actinomyces sp. oral taxon 175 str.
           F0384]
          Length = 771

 Score = 38.5 bits (88), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 94/247 (38%), Gaps = 48/247 (19%)

Query: 114 SYNKGVTWSEREQLPPGI-------LGPSKNKPILLENG-------LLLCGSSVESWNSW 159
           S + G+TWS    L P +       LG      I L N        + L  ++ ++W + 
Sbjct: 510 SDDDGLTWSRPRDLNPELKQPWMRFLGTCPGNGISLRNAPHAGRLIVPLYFNNDQNWLAM 569

Query: 160 GSWMEVTVDAGRSWR--------------KYGPIYIPNESLSVIQPVPFHTANRTLRVLM 205
            + +  + D G +W+              +  P    +E+ S+ +       +  L + M
Sbjct: 570 CATVAYSDDHGETWQLGHGPNEGRQTPQGELDPQTFVDETWSLHEAAVVERHDGVLVLFM 629

Query: 206 RSFNGIGRVCMSESCDGGLTW-----SYAKPTQLLNPNS-------GIDGVKLKDGRLLL 253
           R+ +  GRV +SES D G TW       A P     PN+       G D +   +  L+L
Sbjct: 630 RNQHPRGRVAVSESHDAGQTWGPIRFDEALPEIWCQPNAISLPSPEGEDRIVFANASLML 689

Query: 254 AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYTYNRTQI 313
            +    RG   + LS D G SW  + T        + Y  +    DG + I +  N  Q 
Sbjct: 690 PF----RGCGVIRLSLDGGRSWKYSRTFNPG---HYVYQCMCVLPDGRIGILWE-NECQG 741

Query: 314 KHVVLLP 320
            +   LP
Sbjct: 742 LYFSRLP 748


>gi|201067786|gb|ACH92891.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAADDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|418935311|ref|ZP_13489091.1| glycosyl hydrolase protein, partial [Rhizobium sp. PDO1-076]
 gi|375057995|gb|EHS54139.1| glycosyl hydrolase protein, partial [Rhizobium sp. PDO1-076]
          Length = 200

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 21/80 (26%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 21  SAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDG--RWQSPIIADEEPN 78
           ++P    HA+ +  +  G     +FGG+ EG  D+ I++     G   W  P    E+  
Sbjct: 38  ASPTVQNHAANLAFLADGTLTCVWFGGTMEGMGDISIYMSRLAPGATAWSVPEKMTEDVT 97

Query: 79  VPMWNPVLFKLPSNGLLLFY 98
               NP++F  P     L Y
Sbjct: 98  KSEQNPIVFNAPDGKAWLIY 117


>gi|375359189|ref|YP_005111961.1| hypothetical protein BF638R_2942 [Bacteroides fragilis 638R]
 gi|301163870|emb|CBW23425.1| putative exported protein [Bacteroides fragilis 638R]
          Length = 567

 Score = 38.5 bits (88), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 38/166 (22%), Positives = 65/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  D G TW  ++P+      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DDGQTWGESEPSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|201067830|gb|ACH92913.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 10/107 (9%)

Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
           H      +  +R+  G  +     S D G TW   +PT      +G+D    V+L DG L
Sbjct: 193 HAGRLVQQYTIRTAGGAVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247

Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           +L +  + S G  KVA S D G +W + ++ ++NL        +I+A
Sbjct: 248 MLNSRASDSSGFRKVARSTDGGQTWSEPVS-DQNLPDSVDNAQIIRA 293


>gi|376289488|ref|YP_005161735.1| hypothetical protein CDC7B_0282 [Corynebacterium diphtheriae C7
           (beta)]
 gi|372102885|gb|AEX66482.1| hypothetical protein CDC7B_0282 [Corynebacterium diphtheriae C7
           (beta)]
          Length = 678

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
           M  S + G+TWS    L P +        G S    I L+     G LL        + +
Sbjct: 452 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 511

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
            +    + D G +W +       +  L + +      ++ ++ +  R+ +  G V  + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566

Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
            DGG +WS           +++P  L   N   DG+   +   LL +    RGVL+  +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620

Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            +DG SW             + Y  + Q  DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651


>gi|375292226|ref|YP_005126765.1| hypothetical protein CDB402_0252 [Corynebacterium diphtheriae INCA
           402]
 gi|371581897|gb|AEX45563.1| hypothetical protein CDB402_0252 [Corynebacterium diphtheriae INCA
           402]
          Length = 678

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
           M  S + G+TWS    L P +        G S    I L+     G LL        + +
Sbjct: 452 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 511

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
            +    + D G +W +       +  L + +      ++ ++ +  R+ +  G V  + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566

Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
            DGG +WS           +++P  L   N   DG+   +   LL +    RGVL+  +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620

Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            +DG SW             + Y  + Q  DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651


>gi|239919163|gb|ACS34894.1| sialidase [Corynebacterium diphtheriae]
          Length = 687

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 49/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLEN----GLLLCGSSVESWNSW 159
           M  S + G+TWS    L P +        G S    I L+     G LL        + +
Sbjct: 461 MLTSDDDGLTWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRDTHAGRLLIPMYYTKGDQF 520

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
            +    + D G +W +       +  L + +      ++ ++ +  R+ +  G V  + S
Sbjct: 521 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 575

Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
            DGG +WS           +++P  L   N   DG+   +   LL +    RGVL+  +S
Sbjct: 576 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 629

Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            +DG SW             + Y  + Q  DGS+
Sbjct: 630 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 660


>gi|201067846|gb|ACH92921.1| sialidase [Actinomyces oris]
 gi|201067850|gb|ACH92923.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 10/107 (9%)

Query: 195 HTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDG---VKLKDGRL 251
           H      +  +R+  G  +     S D G TW   +PT      +G+D    V+L DG L
Sbjct: 193 HAGRLVQQYTIRTAGGAVQAVSVYSDDHGKTWQAGQPT-----GTGMDENKVVELSDGSL 247

Query: 252 LL-AYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQA 297
           +L +  +   G  KVA S D G +W + ++ +ENL        +I+A
Sbjct: 248 MLNSRASDGSGFRKVARSTDGGQTWSEPVS-DENLPDSVDNAQIIRA 293


>gi|201067718|gb|ACH92857.1| sialidase [Actinomyces oris]
 gi|201067720|gb|ACH92858.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067848|gb|ACH92922.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGQPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
              P Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVPDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWVASKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067840|gb|ACH92918.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDKPEIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GPSWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067752|gb|ACH92874.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|443722941|gb|ELU11586.1| hypothetical protein CAPTEDRAFT_201074 [Capitella teleta]
          Length = 375

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 60/225 (26%), Positives = 95/225 (42%), Gaps = 21/225 (9%)

Query: 96  LFYKIGQEVQKWSGCMKRSYNKGVTWSEREQLPPG--ILGP--------SKNKPILLENG 145
           L + I  + ++    M  S + G TWS+RE LP    I GP        S  K  L+  G
Sbjct: 134 LAHNISNKARQHMPVMITSGDGGQTWSDREVLPFKGLIFGPGHGIQKQQSPAKGRLIIPG 193

Query: 146 LLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNES-LSVIQPVPFHTANRTLRVL 204
            L     VE   +    +  + D G +W+   P+   +E+ ++ +     +   R    L
Sbjct: 194 YL--NQKVEGRANQAVVL-FSDDHGNTWQTSEPLNTGDETAIAELADGRIYMNTRQHAAL 250

Query: 205 MRSFNGIGRVCMSESCDGGLTWSYA-KPTQLLNPNSGIDGVKLKDGRLLLA--YNTVSRG 261
            R+    GR  +S S DGG+ W  A K   L  P       +  D  LL A   +T +R 
Sbjct: 251 GRAPAPNGR-WVSTSSDGGVNWLAAEKDLALHTPVCQGSVEQYGDAGLLFANPASTRARV 309

Query: 262 VLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITY 306
            + V  S+D G SW +++T+    A    Y  +   S+G V + +
Sbjct: 310 NMTVRYSEDQGSSWKNSVTVYPGAA---GYSDLGALSNGDVAVLF 351


>gi|201067838|gb|ACH92917.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 66/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++WR   P+    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWRAGTPVGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAK-----PTQLLN-------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                   P  + N       PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDKNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWMTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067828|gb|ACH92912.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPDPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067748|gb|ACH92872.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 66/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   +S S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVSHSTDGGQTWS 273

Query: 228 YAK-----PTQLLN-------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                   P  + N       PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDKNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067738|gb|ACH92867.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|449135021|ref|ZP_21770484.1| Exo-alpha-sialidase [Rhodopirellula europaea 6C]
 gi|448886316|gb|EMB16724.1| Exo-alpha-sialidase [Rhodopirellula europaea 6C]
          Length = 409

 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 56/262 (21%), Positives = 111/262 (42%), Gaps = 41/262 (15%)

Query: 91  SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
           ++G LL +  G++    S     MKRS + G TWS+   +   G L  +  + ++L++G 
Sbjct: 74  NDGTLLAFAEGRQGGDHSDNDMIMKRSTDNGSTWSDVVLMNDQGKLSLNNPQAVVLDSGR 133

Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFH 195
           +L             S      S+ ++++ + D G++W     +   ++ +   + V   
Sbjct: 134 VLLMYQRSKLGEHKASDGFGAESYFTFVQTSDDDGQTWSAPHDV---SKQVKREKNVTSV 190

Query: 196 TANRTLRVLMRSFNGIGRVCM--------------SESCDGGLTWSYAK--PTQLLNPNS 239
            A   + +++R+ +  GR+ M              + S D G TW+     P       +
Sbjct: 191 AAGPGIGIILRNGSHKGRIIMPFNQGPYNDWRVYAAYSDDDGQTWNIGDVAPGDGKGHAN 250

Query: 240 GIDGVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ 296
            +  V+L DGR++L   T  +G     KV +S D G +W   L +++ L       ++++
Sbjct: 251 EVQMVELADGRVMLNARTQGKGSTKHRKVTVSDDGGQTWS-KLQIDQTLIEPTCQASILR 309

Query: 297 ---ASDGSVHITYTYNRTQIKH 315
               S+G   I +    +Q K 
Sbjct: 310 YSWPSEGKSRILFANPASQTKR 331


>gi|201067774|gb|ACH92885.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067722|gb|ACH92859.1| sialidase [Actinomyces oris]
 gi|201067728|gb|ACH92862.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067652|gb|ACH92824.1| sialidase [Actinomyces oris]
          Length = 367

 Score = 37.7 bits (86), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067784|gb|ACH92890.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067756|gb|ACH92876.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067768|gb|ACH92882.1| sialidase [Actinomyces oris]
 gi|201067780|gb|ACH92888.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067714|gb|ACH92855.1| sialidase [Actinomyces oris]
 gi|201067842|gb|ACH92919.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067750|gb|ACH92873.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067672|gb|ACH92834.1| sialidase [Actinomyces naeslundii]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
             +P    N    +D  ++          D R   LLL+++      SR    +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    EN      Y  +   SDGS+
Sbjct: 332 GASWATSKVFNENF---VGYTTIAVQSDGSI 359


>gi|201067770|gb|ACH92883.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067844|gb|ACH92920.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067754|gb|ACH92875.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067794|gb|ACH92895.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067674|gb|ACH92835.1| sialidase [Actinomyces naeslundii]
          Length = 359

 Score = 37.7 bits (86), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 66/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
             +P    N    +D  ++          D R   LLL+++      SR    +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    EN      Y  +   SDGS+
Sbjct: 332 GASWATSKVFNENF---VGYTTIAVQSDGSI 359


>gi|374375918|ref|ZP_09633576.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
           19437]
 gi|373232758|gb|EHP52553.1| glycosyl hydrolase BNR repeat-containing protein [Niabella soli DSM
           19437]
          Length = 390

 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 67/266 (25%), Positives = 102/266 (38%), Gaps = 59/266 (22%)

Query: 94  LLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQ----LPPGILGPSKNKPILL--ENGLL 147
           LLL+  +G  +   +GC  R        S +EQ      P  +  S   P L+  + G L
Sbjct: 14  LLLWVTLGWLLLIMNGCSAR------LQSAKEQTTTVFAPDAVYKSTRIPALVYTKRGSL 67

Query: 148 L--CGSSVESWNSWGSW---MEVTVDAGRSWRKYGPIYIPNESLS--VIQPVPFHTANRT 200
           L  C   + + + W      M  ++D GRSW    PI I +      V  P P   AN  
Sbjct: 68  LAFCEGRIGTASDWADMNLIMRRSIDGGRSW---APIVIIDSMKGGPVGNPTPIVDANGI 124

Query: 201 LRVLMRSFNGIGRVCMSESCDGGLTWSYAKPT---------------QLLNPNSGIDGVK 245
           + +L +     G    SE  D G TW  AKP                ++L P  G  G++
Sbjct: 125 IHLLYQRDYAEGFYTFSE--DDGKTW--AKPVNITATYNAFRPEYNWKVLAPGPG-HGIQ 179

Query: 246 LKDGRLLLAY-----------NTVSRGVLKVALSKDDGDSWHDALTLEENLAM----EFS 290
           L++GRLL A             + S   +    S D G +W     + ++ A       S
Sbjct: 180 LQNGRLLSAVWLANSSKTKPRRSHSPSCVATIYSDDHGKTWKRGAIIADSSAAVANPNES 239

Query: 291 YPAVIQASDGSVHITYTYNRTQIKHV 316
            P  +Q  DG+V ++      Q++  
Sbjct: 240 MP--VQLEDGTVLMSIRNPSKQLRRA 263


>gi|225155223|ref|ZP_03723717.1| Exo-alpha-sialidase [Diplosphaera colitermitum TAV2]
 gi|224803978|gb|EEG22207.1| Exo-alpha-sialidase [Diplosphaera colitermitum TAV2]
          Length = 374

 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 107/266 (40%), Gaps = 53/266 (19%)

Query: 81  MWNPVLFKLPS---------------NGLLLFYKIGQEVQ-KWSGC---MKRSYNKGVTW 121
           +W   LF  P+               NG+LL +  G++ +  W+      + S + G TW
Sbjct: 9   IWKTTLFSAPALGYALTRIPALVCTKNGVLLAFCEGRDDKSDWADINIIYRHSTDGGETW 68

Query: 122 SEREQLPPGILGPSKNK-PILLENGLL--LCGSSVESWNSWGSWMEVTVDAGRSWRKYGP 178
           S+   +  G  GP+ N  PI  ++G +  +C  + +       +   + D G +W +   
Sbjct: 69  SDARIILDGEGGPASNSTPIAGKDGTVHFICQKNYKD-----VYYYRSTDDGLTWSEPRD 123

Query: 179 I------YIPNESLSVIQPVPFHTAN-RTLRVLMRSF----------NGIGR-VCMS--E 218
           I      Y       VI P P H    +  R+L+  +           G  R  C+S   
Sbjct: 124 ITSVFDHYKAEYDWEVIAPGPGHAIELKNGRLLVPVWLCDPAGPALVGGDHRPSCVSTIA 183

Query: 219 SCDGGLTWS----YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLK-VALSKDDGD 273
           S DGGLTW      A  T ++   S     +L DGR+++   + S+G  + V++S D   
Sbjct: 184 SDDGGLTWKRGDIIANTTDIIRNPSECTVAELSDGRVMINIRSESKGYRRLVSISADGTT 243

Query: 274 SWHDALTLEENLAMEFSYPAVIQASD 299
            W   +  +E L       + + A+D
Sbjct: 244 GWTKPV-FDEALYDPICMASFLSATD 268


>gi|201067798|gb|ACH92897.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.7 bits (86), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|344204493|ref|YP_004789636.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343956415|gb|AEM72214.1| hypothetical protein Murru_3194 [Muricauda ruestringensis DSM
           13258]
          Length = 870

 Score = 37.7 bits (86), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 47/180 (26%), Positives = 78/180 (43%), Gaps = 34/180 (18%)

Query: 112 KRSY----NKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTV 167
            RSY    + G TWSE   L P     S  K +L+ NG ++  S+  ++ S        +
Sbjct: 251 SRSYFYSNDDGATWSEVLVLSPD--NYSILKSLLISNGKIIVSSTSGNYES--------L 300

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G SW  +    I  ++++      ++            F G G   +  S D G++W+
Sbjct: 301 DNGMSW-TWNNTGIRTQNITTFASNDYYL-----------FTGTGGNGLFRSADNGVSWT 348

Query: 228 YAKPTQLLNPNS-GIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLA 286
            A  + +  PN+  I  +   D +L++A    S GV    LS D+GD+W   +    N+A
Sbjct: 349 -AINSGIDRPNAFTIHEMIEVDNKLIIA---TSGGVF---LSNDNGDTWEKKINPSVNIA 401


>gi|383317755|ref|YP_005378597.1| outer membrane receptor protein [Frateuria aurantia DSM 6220]
 gi|379044859|gb|AFC86915.1| outer membrane receptor protein [Frateuria aurantia DSM 6220]
          Length = 809

 Score = 37.7 bits (86), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 51/121 (42%), Gaps = 11/121 (9%)

Query: 156 WNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTL-RVLMRSFNGIGRV 214
           W  +GS +   VDA  S          N+ +S      F T   +L RV MR  N    V
Sbjct: 591 WRFYGSRVSAVVDAYAS-------NFSNKQISGYDQNTFQTVYLSLGRVHMRGLNTELNV 643

Query: 215 CMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDS 274
            +S+      ++SY +     N NSG DG+    G+ LL      R +L V +S   G +
Sbjct: 644 KLSQDWSIYGSYSYTQAIMQDNLNSGGDGIYYTRGKTLL---NTPRNLLNVGVSFHHGPA 700

Query: 275 W 275
           W
Sbjct: 701 W 701


>gi|358640105|dbj|BAL27401.1| hypothetical protein AZKH_p0518 [Azoarcus sp. KH32C]
          Length = 321

 Score = 37.4 bits (85), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 81/203 (39%), Gaps = 38/203 (18%)

Query: 88  KLPSNGLLLFYKIGQEVQKWS----GCMKRSYNKGVTWSEREQLPPGILGPSKNKPIL-L 142
           K+P N  L       E   W+    G +  + + G TW+ +       +  S+++P+  +
Sbjct: 78  KVPVNATLTGVSFADERNGWAVGHWGVILHTDDGGETWTRQR------VETSEDRPLFSV 131

Query: 143 ENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLR 202
                  G++V  W    S +  TVD G++W +      PN++           A+R L 
Sbjct: 132 HFTDAQHGTAVGLW----SLLLRTVDGGKTWDRVSLPPPPNDT----------RADRNLM 177

Query: 203 VLMRSFNG----IG-RVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNT 257
            +     G    +G R  +  S D G TWSY          S   G  L DG LL+A   
Sbjct: 178 HVFADRRGELFVVGERGTVLHSNDHGQTWSYQATGY---RGSFWSGTSLSDGTLLIAG-- 232

Query: 258 VSRGVLKVALSKDDGDSWHDALT 280
             RG L    S D GDSWH   T
Sbjct: 233 -LRGSLY--RSTDGGDSWHKVAT 252


>gi|32472463|ref|NP_865457.1| sialidase [precursor] [Rhodopirellula baltica SH 1]
 gi|32443699|emb|CAD73141.1| Sialidase [Precursor] [Rhodopirellula baltica SH 1]
          Length = 409

 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 59/259 (22%), Positives = 108/259 (41%), Gaps = 35/259 (13%)

Query: 91  SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
           ++G LL +  G++    S     MKRS + G TWS+   L   G L  +  + ++L++G 
Sbjct: 74  NHGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGKLSLNNPQAVVLQSGR 133

Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
           +L             S     NS+ ++++ + D G++W   R         + ++ +   
Sbjct: 134 ILLMYQRSKLGEHKASDGYGPNSYFTFVQTSDDDGQTWAAPRDVSKQVKREKKVTSVAAG 193

Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
           P             R++M    G     RV  + S D G TW+     P       + + 
Sbjct: 194 PGIGIVLQNGPHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253

Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENLAMEFSYPAVIQ--- 296
            V+L DGR++L   T  +G     KVA+S + G +W   L +++ L       ++++   
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQMDQTLIEPTCQASILRYSW 312

Query: 297 ASDGSVHITYTYNRTQIKH 315
            S G   I ++   +Q K 
Sbjct: 313 PSKGKSRIIFSNPASQTKR 331


>gi|417303698|ref|ZP_12090746.1| Exo-alpha-sialidase [Rhodopirellula baltica WH47]
 gi|327540032|gb|EGF26628.1| Exo-alpha-sialidase [Rhodopirellula baltica WH47]
          Length = 409

 Score = 37.4 bits (85), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 54/226 (23%), Positives = 95/226 (42%), Gaps = 32/226 (14%)

Query: 91  SNGLLLFYKIGQEVQKWSG---CMKRSYNKGVTWSEREQL-PPGILGPSKNKPILLENGL 146
           +NG LL +  G++    S     MKRS + G TWS+   L   G L  +  + ++L++G 
Sbjct: 74  NNGTLLAFAEGRQGGDHSENDLIMKRSTDNGSTWSDVVLLNDQGQLSLNNPQAVVLQSGR 133

Query: 147 LLC-----------GSSVESWNSWGSWMEVTVDAGRSW---RKYGPIYIPNESLSVIQPV 192
           ++             S     NS+ + ++ + D G++W   R         + ++ +   
Sbjct: 134 IVLMYQRSKLGEHKASDGYGPNSYFTLVQTSDDDGQTWSAPRDVSKQVKREKKVTSVAAG 193

Query: 193 P-----FHTANRTLRVLMRSFNGIG---RVCMSESCDGGLTWSYAK--PTQLLNPNSGID 242
           P             R++M    G     RV  + S D G TW+     P       + + 
Sbjct: 194 PGIGIVLQNGRHKGRIIMPFNQGPYNNWRVYAAYSDDDGQTWNMGDVAPGDGKGHANEVQ 253

Query: 243 GVKLKDGRLLLAYNTVSRGVL---KVALSKDDGDSWHDALTLEENL 285
            V+L DGR++L   T  +G     KVA+S + G +W   L +++ L
Sbjct: 254 MVELSDGRVMLNARTQGKGSTKHRKVAVSDNGGQTWSQ-LQIDQTL 298


>gi|201067730|gb|ACH92863.1| sialidase [Actinomyces oris]
 gi|201067836|gb|ACH92916.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.4 bits (85), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGQTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPRPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|423250701|ref|ZP_17231716.1| hypothetical protein HMPREF1066_02726 [Bacteroides fragilis
           CL03T00C08]
 gi|423254027|ref|ZP_17234957.1| hypothetical protein HMPREF1067_01601 [Bacteroides fragilis
           CL03T12C07]
 gi|392651658|gb|EIY45320.1| hypothetical protein HMPREF1066_02726 [Bacteroides fragilis
           CL03T00C08]
 gi|392654585|gb|EIY48232.1| hypothetical protein HMPREF1067_01601 [Bacteroides fragilis
           CL03T12C07]
          Length = 567

 Score = 37.4 bits (85), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 38/166 (22%), Positives = 65/166 (39%), Gaps = 31/166 (18%)

Query: 139 PILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNE-----------SLS 187
           P+ + NG  +  ++       G +   + D G +W+    +  P+            +  
Sbjct: 143 PVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCSNTVTSPDHQGGGFHKGIRWNHG 202

Query: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247
            ++P      + TL +LMR+         S+  DGG TW  ++ +      +     +L 
Sbjct: 203 AVEPTVVELKDGTLWMLMRTSQDFHYQAFSK--DGGQTWGESETSPFYGTITMPTLGRLA 260

Query: 248 DGRLLLAY-NTV-----------------SRGVLKVALSKDDGDSW 275
           DGRLLL + NT                  +R V  VA+S DDG +W
Sbjct: 261 DGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSDDDGKTW 306


>gi|201067806|gb|ACH92901.1| sialidase [Actinomyces oris]
          Length = 359

 Score = 37.4 bits (85), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGQTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 273

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 274 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 332 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 359


>gi|201067658|gb|ACH92827.1| sialidase [Actinomyces naeslundii]
 gi|201067662|gb|ACH92829.1| sialidase [Actinomyces naeslundii]
 gi|201067668|gb|ACH92832.1| sialidase [Actinomyces naeslundii]
 gi|201067678|gb|ACH92837.1| sialidase [Actinomyces naeslundii]
 gi|201067702|gb|ACH92849.1| sialidase [Actinomyces naeslundii]
 gi|201067708|gb|ACH92852.1| sialidase [Actinomyces naeslundii]
 gi|201067712|gb|ACH92854.1| sialidase [Actinomyces naeslundii]
          Length = 359

 Score = 37.4 bits (85), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
             +P    N    +D  ++          D R   LLL+++      SR    +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW       EN      Y  +   SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359


>gi|201067692|gb|ACH92844.1| sialidase [Actinomyces naeslundii]
          Length = 359

 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
             +P    N    +D  ++          D R   LLL+++      SR    +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW       EN      Y  +   SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359


>gi|201067650|gb|ACH92823.1| sialidase [Actinomyces naeslundii]
          Length = 376

 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   P     +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 219 DHGKTWQAGTPTGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVARSTDGGQTWS 273

Query: 228 YAKPTQLLNPNSGIDGVKL---------KDGR---LLLAYNTV----SRGVLKVALSKDD 271
             +P    N    +D  ++          D R   LLL+++      SR    +++S DD
Sbjct: 274 --EPASDKNLPDSVDNAQIIRAFPNAAPSDPRAKVLLLSHSPNPKPWSRDRGTISMSCDD 331

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW       EN      Y  +   SDGS+
Sbjct: 332 GASWATGKVFNENF---VGYTTIAVQSDGSI 359


>gi|376283834|ref|YP_005157044.1| hypothetical protein CD31A_0333 [Corynebacterium diphtheriae 31A]
 gi|371577350|gb|AEX41018.1| hypothetical protein CD31A_0333 [Corynebacterium diphtheriae 31A]
          Length = 678

 Score = 37.4 bits (85), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 48/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENGL----LLCGSSVESWNSW 159
           M  S + G++WS    L P +        G S    I L+ G     LL        + +
Sbjct: 452 MLTSDDDGLSWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRGTHAGRLLIPMYYTKGDQF 511

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
            +    + D G +W +       +  L + +      ++ ++ +  R+ +  G V  + S
Sbjct: 512 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 566

Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
            DGG +WS           +++P  L   N   DG+   +   LL +    RGVL+  +S
Sbjct: 567 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 620

Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            +DG SW             + Y  + Q  DGS+
Sbjct: 621 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 651


>gi|325067903|ref|ZP_08126576.1| BNR/Asp-box repeat protein [Actinomyces oris K20]
          Length = 685

 Score = 37.4 bits (85), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 65/151 (43%), Gaps = 26/151 (17%)

Query: 168 DAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWS 227
           D G++W+   PI    +   V++      ++ +L +  R+ +G G   ++ S DGG TWS
Sbjct: 491 DHGKTWQAGTPIGTGMDENKVVE-----LSDGSLMLNSRASDGSGFRKVAHSTDGGQTWS 545

Query: 228 YAKPTQLLN------------PNSGIDGVKLKDGRLLLAYNTV----SRGVLKVALSKDD 271
                Q L             PN+  D  + K   LLL+++      SR    +++S DD
Sbjct: 546 EPVSDQNLPDSVDNAQIIRAFPNAAPDDPRAK--VLLLSHSPNPKPWSRDRGTISMSCDD 603

Query: 272 GDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
           G SW  +    E       Y  +   SDGS+
Sbjct: 604 GASWTTSKVFHEPF---VGYTTIAVQSDGSI 631


>gi|376247613|ref|YP_005139557.1| hypothetical protein CDHC04_0246 [Corynebacterium diphtheriae HC04]
 gi|376250431|ref|YP_005137312.1| hypothetical protein CDHC03_0263 [Corynebacterium diphtheriae HC03]
 gi|372111935|gb|AEX77994.1| hypothetical protein CDHC03_0263 [Corynebacterium diphtheriae HC03]
 gi|372114181|gb|AEX80239.1| hypothetical protein CDHC04_0246 [Corynebacterium diphtheriae HC04]
          Length = 687

 Score = 37.4 bits (85), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 48/214 (22%), Positives = 83/214 (38%), Gaps = 36/214 (16%)

Query: 111 MKRSYNKGVTWSEREQLPPGI-------LGPSKNKPILLENGL----LLCGSSVESWNSW 159
           M  S + G++WS    L P +        G S    I L+ G     LL        + +
Sbjct: 461 MLTSDDDGLSWSTPRDLTPDVKEEWMRFCGTSPGNGIQLQRGTHAGRLLIPMYYTKGDQF 520

Query: 160 GSWMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSES 219
            +    + D G +W +       +  L + +      ++ ++ +  R+ +  G V  + S
Sbjct: 521 FAAALYSDDHGDTWHRG-----ESPDLEMYESTLVEASDGSVLLWARNQHPSGAVAHARS 575

Query: 220 CDGGLTWS-----------YAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALS 268
            DGG +WS           +++P  L   N   DG+   +   LL +    RGVL+  +S
Sbjct: 576 TDGGHSWSQVTFNPNVPEIFSQPNALRVTNP--DGIIFANASQLLPFR--GRGVLR--MS 629

Query: 269 KDDGDSWHDALTLEENLAMEFSYPAVIQASDGSV 302
            +DG SW             + Y  + Q  DGS+
Sbjct: 630 TNDGASWQHNRVFNPR---HYVYQCMTQLPDGSI 660


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.433 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,754,777,526
Number of Sequences: 23463169
Number of extensions: 255997723
Number of successful extensions: 474583
Number of sequences better than 100.0: 951
Number of HSP's better than 100.0 without gapping: 553
Number of HSP's successfully gapped in prelim test: 398
Number of HSP's that attempted gapping in prelim test: 470861
Number of HSP's gapped (non-prelim): 1845
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)